pygooglenews vs oldweb-today

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

pygooglenews		oldweb-today
	Project
8	Mentions	22
1,234	Stars	234
-	Growth	0.4%
0.0	Activity	4.2
6 months ago	Latest Commit	4 months ago
Python	Language	JavaScript
MIT License	License	GNU Affero General Public License v3.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

pygooglenews

Posts with mentions or reviews of pygooglenews. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-11-12.

Tips for Making a Popular Open-Source Project in 2021 [Ultimate Guide]
12 projects | news.ycombinator.com | 12 Nov 2021

I have ~4k start in 2 Python libraries. Both help fetch live news articles. Links below.
These were my first libraries.
I took the approach of promoting them as any other product. You have to "sell" your code. Even if it's 100% free.
In my opinion, the most important thing is DEMO. Just make a GIF where you showcase what your software does:
* 80% of engineers won't even bother to read the description
No one will spend their precious time trying to get through your code.
[0] https://github.com/kotartemiy/newscatcher Programmatically collect normalized news from (almost) any website.
[1] https://github.com/kotartemiy/pygooglenews If Google News had a Python library
NLP beginner dataset for text classification, sentiment analysis and/or NER
1 project | /r/datasets | 11 Oct 2021

I wrote a pygooglenews package for news mining out of google news.
68k.news: A Netscape 1.1 makeover of Google News
4 projects | news.ycombinator.com | 29 Mar 2021

I'm curious where the data get fetched from. The Author mentions that Mozilla Readability and SimplePie are used.
Readability to parse the content. SimplePie to fetch the data (I assume). Dat from RSS feeds?
In case you want to make something similar, I recently wrote a blog on where you could get news data for free [1]
(self-promo) I'd recommend to take a look at my Python package to mine news data from Google News [2]. Also, in 3 days we're releasing an absolutely free News API [3] that will support ~50-100k top stories per day.
[1] https://blog.newscatcherapi.com/an-ultimate-list-of-open-sou...
[2] https://github.com/kotartemiy/pygooglenews
[3] https://newscatcherapi.com/free-news-api
Available data set for news headlines or articles over 2019-2020?
2 projects | /r/datasets | 26 Mar 2021

I wrote a python package to scrape google news headlines at scale: https://github.com/kotartemiy/pygooglenews
A Happy and lightweight Python Package that searches Google News and returns a usable JSON response.
1 project | /r/django | 19 Mar 2021
Write libraries instead of services, where possible
9 projects | news.ycombinator.com | 9 Mar 2021

Write libraries AND services, where it makes sense.
I wrote a Python library to scrape google news [0]
We also have it as a service [1]
Want to know why? Because devs who can't pay won't pay. Businesses who can pay will rather pay for a service (API in our case), and not care about maintaining it.
[0] https://github.com/kotartemiy/pygooglenews
[1] https://newscatcherapi.com/google-news-api
Financial news
1 project | /r/algotrading | 2 Mar 2021

You can use this Python project to scrape Google News (https://github.com/kotartemiy/pygooglenews) If you look at the code you can get a feel of how to call Google News directly.
Interview brownie
2 projects | dev.to | 10 Jan 2021

https://github.com/kotartemiy/pygooglenews https://pypi.org/project/google-play-scraper/

oldweb-today

Posts with mentions or reviews of oldweb-today. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-03.

Ask HN: What products other than Obsidian share the file over app philosophy?
6 projects | news.ycombinator.com | 3 Apr 2024

There are flat-file CMSes (content management systems) like Grav: https://getgrav.org/
I guess, in some vague/broad sense, config-as-code systems also implement something similar? Maybe even OpenAPI schemas could count to some degree...?
In the old days, the "semantic web" movement was an attempt to make more webpages both human- and machine-readable indefinitely by tagging them with proper schema: https://en.wikipedia.org/wiki/Resource_Description_Framework. Even Google was on board for a while, but I guess it never saw much uptake. As far as I can tell it's basically dead now, both because of non-semantic HTML (everything as a React div), general laziness, and LLMs being able to parse things loosely.
-------------
Side thoughts...
Philosophically, I don't know that capturing raw data alone as files is really sufficient to capture the nuances of any particular experience, or the overall zeitgeist of an era. You can archive Geocities pages, but that doesn't really capture the novelty and indie-ness of that era. Similarly, you can save TikTok videos, but absent the cultural environment that created them (and a faithful recreation of the recommendation algorithm), they wouldn't really show future archaeologists how teenagers today lived.
I worked for a natural history museum for a while, and while we were there, one of the interesting questions (well, to me anyway) was whether our web content was in and of itself worth preserving as a cultural artifact -- both so that future generations can see what exhibits were interesting/apropos for the cultures of our times, but also so they could see how our generation found out about those exhibitions to begin with (who knows what the Web will morph into 50 years later). It wasn't enough to simply save the HTML of our web pages, both because they tie into various other APIs and databases (like zoological collections) and because some were interactive experiences, like games designed to be played with a mouse (before phones were popular), or phone chatbots with some of our specimens. To really capture the experience authentically would've required emulating not just our tech stacks and devices, among other things.
Like for the earlier Geocities example, sure you could just save the old HTML and render it with a modern browser, but that's not the same as something like https://oldweb.today/?browser=ns3-mac#http://geocities.com/ , which emulates the whole OS and browser too. And that still isn't the same as having to sit in front of a tiny CRT and wait minutes for everything to download over a 14.4k modem, only to be interrupted when mom had to make a call.
I guess that's a longwinded of critiquing "file over app": It only makes sense for things that are originally files/documents to begin with. Much of our lives now are not flat docs but "experiences" that take much more thought and effort to archive. If the goal is truly to preserve that posterity, it's not enough to just archive their raw data, but to develop ways to record and later emulate entire experiences, both technological and cultural. It ain't easy!
Ask HN: It's 1997, how do you build a website?
1 project | news.ycombinator.com | 6 Feb 2024

1. Anything that can emulate Windows 95
2. HotDog, Dreamweaver, and/or Frontpage for HTML editing. They each have their strengths & weaknesses and work relatively well together. Normally I'd lay out pages in Dreamweaver or Frontpage and then clean up the HTML with HotDog. https://en.wikipedia.org/wiki/HotDog. Upload them with BulletproofFTP (http://www.bpftp.com/), an ancient predecessor of Filezilla.
3. Not sure if any of the old hosts are still around. Maybe Neocities.org comes close to the Geocities of old, or else you can get barebones and very cheap but reliable web hosting from https://www.nearlyfreespeech.net/
Get AOL dialup: https://getonline.aol.com/dialup and a 56k modem off eBay, no cheating with megabit connections.
You can emulate old browsers online: https://oldweb.today/
People generally weren't self-hosting their websites back then because phone lines weren't very reliable, and the dialup modems of the day (like today's cable modems) generally had faster download than upload speeds. After the BBS era, commercial internet hosts started becoming pretty common, but they usually used expensive ISDN or T1 lines.
Apache was still in its infancy then, and Microsoft IIS was common.
---------
Overall, I don't think the network stack is really THAT far removed from what we have today. The basics of HTTP were pretty fleshed out already (it was more the HTTPS and DNS security extensions that really evolved), along with better compression and parallelization protocols in HTTP2/3. And of there's been a huge amount of backend server optimizations, caching, reverse proxies, etc. But the old network stack would still work today and you can still run a basic barebones http daemon and firewall the same way you could back then, open up port 80, and watch your box get pwned by the bots. Hell, there are probably still zombie bots leftover from that era running on someone's basement PC, just casually scanning the internet day in and day out, waiting for its chance...
What's changed a lot more (IMO) is more the language of the web itself. HTML was very basic then, as was JS and CSS. Making websites without modern CSS, in particular, is very painful... requiring a lot of nested tables, frames (remember those? they're different from iframes!), etc. A lot of the pain was abstracted away to backend scripting languages (Perl, ASP, Coldfusion) or frontend extensions (Java, Shockwave Flash) and people didn't really work in vanilla HTML/JS very often (because they were so weak by then). Plain HTML is for your basic Astro Marmalade site with a bunch of animated gifs and whatever, but probably real ecommerce sites (such as it were) wouldn't be written in plain HTML :)
SURF THE WEB LIKE IT'S 1999!
2 projects | news.ycombinator.com | 19 Jan 2024

I like https://oldweb.today ... actually emulates old OS/browser combinations and proxies stuff from archive.org
Is there a website that shows chronological versions of websites/apps/aplatforms?
1 project | /r/UI_Design | 14 Jul 2023

Another option to what you are looking for https://oldweb.today/
Old Web, Today
1 project | news.ycombinator.com | 14 Apr 2023
I miss the old Internet
1 project | /r/Millennials | 22 Jan 2023

Good news! You can go back in time https://oldweb.today/
Casual Friday - 🎵 Talking to the moon 🌝 We could talk forever 🎵
1 project | /r/CasualConversation | 26 Aug 2022
How To See What a Website Used To Look Like In 2019?
1 project | /r/developersIndia | 24 May 2022

https://web.archive.org/ does not have a screenshot from that month. https://oldweb.today doesn't work either because it pulls up a broken version (I tried all of the browsers it allows on the left). I tried looking online for another site to be able to pull an archive but was unsuccessful.

1 project | /r/techsupport | 24 May 2022

I need to see what gravitytransformation.com/macro-calculator looked like in September of 2019. https://web.archive.org/ does not have a screenshot from that month. https://oldweb.today doesn't work either.
macOS 8 Running in the Browser
11 projects | news.ycombinator.com | 26 Apr 2022

https://oldweb.today/ gives you that experience, using some of the same building blocks.

What are some alternatives?

When comparing pygooglenews and oldweb-today you can also consider the following projects:

newscatcher - Programmatically collect normalized news from (almost) any website.

web-archives - Browser extension for viewing archived and cached versions of web pages, available for Chrome, Edge and Safari

Phoenix - Peace of mind from prototype to production

archiveweb.page - A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!

libgit2 - A cross-platform, linkable library implementation of Git that you can use in your application.

wrp - Web Rendering Proxy: Use vintage, historical, legacy browsers on modern web

fastapi-azure-auth - Easy and secure implementation of Azure Entra ID (previously AD) for your FastAPI APIs 🔒 B2C, single- and multi-tenant support.

vaporBoy - Gameboy / Gameboy Color Emulator PWA built with Preact. ⚛️ Powered by wasmBoy. 🎮Themed with VaporWave. 🌴🐬

jOOQ - jOOQ is the best way to write SQL in Java

neocities - Neocities.org - the web site. The entire thing. Yep, we're completely open source.

asgi-correlation-id - Request ID propagation for ASGI apps

picotcp.js

pygooglenews vs newscatcher oldweb-today vs web-archives pygooglenews vs Phoenix oldweb-today vs archiveweb.page pygooglenews vs libgit2 oldweb-today vs wrp pygooglenews vs fastapi-azure-auth oldweb-today vs vaporBoy pygooglenews vs jOOQ oldweb-today vs neocities pygooglenews vs asgi-correlation-id oldweb-today vs picotcp.js

Compare pygooglenews vs oldweb-today and see what are their differences.

pygooglenews

oldweb-today

pygooglenews

oldweb-today

What are some alternatives?