Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free. Learn more →
Top 23 News Open-Source Projects
-
-
Stream-Framework
Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
An ultralight social media app with no dependencies that can run on shared web hosting. It's an API like Getstream, so F/E is up to you. I've had a fork of it in production for 2.5 years on a subscription site that generates a small income.
-
Sonar
Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.
-
-
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Project mention: Trafilatura: Python tool to gather text on the Web | news.ycombinator.com | 2023-08-14The feature list answers that question pretty well: https://github.com/adbar/trafilatura#features
Basically: you could implement all of this on top of BeautifulSoup - polite crawling policies, sitemap and feed parsing, URL de-duplication, parallel processing, download queues, heuristics for extracting just the main article content, metadata extraction, language detection... but it would require writing an enormous amount of extra code.
-
news-please
news-please - an integrated web crawler and information extractor for news that just works
Look at news-please, you can find it in GitHub. I did something similar and it was very helpful. You can hit me up with if you have any questions.
-
-
simorgh
The BBC's Open Source Single Page Application. Contributions welcome! Used on some of our biggest websites, e.g.
Project mention: Does Thai language uses question mark in the end of the sentence to denote an interrogative sentence? | /r/thai | 2023-05-25But on some articles here I can see question marks.
-
InfluxDB
Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.
-
Project mention: Show HN: Circumflex, browse HN in your terminal | /r/patient_hackernews | 2022-10-14
-
Project mention: Vocês já desenvolveram projetos pessoais grandes? Como isso te afetou? | /r/brdev | 2023-06-25
-
One of the first things I do when I test aggregator sites like this one is to look for the RSS feed. I may have missed something but I did not find one here - could that be added? It would make it much easier to integrate into my normal 'news flow' which is more or less entirely built around RSS feeds presented through Nextcloud News [1]. I can scrape sites which do not provide their own feed but that is tedious and just increases the load on those sites.
-
sdupdates
A mega collection of all resources and news related to Stable Diffusion. Focused around AUTOMATIC1111's webui (https://github.com/AUTOMATIC1111/stable-diffusion-webui)
This will get you caught up to the 2nd of March: https://github.com/questianon/sdupdates
-
Giveme5W1H
Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
-
Great list, maybe double check with https://github.com/ligurio/awesome-openbsd
-
GNews
A Happy and lightweight Python Package that Provides an API to search for articles on Google News and returns a JSON response.
Project mention: Need some help with my personal project (interactive world map with real-time data) | /r/datascience | 2023-05-15The web crawling part wasn't much of an issue - I am using an existing API (https://pypi.org/project/gnews/) which does what I needed. The issue lies in, well, pretty much the rest of the task described above. I need to create an interactive world map with real-time data (news articles) - more specifically, maintaining the data server, figuring out the data mapping part, etc. Since I pretty much have no experience in this, I would like to ask you guys for some directions. What tool would I need to use and how would I store/load the data? Is it possible to do so without writing some Javascript code myself?
-
To have a better idea of what we have now you can check the platform demo: - https://demo.reldens.com/
-
Readflow
readflow is a news-reading (or read-it-later) solution focused on versatility and simplicity.
-
-
-
Project mention: Python script that opens my bookmarks and returns only links posted in the last 14 days | /r/learnpython | 2023-05-07
Another option you could consider would be using a wrapper library around google news if you struggle with implementing the scarping logic yourself. The downside is that you'll still have to be careful so your IP doesn't get blocked. Make sure you limit the amount of requests per second/minute...
-
research-threats
Collection of legal threats against good faith Security Researchers; vulnerability disclosure gone wrong. A continuation of work started by @attritionorg
-
-
www.getpocket.com
-
You could try django-newsfeed but this might make things harder in the long run if you want to make modifications to it.
-
Mergify
Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.
News related posts
- Archive Today (a.k.a. archive.is) seems to be down
- The way I limit my social media consumption
- News Homepages – open-source archive saves, shares, and analyzes news homepages
- Community Spotlight: Keeping open source free with Dirk Müller, openSUSE
- FreeNews DNS Unpaywaller now supports basic Adblock and JavaScript disabler
- Show HN: It's Like HN, but for Science
- Created a free and open-source TV news app that features free, legal and publicly available English TV News Channels from All Around the World in just One App. Now available on Google Play Store!
-
A note from our sponsor - Mergify
blog.mergify.com | 29 Sep 2023
Index
What are some of the best open-source News projects? This list will help you:
Project | Stars | |
---|---|---|
1 | newspaper | 13,135 |
2 | Stream-Framework | 4,695 |
3 | Refinery CMS | 3,881 |
4 | trafilatura | 2,060 |
5 | news-please | 1,749 |
6 | pygooglenews | 1,179 |
7 | simorgh | 1,177 |
8 | circumflex | 843 |
9 | burlesco | 831 |
10 | news | 735 |
11 | sdupdates | 505 |
12 | Giveme5W1H | 476 |
13 | awesome-openbsd | 394 |
14 | GNews | 388 |
15 | reldens | 366 |
16 | Readflow | 314 |
17 | marquee-scroller | 305 |
18 | journalist | 248 |
19 | GoogleNews | 248 |
20 | research-threats | 247 |
21 | FLUTTER_NewsApp | 194 |
22 | pocket-ios | 191 |
23 | django-newsfeed | 188 |