News

Open-source projects categorized as News

Top 23 News Open-Source Projects

  • newspaper

    News, full-text, and article metadata extraction in Python 3. Advanced docs:

  • Stream-Framework

    Stream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:

    Project mention: On what side project you guys are working on? | /r/webdev | 2023-06-29

    An ultralight social media app with no dependencies that can run on shared web hosting. It's an API like Getstream, so F/E is up to you. I've had a fork of it in production for 2.5 years on a subscription site that generates a small income.

  • Sonar

    Write Clean Python Code. Always.. Sonar helps you commit clean code every time. With over 225 unique rules to find Python bugs, code smells & vulnerabilities, Sonar finds the issues while you focus on the work.

  • Refinery CMS

    An extendable Ruby on Rails CMS that supports Rails 6.0+

  • trafilatura

    Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

    Project mention: Trafilatura: Python tool to gather text on the Web | news.ycombinator.com | 2023-08-14

    The feature list answers that question pretty well: https://github.com/adbar/trafilatura#features

    Basically: you could implement all of this on top of BeautifulSoup - polite crawling policies, sitemap and feed parsing, URL de-duplication, parallel processing, download queues, heuristics for extracting just the main article content, metadata extraction, language detection... but it would require writing an enormous amount of extra code.

  • news-please

    news-please - an integrated web crawler and information extractor for news that just works

    Project mention: Data extraction from news media outlets? | /r/datasets | 2022-10-13

    Look at news-please, you can find it in GitHub. I did something similar and it was very helpful. You can hit me up with if you have any questions.

  • pygooglenews

    If Google News had a Python library

  • simorgh

    The BBC's Open Source Single Page Application. Contributions welcome! Used on some of our biggest websites, e.g.

    Project mention: Does Thai language uses question mark in the end of the sentence to denote an interrogative sentence? | /r/thai | 2023-05-25

    But on some articles here I can see question marks.

  • InfluxDB

    Collect and Analyze Billions of Data Points in Real Time. Manage all types of time series data in a single, purpose-built database. Run at any scale in any environment in the cloud, on-premises, or at the edge.

  • circumflex

    🌿 It's Hacker News in your terminal

    Project mention: Show HN: Circumflex, browse HN in your terminal | /r/patient_hackernews | 2022-10-14
  • burlesco

    Leia notícias sem ser assinante, burle o paywall (WebExtension)

    Project mention: Vocês já desenvolveram projetos pessoais grandes? Como isso te afetou? | /r/brdev | 2023-06-25
  • news

    :newspaper: RSS/Atom feed reader

    Project mention: Show HN: It's Like HN, but for Science | news.ycombinator.com | 2023-09-09

    One of the first things I do when I test aggregator sites like this one is to look for the RSS feed. I may have missed something but I did not find one here - could that be added? It would make it much easier to integrate into my normal 'news flow' which is more or less entirely built around RSS feeds presented through Nextcloud News [1]. I can scrape sites which do not provide their own feed but that is tedious and just increases the load on those sites.

    [1] https://github.com/nextcloud/news

  • sdupdates

    A mega collection of all resources and news related to Stable Diffusion. Focused around AUTOMATIC1111's webui (https://github.com/AUTOMATIC1111/stable-diffusion-webui)

    Project mention: What happened while I was gone? | /r/StableDiffusion | 2023-03-14

    This will get you caught up to the 2nd of March: https://github.com/questianon/sdupdates

  • Giveme5W1H

    Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?

  • awesome-openbsd

    A curated list of awesome OpenBSD resources

    Project mention: Privacy Guides for OpenBSD | /r/openbsd | 2023-05-15

    Great list, maybe double check with https://github.com/ligurio/awesome-openbsd

  • GNews

    A Happy and lightweight Python Package that Provides an API to search for articles on Google News and returns a JSON response.

    Project mention: Need some help with my personal project (interactive world map with real-time data) | /r/datascience | 2023-05-15

    The web crawling part wasn't much of an issue - I am using an existing API (https://pypi.org/project/gnews/) which does what I needed. The issue lies in, well, pretty much the rest of the task described above. I need to create an interactive world map with real-time data (news articles) - more specifically, maintaining the data server, figuring out the data mapping part, etc. Since I pretty much have no experience in this, I would like to ask you guys for some directions. What tool would I need to use and how would I store/load the data? Is it possible to do so without writing some Javascript code myself?

  • reldens

    Reldens - You can make it - Open Source MMORPG Platform

    Project mention: [Paid] - Game artist / designer for a 2D MRPG | /r/INAT | 2023-02-08

    To have a better idea of what we have now you can check the platform demo: - https://demo.reldens.com/

  • Readflow

    readflow is a news-reading (or read-it-later) solution focused on versatility and simplicity.

  • marquee-scroller

    Marquee Scroller Clock News Weather and More

  • journalist

    Journalist. An RSS aggregator.

  • GoogleNews

    Script for GoogleNews

    Project mention: Python script that opens my bookmarks and returns only links posted in the last 14 days | /r/learnpython | 2023-05-07

    Another option you could consider would be using a wrapper library around google news if you struggle with implementing the scarping logic yourself. The downside is that you'll still have to be careful so your IP doesn't get blocked. Make sure you limit the amount of requests per second/minute...

  • research-threats

    Collection of legal threats against good faith Security Researchers; vulnerability disclosure gone wrong. A continuation of work started by @attritionorg

  • FLUTTER_NewsApp

    Live News App Using Rest API with Searching Functionality

  • pocket-ios

    Mozilla's Pocket iOS App, Rebuilt in Swift

    Project mention: How do you entertain yourself on The Path? | /r/jerseycity | 2023-07-11

    www.getpocket.com

  • django-newsfeed

    A news curator and newsletter subscription package for Django

    Project mention: How to make a "ads wall" | /r/djangolearning | 2022-10-30

    You could try django-newsfeed but this might make things harder in the long run if you want to make modifications to it.

  • Mergify

    Updating dependencies is time-consuming.. Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2023-09-09.

News related posts

Index

What are some of the best open-source News projects? This list will help you:

Project Stars
1 newspaper 13,135
2 Stream-Framework 4,695
3 Refinery CMS 3,881
4 trafilatura 2,060
5 news-please 1,749
6 pygooglenews 1,179
7 simorgh 1,177
8 circumflex 843
9 burlesco 831
10 news 735
11 sdupdates 505
12 Giveme5W1H 476
13 awesome-openbsd 394
14 GNews 388
15 reldens 366
16 Readflow 314
17 marquee-scroller 305
18 journalist 248
19 GoogleNews 248
20 research-threats 247
21 FLUTTER_NewsApp 194
22 pocket-ios 191
23 django-newsfeed 188
Updating dependencies is time-consuming.
Solutions like Dependabot or Renovate update but don't merge dependencies. You need to do it manually while it could be fully automated! Add a Merge Queue to your workflow and stop caring about PR management & merging. Try Mergify for free.
blog.mergify.com