mlscraper vs RSSHub

| | mlscraper | RSSHub |
|---|---|---|
| | 10 | 26 |
| Stars | 1,229 | 29,714 |
| Growth | - | - |
| Activity | 0.6 | 10.0 |
| | about 2 months ago | 4 days ago |
| Language | Python | TypeScript |
| License | - | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
mlscraper
-
What are the best tools for web scraping and analysis of natural language to populate a dataset?
See if something like autoscraper or mlscraper suits your needs.
-
Experimental library for scraping websites using OpenAI's GPT API
Why GPT-based then? There are libraries that do this: You give examples, they generate the rules for you and give you a scraper object that takes any html and returns the scraped data.
Mine: https://github.com/lorey/mlscraper
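As a rough illustration of the "give examples, get a scraper object" idea described above, here is a toy sketch using only Python's standard library. It is not mlscraper's API or algorithm: the `train` and `scrape` helpers and the `(tag, class)` rule format are hypothetical, and the approach assumes simple, well-formed HTML without void elements such as `<br>`.

```python
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Records (tag, class attribute, text) for every element that directly contains text."""

    def __init__(self):
        super().__init__()
        self._stack = []  # currently open elements as (tag, class)
        self.found = []   # collected (tag, class, text) triples

    def handle_starttag(self, tag, attrs):
        self._stack.append((tag, dict(attrs).get("class")))

    def handle_endtag(self, tag):
        # Pop open elements until the one being closed has been removed.
        while self._stack:
            if self._stack.pop()[0] == tag:
                break

    def handle_data(self, data):
        text = data.strip()
        if text and self._stack:
            tag, cls = self._stack[-1]
            self.found.append((tag, cls, text))


def _collect(html):
    parser = _TextCollector()
    parser.feed(html)
    parser.close()
    return parser.found


def train(html, example_text):
    """Hypothetical helper: derive a (tag, class) rule from one example value."""
    for tag, cls, text in _collect(html):
        if text == example_text:
            return (tag, cls)
    raise ValueError("example text not found in the training HTML")


def scrape(html, rule):
    """Hypothetical helper: return the text of every element matching the rule."""
    return [text for tag, cls, text in _collect(html) if (tag, cls) == rule]


# Made-up training page: we only tell the trainer which value we want.
training_html = '<div><span class="author">Jane Doe</span><p class="quote">Hello</p></div>'
rule = train(training_html, "Jane Doe")

# The learned rule is then applied to a different, unseen page.
new_html = (
    '<ul><li><span class="author">John Roe</span></li>'
    '<li><span class="author">Ann Poe</span></li></ul>'
)
authors = scrape(new_html, rule)
print(rule, authors)
```

A real library would have to generalize across several examples and cope with messy markup; this sketch only shows the overall shape of the workflow (examples in, reusable rule out).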
-
Could someone recommend a library for C# like one of these two (they are for Python): mlscraper and autoscraper?
GitHub - lorey/mlscraper: 🤖 Scrape data from HTML websites automatically by just providing examples
-
Smart Scraper
Check it out here: https://github.com/lorey/mlscraper Example: https://github.com/lorey/mlscraper/blob/master/examples/quotes_to_scrape.py
- Pre-trained Webscraping Models
- 🤖 Scrape data from HTML websites automatically by just providing examples
- mlscraper: Scrape data from HTML pages automatically with Machine Learning
-
Show HN: RSS feeds for arbitrary websites using CSS selectors
In case anyone wants to detect the selectors automatically, here's a small Python library I wrote that does it for you: https://github.com/lorey/mlscraper
RSSHub
-
Ask HN: Nitter officially declared "over" today, alternatives?
you could run your own instance of rsshub
https://docs.rsshub.app/
-
Show HN: Extract RSS feed from almost anything
This is a good application. However, among similar products, I think https://github.com/DIYgod/RSSHub is a more usable choice (it can generate RSS). Using RSSHub with the https://github.com/DIYgod/RSSHub-Radar browser extension is also more convenient.
- Generate RSS feed for any website using CSS selectors
- Any alternative to Feedly?
-
I was fed up with endless scrolling on reddit, so I wrote some scripts to give me only the top 10 posts from the last day. It keeps me in the loop without wasting much time, and gives me my own personalized reddit newspaper. The code runs daily at 8AM and 8PM on my server. GitHub link in the comments.
Check out RSSHub.
-
The animated series (meant to be released on December 3rd) was rescheduled. The latest release date will be announced as soon as possible.
Found it from RSSHub: an open source and extensible RSS feed generator.
- Get Twitter, Telegram, Instagram, and other content into your RSS feed with RSSHub, a self-hosted RSS feed aggregator
-
A script to convert any url into an rss feed
Not a script, but you could try something like RSSHub to get the site into an RSS-digestible format…
-
35 thought-provoking websites that will help you learn new things - AI powered research assistant, list of Rss feed readers, open links from the web in apps instead
https://github.com/DIYgod/RSSHub - open source, easy to use, and extensible RSS feed generator. It's capable of generating RSS feeds from pretty much everything.
-
Org Feed + esxml: make an RSS feed out of any website!
There's also RSSHub, which is pure js.
What are some alternatives?
scrapingant-client-python - ScrapingAnt API client for Python.
RSS-Bridge - The RSS feed for websites missing it
ttrss_plugin-feediron - Evolution of ttrss_plugin-af_feedmod
RSS3 - RSS3 is a next-generation feed standard that aims to support efficient and decentralized information distribution. [Moved to: https://github.com/NaturalSelectionLabs/RSS3-Protocol]
furss - Fix Up RSS (and atom): Make full-text versions of rss/atom feeds
rss-proxy - RSS-proxy allows you to create an RSS or ATOM feed of almost any website, just by analyzing the static HTML structure.
feed-me-up-scotty
full-text-rss - Full-Text RSS can transform partial feeds to deliver the full content stripped of clutter and ads
rssify - Tool that generates an rss feed out of websites that don't have one
feedgen - Generates RSS/ATOM/JSON feeds. Can be reasonably extended or create a feed using the CSS generator.
pixiv-omina - Pixiv Omina is software for downloading artworks and comics from Pixiv and Pixiv Comic