HungryHippo vs mlscraper

Project	HungryHippo	mlscraper
Mentions	2	10
Stars	46	1,229
Growth	-	-
Activity	5.4	0.6
Latest Commit	4 months ago	about 2 months ago
Language	TypeScript	Python
License	GNU General Public License v3.0 or later	-

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity - a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is among the top 10% of the most actively developed projects that we are tracking.

HungryHippo

Posts with mentions or reviews of HungryHippo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2021-07-05.

Show HN: RSS feeds for arbitrary websites using CSS selectors
15 projects | news.ycombinator.com | 5 Jul 2021

It seems that RSS feed generators are a bit like static site generators: it's often thought to be easier to make your own than to learn to use someone else's.
Anyway, here's another self-hosted open source RSS feed generator for arbitrary websites: https://github.com/hueyy/HungryHippo

Looking for a website changes monitor plus notifications.
4 projects | /r/selfhosted | 7 Apr 2021

HungryHippo generates RSS feeds for websites that don't have one. If the website you're trying to monitor is public it might be supported already. If not, you can always send in a pull request or open an issue. It can be hosted with a single docker image, so it's quite straightforward.
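
To make the idea of generating a feed for a site that lacks one concrete, here is a minimal Python sketch that turns already-extracted (title, URL) pairs into an RSS 2.0 document. It is not HungryHippo's code (HungryHippo is written in TypeScript); the function name `build_rss`, the example.com URLs, and the item titles are all hypothetical, and the scraping step that would produce the items is assumed to have happened elsewhere.

```python
import xml.etree.ElementTree as ET


def build_rss(title, link, description, items):
    """Build a minimal RSS 2.0 document from (title, url) pairs."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    # RSS 2.0 requires these three channel elements.
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for item_title, item_url in items:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = item_title
        ET.SubElement(item, "link").text = item_url
        # Reuse the URL as the item's unique identifier.
        ET.SubElement(item, "guid").text = item_url
    return ET.tostring(rss, encoding="unicode")


# Hypothetical items, standing in for whatever a scraper extracted.
feed_xml = build_rss(
    "Example site",
    "https://example.com/",
    "Generated feed for a site without RSS",
    [
        ("First post", "https://example.com/posts/1"),
        ("Second post", "https://example.com/posts/2"),
    ],
)
print(feed_xml)
```

A real generator would also need to fetch the page, extract the items, and serve the XML over HTTP; the sketch covers only the serialization step.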

mlscraper

Posts with mentions or reviews of mlscraper. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-12.

What are the best tools for web scraping and analysis of natural language to populate a dataset?
3 projects | /r/datasets | 12 Apr 2023

See if something like autoscraper or mlscraper suits your needs.

Experimental library for scraping websites using OpenAI's GPT API
7 projects | news.ycombinator.com | 25 Mar 2023

Why GPT-based then? There are libraries that do this: You give examples, they generate the rules for you and give you a scraper object that takes any html and returns the scraped data.
Mine: https://github.com/lorey/mlscraper
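
The workflow described in this comment (give examples, get rules, apply them to new HTML) can be illustrated with a toy sketch using only Python's standard library. This is not mlscraper's algorithm or API: the names `train` and `scrape`, the (tag, class) rule format, and the sample HTML are assumptions made for illustration, and the sketch assumes well-formed HTML.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not be pushed.
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class _TextIndex(HTMLParser):
    """Record the (tag, class) of the element enclosing each text node."""

    def __init__(self):
        super().__init__()
        self.stack = []
        self.records = []  # list of ((tag, class), text)

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append((tag, dict(attrs).get("class") or ""))

    def handle_endtag(self, tag):
        if tag not in VOID_TAGS and self.stack:
            self.stack.pop()

    def handle_data(self, data):
        text = data.strip()
        if text and self.stack:
            self.records.append((self.stack[-1], text))


def _index(html):
    parser = _TextIndex()
    parser.feed(html)
    parser.close()
    return parser.records


def train(html, example_texts):
    """Derive the single (tag, class) rule shared by all example texts."""
    matched = {text: rule for rule, text in _index(html) if text in example_texts}
    missing = set(example_texts) - set(matched)
    if missing:
        raise ValueError(f"examples not found in page: {sorted(missing)}")
    rules = set(matched.values())
    if len(rules) != 1:
        raise ValueError(f"examples do not share one rule: {sorted(rules)}")
    return rules.pop()


def scrape(html, rule):
    """Apply a learned rule to any HTML and return the matching texts."""
    return [text for r, text in _index(html) if r == rule]


TRAIN_HTML = (
    '<div><span class="quote">To be</span><small class="author">Hamlet</small></div>'
    '<div><span class="quote">I think</span><small class="author">Descartes</small></div>'
)
NEW_HTML = (
    '<ul><li><span class="quote">Eureka</span>'
    '<small class="author">Archimedes</small></li></ul>'
)

rule = train(TRAIN_HTML, ["To be", "I think"])
print(rule)
print(scrape(NEW_HTML, rule))
```

The learned rule here is deliberately crude (one tag name plus one class attribute). A practical library would search over richer selectors and validate them against several pages.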

Could someone recommend me a library for C# like one of these two (they are for Python): mlscraper and autoscraper
2 projects | /r/learnprogramming | 19 Mar 2023

GitHub - lorey/mlscraper: 🤖 Scrape data from HTML websites automatically by just providing examples

Smart Scraper
1 project | /r/webscraping | 14 Feb 2023

Check it out here: https://github.com/lorey/mlscraper Example: https://github.com/lorey/mlscraper/blob/master/examples/quotes_to_scrape.py

Pre-trained Webscraping Models
2 projects | /r/webscraping | 4 Dec 2022

🤖 Scrape data from HTML websites automatically by just providing examples
1 project | /r/coolgithubprojects | 18 Oct 2022

mlscraper: Scrape data from HTML pages automatically with Machine Learning
1 project | news.ycombinator.com | 5 Jul 2021

Show HN: RSS feeds for arbitrary websites using CSS selectors
15 projects | news.ycombinator.com | 5 Jul 2021

In case anyone wants to detect the selectors automatically, here's a small python library I wrote that does it for you: https://github.com/lorey/mlscraper

What are some alternatives?

When comparing HungryHippo and mlscraper you can also consider the following projects:

feed-me-up-scotty

scrapingant-client-python - ScrapingAnt API client for Python.

feedgen - Generates RSS/ATOM/JSON feeds. Can be reasonably extended or create a feed using the CSS generator.

ttrss_plugin-feediron - Evolution of ttrss_plugin-af_feedmod

rssify - script that generates an rss feed out of websites that don't have one

furss - Fix Up RSS (and atom): Make full-text versions of rss/atom feeds

RSSHub - 🧡 Everything is RSSible

urlwatch - Watch (parts of) webpages and get notified when something changes via e-mail, on your phone or via other means. Highly configurable.

rssify - Tool that generates an rss feed out of websites that don't have one

telegram-to-rss - Telegram Bot to generate an RSS feed from group messages
