ttrss_plugin-feediron
mlscraper
ttrss_plugin-feediron | mlscraper | |
---|---|---|
9 | 10 | |
203 | 1,229 | |
1.0% | - | |
3.2 | 0.6 | |
24 days ago | about 2 months ago | |
PHP | Python | |
MIT License | - |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ttrss_plugin-feediron
-
Show HN: Twine – Gorgeous open source multiplatform RSS app
It Technically could but as far as I am aware it doesn't.
Some alternatives
TT-RSS has multiple plugins, including the one I currently support https://github.com/feediron/ttrss_plugin-feediron
FressRSS has CSS selector support out of the box and has a readability extension that supports Readability or Mercury
I've been dreaming of porting Feediron to both FreshRSS and Nextcloud news. But I barely have any free time as is... one day
-
If you do happen to switch to an alternative, remember to also consider RSS syndication - it can be very useful
Back when I was using Tiny Tiny RSS I've developed af_feedmod to download the article from the linked webpage so you'd end up with a full feed. This was later forked into FeedIron and seems to be somewhat popular by now.
-
TinyTinyRSS vs. FreshRSS
I'm a die hard TT-RSS user, mainly because of the Feediron plugin (A full text page parser) that I now maintain.
-
RSS-Bridge – The RSS feed for websites missing it
The official plugin uses a php port of Mozilla's Readability, which is used for Firefox Reader Mode. There is also the 3rd party FeedIron that is more configurable.
https://github.com/feediron/ttrss_plugin-feediron
-
Show HN: RSS feeds for arbitrary websites using CSS selectors
Always good to see RSS projects pop up on hackernews. I'm still maintaining the Feediron plugin for TT-RSS - https://github.com/feediron/ttrss_plugin-feediron
Unlike this project Feediron is only for modifying existing RSS feeds to extract the desired information. Typically uses xpaths to select content
- How image search works at Dropbox
-
I Still Use RSS
> I've never open sourced it though because I guess it's a bit of a grey area
I'm maintaining the TT-RSS plugin feediron https://github.com/feediron/ttrss_plugin-feediron that fetches full-text data, so my thinking is this:
At the end of the day if it's a openly available website and you are personally (through your own server) fetching the resources I don't think anyone has a right to complain.
Now if you were offering it as a service it might arguably be a bit more grey, but only if you're ignoring the robots.txt file
-
Journalist: A RSS aggregator that speaks the Fever API
So you plan something like the FeedIron TT-RSS Plugin (https://github.com/feediron/ttrss_plugin-feediron) to allow customization of a feed to get relevant content?
mlscraper
-
What are the best tools for web scraping and analysis of natural language to populate a dataset?
See if something like autoscraper or mlscraper suits your needs.
-
Experimental library for scraping websites using OpenAI's GPT API
Why GPT-based then? There are libraries that do this: You give examples, they generate the rules for you and give you a scraper object that takes any html and returns the scraped data.
Mine: https://github.com/lorey/mlscraper
-
Could someone recommend me a library for c# like one of these two (they are for python) : mlscraper and autoscraper
GitHub - lorey/mlscraper: 🤖 Scrape data from HTML websites automatically by just providing examples
-
Smart Scraper
Check it out here: https://github.com/lorey/mlscraper Example: https://github.com/lorey/mlscraper/blob/master/examples/quotes\_to\_scrape.py
- Pre-trained Webscraping Models
- 🤖 Scrape data from HTML websites automatically by just providing examples
- mlscraper: Scrape data from HTML pages automatically with Machine Learning
-
Show HN: RSS feeds for arbitrary websites using CSS selectors
In case anyone wants to detect the selectors automatically, here's a small python library I wrote that does it for you: https://github.com/lorey/mlscraper
What are some alternatives?
FreshRSS - A free, self-hostable news aggregator…
scrapingant-client-python - ScrapingAnt API client for Python.
mercury_fulltext - 📖 Enjoy full text for tt-rss.
furss - Fix Up RSS (and atom): Make full-text versions of rss/atom feeds
full-text-rss-docker - A debian:buster-slim full-text-rss Docker Container
feed-me-up-scotty
RSSHub - 🧡 Everything is RSSible
rssify - Tool that generates an rss feed out of websites that don't have one
elfeed - An Emacs web feeds client
ALL-about-RSS - A list of RSS related stuff: tools, services, communities and tutorials, etc.
feedgen - Generates RSS/ATOM/JSON feeds. Can be reasonably extended or create a feed using the CSS generator.