unclutter
Readability4J
Our great sponsors
unclutter | Readability4J | |
---|---|---|
39 | 3 | |
1,184 | 128 | |
1.2% | - | |
8.1 | 4.3 | |
about 1 month ago | over 2 years ago | |
TypeScript | HTML | |
GNU Affero General Public License v3.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
unclutter
-
Show HN: Reader Mode, but Better
Another question: do you look at your saved links frequently, for example to browse by the tag you assigned? What's the primary purpose of tagging?
I just created a ticket for this: https://github.com/lindylearn/unclutter/issues/595
Hey thank you! I'm really glad you like the extension.
There actually is an "auto-activate" feature you can enable in the settings. Is this what you had in mind?
Regarding mobile support, I know. I'm not sure how to handle mobile Chrome (which doesn't allow extensions), but for Safari this should be possible. See https://github.com/lindylearn/unclutter/issues/529
I like it. Ought to be a built-in browser feature, in addition to generic reader mode. But then so should uBO-level ad blocking...so you may have many years.
Have you considered donations/sponsors/patrons? Presumably more an occasional coffee requires massive numbers of users for something like this, but maybe you could do something like prioritize attention to site-specific fixes (which seem to be the main thing in https://github.com/lindylearn/unclutter/issues) for supporters.
It is good enough for many people and readability.js is re-used in many other projects. I'm really grateful for it.
Unclutter just produces more visually pleasing results by keeping the original style of the website intact. Here's a side-by-side comparison: https://github.com/lindylearn/unclutter/blob/main/docs/compa...
It's possible for extensions to only get access when you activate it for a specific tab. If you want Unclutter to work this way you can manually set "Site access" to "On click" in the Chrome extension settings.
The reason I enabled "all sites" by default is to make the automatic activation feature work (which used to be more powerful). Possibly this can be done with an optional all-sites permission now, I'll look into it: https://github.com/lindylearn/unclutter/issues/527
Thanks for the feedback! Also, if you don't like something build the extension yourself or submit a PR ;)
I was also skeptical at first, but the code is open source, and there's clear documentation on privacy policy and metrics collected.
https://github.com/lindylearn/unclutter/blob/main/docs/metri...
Bravo to the author, well done for earning the users' trust.
Makes sense!
I've actually had a similar feature request before: https://github.com/lindylearn/unclutter/issues/297
Have you tried some of the existing solutions mentioned in the ticket? I'm curious if they solve the problem for you, and if not, what Unclutter could do better.
Not an edge case, I've been tracking this for a while: https://github.com/lindylearn/unclutter/issues/13
Someone else in this thread suggested a version for mobile Safari which made supporting this even more interesting. No promises, but hopefully I can get to this before the end of the year.
There already is crowdsourcing of broken page reports: https://github.com/lindylearn/unclutter/issues?q=is%3Aissue+...
And twitter.com is a special case: https://github.com/lindylearn/unclutter/issues/570
I'm working on those, but it's never going to be perfect unfortunately.
-
Unclutter — a browser extension to read & save articles
All of this is only possible through the feedback and Open-Source contributions from all of you! Here’s more info: unclutter.lindylearn.io
Readability4J
-
Creating an advanced search engine with PostgreSQL
Depending upon the type of content, one might want to look into using the Readability (Browder's reader view) to parse the webpage. It will give you all the useful info without the junk. Then you can put it in the DB as needed.
https://github.com/mozilla/readability
Btw, readability, is also available in few other languages like Kotlin:
-
How does Firefox's Reader View work?
My Hacker News client HACK for iOS and Android has a reader mode ability browser. While on iOS, I was able to use the reader mode feature provided by SFSafariViewController, that wasn't available on android.
So I had to read a ton about this. I ended up using a heavily modified Kotlin version of Readability:
https://github.com/dankito/Readability4J
https://play.google.com/store/apps/details?id=com.pranapps.h...
-
Show HN: Instantly Listen to Any URL
Not sure about OP but I just implemented this in my Hacker News android client (thanks for the idea OP).
This is how I implemented it. I had already achieved article to "reader mode" by heavily customizing the Kotlin port of Mozilla‘s Readability:
https://github.com/dankito/Readability4J
Then I pass the text via Android's TextToSpeech library and it works very well:
fun trySpeaking(str:String){
What are some alternatives?
go-readability - Go package that cleans a HTML page for better readability.
murder - Large scale server deploys using BitTorrent and the BitTornado library
article-extractor - To extract main article from given URL with Node.js
Just-Read - A customizable read mode web extension.
readability - A standalone version of the readability lib
percollate - A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs.
web-clipper - For Notion,OneNote,Bear,Yuque,Joplin。Clip anything to anywhere
dom-distiller - Distills the DOM
bypass-paywalls-chrome - Bypass Paywalls web browser extension for Chrome and Firefox.
arc90-readability - A copy of the original Arc90 repo with links to many of the current ports.
htmldate - Fast and robust date extraction from web pages, with Python or on the command-line
soup-strainer - A reimplementation of the Readability/Decruft algorithm using BeautifulSoup and html5lib