parser
awesome-reMarkable
parser | awesome-reMarkable | |
---|---|---|
12 | 146 | |
5,254 | 5,853 | |
1.8% | 0.7% | |
1.1 | 7.3 | |
6 months ago | about 2 months ago | |
JavaScript | ||
Apache License 2.0 | Creative Commons Zero v1.0 Universal |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
parser
-
Show HN: I made a tool to clean and convert any webpage to Markdown
Thoroughly scraping is challenging, especially in an environment where you don’t have (or want) a JavaScript runtime.
For content extraction, I found the approach the Postlight library takes quite neat. It scores individual html nodes based on some heuristics (text length, link density, css classes). It the selects the nodes with the highest score. [1] I ported it to Swift for a personal read later app.
[1] https://github.com/postlight/parser
-
Trouble Building Chrome Extension to Get News Article Content
I've been working on an enhanced reader mode extension for the last few months. I found that Mercury Reader's parser tool is useful for extracting content. If that's not exactly what you're looking for, readibility is another good option. It's a library used inside Firefox's reader moder that you can use in any project.
-
What Are The Coolest Virtual Machines You Currently Run 24/7?
I currently have it turned off while I search for better sources, but I have a VM that runs a custom cron script that combines a custom RSS reader, podfox, mercury-parser, and coqui-ai to generate audio podcasts from RSS news feeds. I should probably clean it up and release the script/setup process. With a few tweaks and some AI text-to-speech and a little machine learning audio processing you can get a really good podcast experience from text posts.
-
Extracting Text button no longer works
It looks like Relay could be updated to convert it locally though, since the parser that it uses appears to be open source.
-
Which are some open-source Chrome extensions you want to use on Firefox?
https://github.com/postlight/mercury-parser The only one I need, shit's too good
-
API for getting news fulltext
An alternative would be to extract the plain text from the article's page with either some "readability" API or a library like Mercury Parser: https://github.com/postlight/mercury-parser
-
How does Firefox's Reader View work?
I haven’t directly compared them, but I have also found mercury parser (https://github.com/postlight/mercury-parser) to be very reliable.
Since it turns a website into very plain (X)HTML it‘s fairly easy to use it to make a browsing proxy or automatically produce epub files for e-readers, which is what I do.
-
Build your self-hosted Evernote
Make sure that at the end of the process you have the node and npm executables installed - the http.webpage integration uses the Mercury Parser API to convert web pages to Markdown.
-
Reading from the web offline and distraction-free
Good luck! Those HTML issues you're coming across are tough and so varied across the web!
I was working with Mercury Parser (pluggable parsing for different sites) in the past.
https://github.com/postlight/mercury-parser
- The most underused browser feature
awesome-reMarkable
-
E-ink is so Retropunk
> As much as I love the hacker spirit of cracking open hardware and software and bending it to your will (whether or not it was designed towards that end), I enjoy my reMarkable precisely because I can get away from the ubiquity of computing and needing to constantly tinker with and repair software.
Personally I completely agree with you, and could have written almost exactly that paragraph - I too have a ReMarkable (the 2nd / current version), and love using it as it ships for both note taking and especially for reading ebooks/PDFs ("especially" just because it's what I use it for more, not because that's what it's better at - in fact, it's UI for reading documents is among its weaker points and I hope they improve it in future software updates).
However it's worth pointing out that you can SSH into it, and there are a fair few 3rd party tools and hacks for it - so far I've avoided trying any of them as there's nothing that I want enough to have even a 1% risk of bricking it to worry about. But I'm tempted to start playing around with it someday.
This is the best list of stuff for the ReMarkable that I'm aware of, though I don't know how complete it is / how many released tools or guides there might be that aren't included here:
https://github.com/reHackable/awesome-reMarkable
-
Just bought a reMarkable - quite UNremarkable
There are options for USB/wifi syncing and lots of other community mods if you're handy with a terminal: https://github.com/reHackable/awesome-reMarkable
-
Dumb questions
If you follow the instructions and you are fine to turn automatic updates off, you may have a lool at awesome-remarkable https://github.com/reHackable/awesome-reMarkable
-
My First reMarkable will be arriving sometime today! What are some things or tips and tricks I should know?
This sentence doesn't make sense. People apply hacks because they want to make full use of their device. reMarkable has shortcomings, yes, but they can be overcome with the software that others have written. The Awesome reMarkable link the sidebar was basically a founding document of this very subreddit.
-
Best E-Ink tablet for self-hosting
More info can be found at awesome-ReMarkable: https://github.com/reHackable/awesome-reMarkable
-
created templates disappeared after update
Use a software to manage your templates automatically. See the Awesome reMarkable list, and Ctrl-F "templates".
- Linux friendly eInk tablets
-
If I broke or lost my ReMarkable 2, would I be able to download all the old notes onto a new one?
You can also take backups using easy, convenient, community-written software, like RCU (which I'm the author of), reMy, reMarkable HyUtilities, rmExplorer, rmAPI, and many others found in the Awesome reMarkable list.
- What are you doing with community projects?
-
Big note files - timeout on usb webserver export
You could try reMy, which has its own renderer. There are more rendering programs in the Awesome reMarkable list, many of which will work with 2.15 and below--just avoid anything saying 'cloud' or 'web UI'.
What are some alternatives?
readability - A standalone version of the readability lib
zotero-remarkable - Sync papers from Zotero to a reMarkable tablet
hn-search - Hacker News Search
google-drive-remarkable-sync - Apps Script library for synchronising Google Drive folder with Remarkable reader.
Just-Read - A customizable read mode web extension.
remarkable-hacks - additional functionality via binary patching
FParsec - A parser combinator library for F#
mendeley-rMsync - Script to sync papers from Mendeley to reMarkable tablet
tidy-html5 - The granddaddy of HTML tools, with support for modern standards
koreader - An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices
rdrview - Firefox Reader View as a command line tool
reMarkableSync - An OneNote AddIn for importing digitized notes from the reMarkable tablet.