Percollate Alternatives
Similar projects and alternatives to percollate
-
koodo-reader
A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
-
SingleFile
Web Extension and CLI tool for saving a faithful copy of an entire web page in a single HTML file
-
BasicCrawler
Basic web crawler that automates website exploration and produces web resource trees.
-
monolith-of-web
A chrome extension to make a single static HTML file of the web page using a WebAssembly port of monolith CLI
-
Readability4J
A Kotlin port of Mozilla's Readability. It extracts a website's relevant content and removes all clutter from it.
-
ArchiveBox
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
-
keepassxc
KeePassXC is a cross-platform community-driven port of the Windows application “Keepass Password Safe”.
-
Bitwarden
The core infrastructure backend (API, database, Docker, etc). (by bitwarden)
-
BorgBackup
Deduplicating archiver with compression and authenticated encryption.
-
Wallabag
wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely.
percollate reviews and mentions
-
ArchiveBox Alternative
The CLI tool percollate offers a different approach, but is also very good: https://github.com/danburzo/percollate
-
Is there a command line program to convert web pages into readable markdown/htm/pdf format? preferably markdown
For PDF there is the well-known wkhtmltopdf, but let me say that I love the not-so-well-known percollate.
-
Show HN: Lurnby, a tool for better learning, is now open source
Since I'm working on a similar project, this is how I am planning to pull content from the web: using percollate[1] to get the HTML content. I haven't written any implementation for this in Python yet.
If you don't mind me asking, how were you going to implement spaced repetition? The Incremental Reading algorithm has never been published, as far as I know.
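The approach described in this mention, calling percollate from Python to obtain cleaned-up HTML, could be sketched roughly as follows. This is only an illustration, not the commenter's code: the helper names `build_percollate_command` and `fetch_readable_html` are hypothetical, and it assumes percollate is installed and on the PATH and that its `html` command with the `--output` option is used to write the result to a file.

```python
import shutil
import subprocess
from pathlib import Path


def build_percollate_command(url: str, output: Path) -> list[str]:
    """Build the argument list for `percollate html` (hypothetical helper)."""
    return ["percollate", "html", "--output", str(output), url]


def fetch_readable_html(url: str, output: Path) -> str:
    """Run percollate on a URL and return the generated HTML as text.

    Assumes the percollate CLI is installed and available on the PATH.
    """
    if shutil.which("percollate") is None:
        raise RuntimeError("percollate was not found on the PATH")
    # check=True raises CalledProcessError if percollate exits with an error.
    subprocess.run(build_percollate_command(url, output), check=True)
    return output.read_text(encoding="utf-8")
```

Keeping the command construction separate from the call that runs it makes the argument list easy to inspect without network access or a percollate install.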
-
What Are The Best Linux Apps?
-
Alternatives to ArchiveBox?
Maybe https://github.com/danburzo/percollate; I didn't try it, and I am not sure whether the HTML output looks like you want it.
-
Reading from the web offline and distraction-free
I do a lot of this work[3] (web to documents) and it's interesting to see other approaches. The medium image problem is something I've faced as well, but never got around to fixing. I'm planning to get a Remarkable soon, so will definitely be trying this out.
My personal solution has been https://github.com/captn3m0/url-to-epub/ (Node/readability), which I've tested against the entirety of Tor's original fiction collection[0] where it performs well enough (I'm biased). Another tool that does this beautifully well is percollate[1], but it doesn't give enough control of the metadata to the user - something I really care about.
I've also started to use rdrview[2], which is a C port of the current Firefox implementation of "reader view". It is very Unix-y, so it is easy to pipe content to it (I usually run it through tidy first). It is quite helpful for building web-archiving, web-to-PDF, or web-to-Kindle pipelines easily.
[0]: https://www.tor.com/category/all-fiction/original-fiction/
[1]: https://github.com/danburzo/percollate
-
A little npm head-scratcher
A JavaScript project I maintain has the following file structure, abridged:
Stats
danburzo/percollate is an open source project licensed under MIT License which is an OSI approved license.