go-readability
rod
Metric | go-readability | rod
---|---|---
Mentions | 4 | 20
Stars | 649 | 4,808
Growth | 3.1% | 2.8%
Activity | 4.2 | 7.9
Last commit | 6 days ago | 3 days ago
Language | HTML | Go
License | MIT License | MIT License
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
go-readability
-
Ask HN: Full-text browser history search forever?
I've had a lot of success by running HTML pages through Mozilla's readability[0] tool (actually the Go port of it[1]) before indexing them.
[0]: https://github.com/mozilla/readability
[1]: https://github.com/go-shiori/go-readability
-
Which library/project do you wish was ported to golang?
https://github.com/go-shiori/go-readability https://github.com/mauidude/go-readability
-
Show HN: Forlater.email – an email-based bookmarking service
I'm using https://github.com/go-shiori/go-readability -- a Go re-implementation of Mozilla's readability-js library. It does a pretty good job.
-
Show HN: Hackernews_tui – A Terminal UI to Browse Hacker News Discussions
Two projects that do this with nearly identical output:
- https://github.com/eafer/rdrview
- https://github.com/go-shiori/go-readability
Pipe the filtered HTML output into your favorite textual web browser for an ideal reading experience.
rod
-
Need help authenticating to Okta programmatically.
I have tried the following. 1. Logging in to Okta through a browser programmatically using go-rod. This works, but Slack then fails to load and stays stuck on its loading screen. 2. Authenticating via the Okta REST API. So far I have managed to authenticate using {{domain}}/api/v1/authn and then complete MFA via the verify endpoint {{domain}}/api/v1/authn/factors/{{factorID}}/verify, which returns a sessionToken. From there I can successfully create a sessionCookie, which has proven quite useless to me. Perhaps I am doing it wrong.
- Library to convert HTML to pdf in Golang
- Web scraping with Go
- Best option for browser automation
-
I’m messed up with Go libraries
I usually find libraries by googling them or by searching awesome-go on GitHub. For Selenium/Puppeteer-style automation I've always found go-rod useful and easy in every way. Sometimes I also google "X in Nodejs for Golang".
-
Go for web scraping
I recently tried out https://github.com/go-rod/rod. I think it's based on chromedp (so, Chrome dev tools and headless browser) but it also has code to download and run a supported version of Chrome that doesn't interfere with your local browser.
- Reducing web scraping time with concurrency - Go
-
VHS: CLI Home Video Recorder
One of the dependencies is `rod`[0], which is a web scraping/automation library, and I believe requires a browser to work. I don't know what they're using it for though as I haven't looked at the code (and I'm not familiar with Go anyways).
[0]: https://github.com/go-rod/rod
-
Thoughts on Go headless browser tools for testing and scraping?
I don't have personal experience, but https://github.com/go-rod/rod is far more active than chromedp
- Project with a Web scraper GO binary
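The two-step REST flow described in the Okta post above (primary authentication at /api/v1/authn, then MFA at /api/v1/authn/factors/{{factorID}}/verify) could be sketched roughly as follows. This is a minimal sketch, not a confirmed working client: the JSON field names (`stateToken`, `passCode`, `sessionToken`, `_embedded.factors`) reflect my understanding of Okta's Authentication API and should be checked against its documentation, and `fakeOkta`, `postJSON`, and `sessionToken` are hypothetical helpers introduced only so the example runs offline.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"net/http/httptest"
)

// authnResponse holds only the reply fields this sketch reads (assumed names).
type authnResponse struct {
	Status       string `json:"status"`
	StateToken   string `json:"stateToken"`
	SessionToken string `json:"sessionToken"`
	Embedded     struct {
		Factors []struct {
			ID string `json:"id"`
		} `json:"factors"`
	} `json:"_embedded"`
}

// postJSON sends body as JSON to url and decodes the JSON reply.
func postJSON(c *http.Client, url string, body map[string]string) (authnResponse, error) {
	var out authnResponse
	buf, err := json.Marshal(body)
	if err != nil {
		return out, err
	}
	resp, err := c.Post(url, "application/json", bytes.NewReader(buf))
	if err != nil {
		return out, err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return out, fmt.Errorf("POST %s: %s", url, resp.Status)
	}
	err = json.NewDecoder(resp.Body).Decode(&out)
	return out, err
}

// sessionToken runs primary authentication, then verifies the first MFA factor.
func sessionToken(c *http.Client, domain, user, pass, passCode string) (string, error) {
	first, err := postJSON(c, domain+"/api/v1/authn",
		map[string]string{"username": user, "password": pass})
	if err != nil {
		return "", err
	}
	if len(first.Embedded.Factors) == 0 {
		return "", fmt.Errorf("no MFA factors in response (status %q)", first.Status)
	}
	verifyURL := domain + "/api/v1/authn/factors/" + first.Embedded.Factors[0].ID + "/verify"
	second, err := postJSON(c, verifyURL,
		map[string]string{"stateToken": first.StateToken, "passCode": passCode})
	if err != nil {
		return "", err
	}
	return second.SessionToken, nil
}

// fakeOkta is a hypothetical stand-in server returning canned replies.
func fakeOkta() *httptest.Server {
	mux := http.NewServeMux()
	mux.HandleFunc("/api/v1/authn", func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, `{"status":"MFA_REQUIRED","stateToken":"st-1","_embedded":{"factors":[{"id":"f-1"}]}}`)
	})
	mux.HandleFunc("/api/v1/authn/factors/f-1/verify", func(w http.ResponseWriter, r *http.Request) {
		fmt.Fprint(w, `{"status":"SUCCESS","sessionToken":"sess-1"}`)
	})
	return httptest.NewServer(mux)
}

func main() {
	srv := fakeOkta()
	defer srv.Close()
	tok, err := sessionToken(srv.Client(), srv.URL, "user@example.com", "secret", "123456")
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(tok)
}
```

Against a real tenant, `domain` would be the Okta org URL and the credentials and pass code would come from the user; the canned server here only mirrors the shape of the exchange.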
What are some alternatives?
Readability4J - A Kotlin port of Mozilla’s Readability. It extracts a website’s relevant content and removes all clutter from it.
playwright-go - Playwright for Go, a browser automation library to control Chromium, Firefox and WebKit with a single API.
rdrview - Firefox Reader View as a command line tool
chromedp - A faster, simpler way to drive browsers supporting the Chrome DevTools Protocol.
hnrss - Custom, realtime RSS feeds for Hacker News
colly - Elegant Scraper and Crawler Framework for Golang
readability - A standalone version of the readability lib
WebDumper - A tool for scraping, dumping and unpacking (webpacked) javascript source files.
nb - CLI and local web plain text note‑taking, bookmarking, and archiving with linking, tagging, filtering, search, Git versioning & syncing, Pandoc conversion, + more, in a single portable script.
realize - Realize is the #1 Golang Task Runner, which enhances your workflow by automating the most common tasks and using the best performing Golang live reloading.
wayback-machine-downloader - Download an entire website from the Wayback Machine.
gotests - Automatically generate Go test boilerplate from your source code.