scrapeghost
duf
scrapeghost | duf | |
---|---|---|
10 | 26 | |
1,396 | 12,280 | |
- | - | |
8.2 | 2.9 | |
5 months ago | 2 months ago | |
Python | Go | |
GNU General Public License v3.0 or later | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
scrapeghost
-
Those of you who have developed product features using GPT4 API (or failed to do so), how did it go?
Not my project but an ex-colleague has been having some success in this direction: https://jamesturk.github.io/scrapeghost/
-
What are the best tools for web scraping and analysis of natural language to populate a dataset?
Yes, there is something like that available - ScrapeGhost.
- FLaNK Stack Weekly 3 April 2023
- Scraping Websites Using GPT
-
@TwitterDev Announces New Twitter API Tiers
With AI scraping, tools can be far more resilient than soon enough to minor dom changes. See - https://jamesturk.github.io/scrapeghost/.
-
Experimental library for scraping websites using OpenAI's GPT API
Their ToS mentions scraping but it pertains to scraping their frontend instead of using their API, which they don't want you to do.
Also - this library requests the HTML by itself [0] and ships it as a prompt but with preset system messages as the instruction [1].
[0] - https://github.com/jamesturk/scrapeghost/blob/main/src/scrap...
[1] - https://github.com/jamesturk/scrapeghost/blob/main/src/scrap...
- scrapeghost. Web scrape using gpt-4 (experimental)
duf
-
Go: What We Got Right, What We Got Wrong
Not sure these are really popular, but I cannot resist advertising a few utilities written in Go that I regularly use in my daily workflow:
- gdu: a NCDU clone, much faster on SSD mounts [1]
- duf: a `df` clone with a nicer interface [2]
- massren: a `vidir` clone (simpler to use but with fewer options) [3]
- gotop: a `top` clone [4]
- micro: a nice TUI editor [5]
Building this kind of tools in Go makes sense, as the executables are statically compiled and are thus easy to install on remote servers.
[1]: https://github.com/dundee/gdu
[2]: https://github.com/muesli/duf
[3]: https://github.com/laurent22/massren
[4]: https://github.com/xxxserxxx/gotop
[5]: https://github.com/zyedidia/micro
-
Clean mount lists in Linux
Somewhat related - `duf` is "a better `df` alternative":
https://github.com/muesli/duf
-
dysk, a better df
I'm normally using duf but this looks pretty neat.
- FLaNK Stack Weekly 3 April 2023
-
PPA or not to PPA
Otherwise the last option is to get the deb/appimage files from their official git repos or website, like for my use cases, MongoDB Compass (which was not officially maintained on flatpak) or duf (not available in Ubuntu repos)
-
What "nice-to-have" CLI tools do you know?
duf
-
What little CLI tools do you know, that do something useful and faster than regular commands? For example DUF.
What cool CLI tools do you know, that are do something faster than regular commands, and do something useful? For example: https://github.com/muesli/duf.
- Ncdu – NCurses Disk Usage
-
I wrote a "12 favourite terminal tools" list-article, what did I left out that should be absolutely included?
duf - Disk Usage/Free Utility - a better 'df' alternative.
- DUF - Linux “DU” clone, shows all the details about the Linux systems disks & storage
What are some alternatives?
autoscraper - A Smart, Automatic, Fast and Lightweight Web Scraper for Python
hacktoberfest-swag-list - Multiple companies go above and beyond for Hacktoberfest, and this repo tries to list them all.
tmx-solver - ThreatMetrix (anti-bot/fraud-detection) solver, deobfuscator & data harvester
gdu - Fast disk usage analyzer with console interface written in Go
wikipedia_ql - Query language for efficient data extraction from Wikipedia
rust-memchr - Optimized string search routines for Rust.
Bandwhich - Terminal bandwidth utilization tool
lakeFS - lakeFS - Data version control for your data lake | Git for data
bpytop - Linux/OSX/FreeBSD resource monitor
visx - 🐯 visx | visualization components
exiftool - ExifTool meta information reader/writer
QDirStat - QDirStat - Qt-based directory statistics (KDirStat without any KDE - from the original KDirStat author)