requests-cache
cachew
Our great sponsors
requests-cache | cachew | |
---|---|---|
7 | 1 | |
1,254 | 207 | |
1.9% | - | |
8.7 | 7.4 | |
6 days ago | 27 days ago | |
Python | Python | |
BSD 2-clause "Simplified" License | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
requests-cache
-
Web Scraping with Python: from Fundamentals to Practice
For anyone who goes with requests as your HTTP client, I would highly recommend adding requests-cache for a nice performance boost.
-
What does the process of web scraping actually look like?
The hardest part is actually running a web scraper at scale and that's where many people fail. We have all of the working pieces - we can find the products and parse the raw data. Time to scale it up! Best tip here is to start off with caching. Using caching libraries like requests-cache or whatever library equivalent will speed up process significantly.
- If I keep making URL requests in a forloop, is that harmful?
-
Requests-Cache – An easy way to get better performance with the python requests library
And would you be willing to add some example Terraform config? If you wouldn't mind making a PR for that, it could go under the /examples folder.
cachew
-
How I collect and use 50 sources of my personal data
Yep! In fact I've tried interoperating with Datasette (e.g. shared here https://news.ycombinator.com/item?id=25090643 )
One secret sauce is using 'automatic' caching of data in sqlite -- this allows both for faster access and having an additional interface for the data as a collateral https://github.com/karlicoss/cachew#readme
Still need to polish this a bit, but ultimately hoping to properly plug into Datasette, I'm impressed by its data exploration capabilities!
What are some alternatives?
aiohttp-client-cache - An async persistent cache for aiohttp requests
patina - Python adaptations of Rust's Result, Option, and HashMap types. Ready for Python 3.10 pattern matching!
requests - A simple, yet elegant, HTTP library.
dashboard
requests - A simple, yet elegant HTTP library. [Moved to: https://github.com/psf/requests]
RightToBeRemembered - A ław requiring services to enable auto experts of personal data
notionSnapshot - notion web scraper
typedload - Python library to load dynamically typed data into statically typed data structures
requests-html - Pythonic HTML Parsing for Humans™
docarray - Represent, send, store and search multimodal data
Uplink - A Declarative HTTP Client for Python
parsel-cli - cli for evaluating css and xpath selectors