BaseCase-3
internetarchive-downloader
BaseCase-3 | internetarchive-downloader | |
---|---|---|
2 | 7 | |
2 | 119 | |
- | - | |
0.0 | 3.6 | |
almost 3 years ago | 4 months ago | |
Python | Python | |
GNU General Public License v3.0 only | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
BaseCase-3
- Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
-
What would be the best way to archive an archive.org account? This person has been uploading thousands of high quality rare vinyl rips with lossless high-resolution scans. I don't wanna lose it
Hi, I made a tool a little while ago to help me accomplish what I believe OP is asking for, here us the link: https://github.com/henry386/BaseCase-3
internetarchive-downloader
- Does anyone know how to download the images from borrow-only Internet Archive books?
-
Is there a way to download all files in the URLs list for an archived site?
this tool works well for what you're asking for. https://github.com/john-corcoran/internetarchive-downloader
- Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
-
How to view more than 25 results in an archive collection?
Another option to get all items in a collection that I used for a script I put together for Internet Archive downloads is the Internet Archive Python Library - official documentation on the relevant function is at https://archive.org/services/docs/api/internetarchive/quickstart.html#searching - and example of using it in code is around line 839 of https://github.com/john-corcoran/internetarchive-downloader/blob/61395ae4fbc826d9578678ed3299ada45d5ec3fd/ia_downloader.py
-
Pause Downloading of Collection From the Internet Archive?
Using the ‘-r’ flag with my Python script will allow resuming in-progress files, and if you run the script with the same command line arguments each time, you can pick up a collection where you left off - it’s at https://github.com/john-corcoran/internetarchive-downloader
-
Extracting all links from a webpage without html?
You may want to try this Python script I’ve finished recently for Internet Archive downloads: https://github.com/john-corcoran/internetarchive-downloader - collections should work fine if you pass it with the prefix ‘collection:’, e.g. ‘collection:nasa’ - if you want to give it a try, let me know if any questions!
-
What are the odds of the Internet Archive getting shut in the next 5 years and what will we do after it is shut?
I’ve made a Python script for this at https://github.com/john-corcoran/internetarchive-downloader which may assist?
What are some alternatives?
bin2txt - Software to convert text to and from binary, written as a string of 1s and 0s
distributed-wikipedia-mirror - Putting Wikipedia Snapshots on IPFS
holehe - holehe allows you to check if the mail is used on different sites like twitter, instagram and will retrieve information on sites with the forgotten password function.
archive-downloader - A downloader for archive.org
news-fetch - A Python Package which helps to scrape all news details from any news websites
GGet - Multithreaded download accelerator written in Go
youtube-cdl - 📼 Bulk youtube subscription download
pup - Parsing HTML at the command line
iadownloader - Auto-download files and collections from Internet Archive
internetarchive - A Python and Command-Line Interface to Archive.org
ipfs - Peer-to-peer hypermedia protocol