iadownloader
Auto-download files and collections from Internet Archive (by rsvensson)
internetarchive-downloader
Simultaneous, resumable and hash-verified downloads from Internet Archive (archive.org) (by john-corcoran)
| | iadownloader | internetarchive-downloader |
|---|---|---|
| Mentions | 1 | 7 |
| Stars | 4 | 121 |
| Growth | - | - |
| Activity | 10.0 | 3.6 |
| Last commit | about 3 years ago | 4 months ago |
| Language | Python | Python |
| License | GNU General Public License v3.0 only | MIT License |
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
iadownloader
Posts with mentions or reviews of iadownloader.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-06-04.
-
Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
I've used this [https://github.com/rsvensson/iadownloader] for a similar use case, but you might also consider one of these...
internetarchive-downloader
Posts with mentions or reviews of internetarchive-downloader.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2022-06-04.
- Does anyone know how to download the images from borrow-only Internet Archive books?
-
Is there a way to download all files in the URLs list for an archived site?
this tool works well for what you're asking for. https://github.com/john-corcoran/internetarchive-downloader
- Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
-
How to view more than 25 results in an archive collection?
Another option to get all items in a collection, which I used in a script I put together for Internet Archive downloads, is the Internet Archive Python Library. Official documentation on the relevant function is at https://archive.org/services/docs/api/internetarchive/quickstart.html#searching and an example of using it in code is around line 839 of https://github.com/john-corcoran/internetarchive-downloader/blob/61395ae4fbc826d9578678ed3299ada45d5ec3fd/ia_downloader.py
-
Pause Downloading of Collection From the Internet Archive?
Using the ‘-r’ flag with my Python script will allow resuming in-progress files, and if you run the script with the same command line arguments each time, you can pick up a collection where you left off - it’s at https://github.com/john-corcoran/internetarchive-downloader
-
Extracting all links from a webpage without html?
You may want to try this Python script I’ve finished recently for Internet Archive downloads: https://github.com/john-corcoran/internetarchive-downloader - collections should work fine if you pass them with the prefix ‘collection:’, e.g. ‘collection:nasa’ - if you want to give it a try, let me know if you have any questions!
-
What are the odds of the Internet Archive getting shut in the next 5 years and what will we do after it is shut?
I’ve made a Python script for this at https://github.com/john-corcoran/internetarchive-downloader which may assist?
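The posts above mention listing every item in a collection through the Internet Archive Python Library and the ‘collection:’ query prefix. A minimal sketch of that idea follows. It assumes the `internetarchive` package is installed and uses its `search_items` function. The helper name `collection_identifiers` and its `search` parameter are hypothetical, added here only so the logic can be tried without network access. This is not the code from ia_downloader.py.

```python
from typing import Callable, Iterable, Iterator, Optional


def collection_identifiers(
    collection: str,
    search: Optional[Callable[[str], Iterable[dict]]] = None,
) -> Iterator[str]:
    """Yield the identifier of each item in an Internet Archive collection.

    `search` is a hypothetical injection point: any callable that takes a
    query string and returns dict-like results with an "identifier" key.
    When omitted, the real `internetarchive.search_items` is used.
    """
    if search is None:
        # Imported lazily so the helper can be exercised without the package.
        from internetarchive import search_items
        search = search_items

    # The "collection:" prefix restricts the search to one collection.
    for result in search(f"collection:{collection}"):
        yield result["identifier"]
```

Calling `list(collection_identifiers("nasa"))` would query archive.org for the items in the `nasa` collection. Large collections can take a while to enumerate, so iterating lazily rather than building a list may be preferable.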
What are some alternatives?
When comparing iadownloader and internetarchive-downloader you can also consider the following projects:
archive-downloader - A downloader for archive.org
distributed-wikipedia-mirror - Putting Wikipedia Snapshots on IPFS
BaseCase-3 - This is a Python Application that can be used to gather all files of a certain type from any archive.com repository
GGet - Multithreaded download accelerator written in Go
pup - Parsing HTML at the command line
internetarchive - A Python and Command-Line Interface to Archive.org
ipfs - Peer-to-peer hypermedia protocol