internetarchive

A Python and Command-Line Interface to Archive.org (by jjjake)

Internetarchive Alternatives

Similar projects and alternatives to internetarchive

  1. requests

    A simple, yet elegant, HTTP library.

  2. InfluxDB

    InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

    InfluxDB logo
  3. instaloader

    Download pictures (or videos) along with their captions and other metadata from Instagram.

  4. tubesync

    Syncs YouTube channels and playlists to a locally hosted media server

  5. WinPython

    A free Python-distribution for Windows platform, including prebuilt packages for Scientific Python.

  6. panel

    Panel: The powerful data exploration & web app framework for Python (by holoviz)

  7. wrolpi

    Create your own off-grid library

  8. Coconut

    Simple, elegant, Pythonic functional programming.

  9. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
  10. SCrawler

    25 internetarchive VS SCrawler

    🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, TikTok, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.

  11. szurubooru

    Image board engine, Danbooru-style.

  12. sanic

    17 internetarchive VS sanic

    Accelerate your web app development | Build fast. Run fast.

  13. rfsh

    RFSH: Run shell scripts in batch, concurrently, fully customized with variable .

  14. internetarchive-downloader

    Simultaneous, resumable and hash-verified downloads from Internet Archive (archive.org)

  15. archiveOrgImageDownloader

    Discontinued A python script that will download pages from a borrowed book from the Internet Archive archive.org library and save them as images.

  16. python-aria-mirror-bot

    A telegram bot for all your mirror needs | OG Repo

  17. GGet

    Multithreaded download accelerator written in Go

  18. elasticsearch-dsl-py

    High level Python client for Elasticsearch

  19. BaseCase-3

    Discontinued This is a Python Application that can be used to gather all files of a certain type from any archive.com repository

  20. archive-downloader

    A downloader for archive.org

  21. iamine

    Discontinued Internet Archive Data Mining Tools

  22. iadownloader

    Auto-download files and collections from Internet Archive

  23. SaaSHub

    SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

    SaaSHub logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better internetarchive alternative or higher similarity.

internetarchive discussion

Log in or Post with

internetarchive reviews and mentions

Posts with mentions or reviews of internetarchive. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-10.
  • Preppers Paradise: A collection of three DIY, prepping and tech libraries
    1 project | news.ycombinator.com | 28 May 2024
    https://help.archive.org/help/collections-a-basic-guide/

    https://help.archive.org/help/how-to-upload-files-to-create-...

    https://help.archive.org/help/uploading-tips/

    https://github.com/jjjake/internetarchive (for bulk uploading of items)

  • Official CLI Tool for the Internet Archive
    3 projects | news.ycombinator.com | 10 Oct 2023
    https://github.com/jjjake/internetarchive/commit/952ace47e0e...

    Me too, first commit was a bit more than 11 years ago.

  • What do you use to verify the hashes provided by Archive.org?
    1 project | /r/DataHoarder | 8 Jul 2023
    The --checksum switch of ia verifies the hashes.
  • Mass downloading from Archive.org...how?
    1 project | /r/DHExchange | 23 Dec 2022
  • Using Python for Internet Archive Bulk Upload
    1 project | /r/internetarchive | 3 Dec 2022
    first, i've tried python and internetarchive scripts only on XP/Vista with the corresponding version for those OS, without success. I moved to linux, instead. While I have a Raspberry Pi (RPi), I tried first on a Virtual Machine, under Windows. I chose Debian (that's what I run on the RPi) but also had a go at FreeBSD. Both have packages (binaries) ready to go and worked flawlessly. From your post, you have enough skills to set up a virtual machine and install a mainstream linux distro, which is basically downloading an iso, mounting it on the VM, clicking next,next,next,ok,done. You then would boot into the desktop and open the CLI (command line interface). Installing internet archive and python is just a matter of copy pasting a couple of commands. On linux, the internet archive package is https://packages.debian.org/stable/utils/internetarchive and I find it easier than grabbing the binaries through cURL, setting up permissions and whatnot. same for python3. it'll do it's thing (grabs all the files it needs, installs, cleans, all automated, and when it's done you're back at the prompt ($ <-- you asked what this operator means in Python but I think you mean when it shows on the documentation; it's just a command prompt, like it would be on windows cmd, for example c:\archives\uploads> waiting for a command) and ready to throw commands. you first need to setup with your credentials. just ia configure it'll ask all it needs and you're ready to upload stuff. mass uploading different items s basically entering the same command for as many times as it's needed. ia does this for you, using a CSV file -- this involves a bit of pre-processing but when set and done it'll save you a lot of time and wait.
  • I'm using 'screen' for some background tasks on a headless RPi server and it doesn't show progress info. Works fine outside it.
    1 project | /r/linuxquestions | 10 Oct 2022
    More specifically i'm using ia internetarchive, and Putty 0.75 to log into the Pi. All is updated and outside a screen session works fine. When transfering files I get a progress bar, %, speed and timestamps. But when on a screen all I get it the name of the file being uploaded and nothing else. It only changes when one file finishes and moves to the next or when all is uploaded. No other progress info.
  • Top Python Coding Repos
    6 projects | dev.to | 5 Sep 2022
    requests - A simple, yet elegant, HTTP library. sanic - Next generation Python web server/framework | Build fast. Run fast. click - Python composable command line interface toolkit elasticsearch-dsl-py - High level Python client for Elasticsearch panel - A high-level app and dashboarding solution for Python internetarchive - A Python and Command-Line Interface to Archive.org coconut - Simple, elegant, Pythonic functional programming
  • It finally happened. Something I archived was erased from the Internet.
    5 projects | /r/DataHoarder | 14 Jul 2022
  • Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
    6 projects | /r/DataHoarder | 4 Jun 2022
  • How can I mirror big folder from Archive.org
    2 projects | /r/Roms | 2 May 2022
    You can do that with the Internet Archive's Python client by jjjake: https://github.com/jjjake/internetarchive
  • A note from our sponsor - InfluxDB
    www.influxdata.com | 23 May 2025
    InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →

Stats

Basic internetarchive repo stats
18
1,698
8.9
9 days ago

Sponsored
InfluxDB – Built for High-Performance Time Series Workloads
InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
www.influxdata.com

Did you know that Python is
the 2nd most popular programming language
based on number of references?