internetarchive vs szurubooru

internetarchive

A Python and Command-Line Interface to Archive.org (by jjjake)

Suggest topics

Source Code

Suggest alternative

Edit details

szurubooru

Image board engine, Danbooru-style. (by rr-)

image-board-engine Danbooru Python ES6

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

internetarchive		szurubooru
	Project
17	Mentions	17
1,519	Stars	643
-	Growth	-
8.3	Activity	5.0
3 days ago	Latest Commit	8 days ago
Python	Language	Python
GNU Affero General Public License v3.0	License	GNU General Public License v3.0 only

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

internetarchive

Posts with mentions or reviews of internetarchive. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-10.

Official CLI Tool for the Internet Archive
3 projects | news.ycombinator.com | 10 Oct 2023

https://github.com/jjjake/internetarchive/commit/952ace47e0e...
Me too, first commit was a bit more than 11 years ago.
What do you use to verify the hashes provided by Archive.org?
1 project | /r/DataHoarder | 8 Jul 2023

The --checksum switch of ia verifies the hashes.
Mass downloading from Archive.org...how?
1 project | /r/DHExchange | 23 Dec 2022
Using Python for Internet Archive Bulk Upload
1 project | /r/internetarchive | 3 Dec 2022

first, i've tried python and internetarchive scripts only on XP/Vista with the corresponding version for those OS, without success. I moved to linux, instead. While I have a Raspberry Pi (RPi), I tried first on a Virtual Machine, under Windows. I chose Debian (that's what I run on the RPi) but also had a go at FreeBSD. Both have packages (binaries) ready to go and worked flawlessly. From your post, you have enough skills to set up a virtual machine and install a mainstream linux distro, which is basically downloading an iso, mounting it on the VM, clicking next,next,next,ok,done. You then would boot into the desktop and open the CLI (command line interface). Installing internet archive and python is just a matter of copy pasting a couple of commands. On linux, the internet archive package is https://packages.debian.org/stable/utils/internetarchive and I find it easier than grabbing the binaries through cURL, setting up permissions and whatnot. same for python3. it'll do it's thing (grabs all the files it needs, installs, cleans, all automated, and when it's done you're back at the prompt ($ <-- you asked what this operator means in Python but I think you mean when it shows on the documentation; it's just a command prompt, like it would be on windows cmd, for example c:\archives\uploads> waiting for a command) and ready to throw commands. you first need to setup with your credentials. just ia configure it'll ask all it needs and you're ready to upload stuff. mass uploading different items s basically entering the same command for as many times as it's needed. ia does this for you, using a CSV file -- this involves a bit of pre-processing but when set and done it'll save you a lot of time and wait.
I'm using 'screen' for some background tasks on a headless RPi server and it doesn't show progress info. Works fine outside it.
1 project | /r/linuxquestions | 10 Oct 2022

More specifically i'm using ia internetarchive, and Putty 0.75 to log into the Pi. All is updated and outside a screen session works fine. When transfering files I get a progress bar, %, speed and timestamps. But when on a screen all I get it the name of the file being uploaded and nothing else. It only changes when one file finishes and moves to the next or when all is uploaded. No other progress info.
Top Python Coding Repos
6 projects | dev.to | 5 Sep 2022

requests - A simple, yet elegant, HTTP library. sanic - Next generation Python web server/framework | Build fast. Run fast. click - Python composable command line interface toolkit elasticsearch-dsl-py - High level Python client for Elasticsearch panel - A high-level app and dashboarding solution for Python internetarchive - A Python and Command-Line Interface to Archive.org coconut - Simple, elegant, Pythonic functional programming
It finally happened. Something I archived was erased from the Internet.
5 projects | /r/DataHoarder | 14 Jul 2022
Looking for some help in downloading a few thousand files from archive.org on ubuntu. wget is estimated to take 2 months... I figured I should ask the fellow data-hoarders!
6 projects | /r/DataHoarder | 4 Jun 2022
How can I mirror big folder from Archive.org
2 projects | /r/Roms | 2 May 2022

You can do that with the Internet Archive's Python client by jjjake: https://github.com/jjjake/internetarchive
Wii WBFS games?
1 project | /r/Roms | 12 Apr 2022

If you're comfortable with command line, you can use the internet archive python script to download stuff from archive.org ( https://github.com/jjjake/internetarchive )

szurubooru

Posts with mentions or reviews of szurubooru. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-06-26.

How to tag and store pictures downloaded from the internet?
1 project | news.ycombinator.com | 12 Jul 2023

I've been accumulating art and pictures downloaded from the internet, what's a good way to store and tag them with the original artist and source? Maybe a simple solution would be just adding appropriate EXIF data to the files? This approach would still require a good folder structure to make sense of. Another option I came across was https://hydrusnetwork.github.io/hydrus/index.html, but after trying it, the experience was really janky and not pleasant. Or I could even host an image booru so the collection would benefit others too, with https://github.com/rr-/szurubooru perhaps?
How do other people deal with it, I'm curious to hear.
Datahoarding image library organizer
1 project | /r/DataHoarder | 11 Jul 2023

also agree with hydrus network. im at 18million items. next best thing is probably something like a selfhosted booru server so that you can remotely organize files during your downtime on the toilet or jury duty.
I want to make a website with the format of danbooru for sharing and archiving images. How would I start going about that?
4 projects | /r/selfhosted | 26 Jun 2023

I'm your savior, together with https://github.com/rr-/szurubooru
code for creating a POST in Image Board instance "szurubooru"
3 projects | /r/selfhosted | 4 Apr 2023

There is also https://github.com/rr-/szurubooru/discussions
Smart Home and Homelab network diagram after 4 years of evolution
9 projects | /r/homelab | 14 Dec 2022

On unraid postgres is used by szurubooru, mealie, and grafana. I don't know specifically what but I know redis is also used by one of the apps in the cloud group as well.
Can't access Docker site (szurubooru) in LAN, only on the same machine
2 projects | /r/selfhosted | 11 Dec 2022

name: mybooruname #yes this next line is empty as i mention above, but the [INSTALL.md](https://github.com/rr-/szurubooru/blob/master/doc/INSTALL.md) says you can skip lines of which you want to use the defaults. or should i add something here? domain: secret: mysecretstring delete_source_files: no thumbnails: avatar_width: 300 avatar_height: 300 post_width: 300 post_height: 300 user_agent: max_dl_filesize: 25.0E+6 convert: gif: to_webm: false to_mp4: false allow_broken_uploads: false smtp: host: localhost port: 25 user: myusername pass: thisisapass from: noreply@localhost enable_safety: yes # the rest is just regexes for tags, pools, usernames, and passwords, and user rank privilege stuff
Using Rust as my Backend
8 projects | /r/rust | 2 Nov 2022

If you need tagging / users and stuff for the images, I’ve used https://github.com/rr-/szurubooru
Photofield v0.5 released: Google Photos alternative now even faster and with 100% more demo
7 projects | /r/selfhosted | 4 Sep 2022

Honestly, your project makes me want to contribute to I but I have absolutely no experience with Golang. I see a lot of potential in it and because it's comparatively new and barebones, there aren't any entrenched concepts like with more mature projects. Like, if you ever get around to implementing a more comprehensive organization system, do check out Szurubooru. Tags and tag categories can replace Albums, Faces, Places, Objects, Themes, Colors, EXIF data and so much more. You can still use tooling like AI recognition, geotagging, EXIF readers, and whatnot to populate them accordingly but being standardized as TAGS makes searching and crawling much easier. Not to mention defining custom tags would allow for a very versatile usage. And with a fluid browsing like yours, it will be a dream app.
Your top 5 best self hosted apps?
36 projects | /r/selfhosted | 22 Aug 2022

I can share something like that. There is this imageboard called (Szurubooru)[https://github.com/rr-/szurubooru]. People use such boards to host their anime and hentai stuff.
It finally happened. Something I archived was erased from the Internet.
5 projects | /r/DataHoarder | 14 Jul 2022

I kinda have something for Twitter accounts, said thing being this extension, but I stupidly don't use it enough. You see, I archive my posts using szurubooru, which is on a by-post basis, so everything has to be added one by one. (Technically, you can upload multiple at once but there's no function to add tags before upload, only after.)

What are some alternatives?

When comparing internetarchive and szurubooru you can also consider the following projects:

archiveOrgImageDownloader - A python script that will download pages from a borrowed book from the Internet Archive archive.org library and save them as images.

DeepDanbooru - AI based multi-label girl image classification system, implemented by using TensorFlow.

rfsh - RFSH: Run shell scripts in batch, concurrently, fully customized with variable .

hydrus - A personal booru-style media tagger that can import files and tags from your hard drive and popular websites. Content can be shared with other users via user-run servers.

wrolpi - Create your own off-grid library

shimmie2 - An easy-to-install community image gallery (aka booru)

WinPython - A free Python-distribution for Windows platform, including prebuilt packages for Scientific Python.

python - Official Python client library for kubernetes

SCrawler - 🏳️‍🌈 Media downloader from any sites, including Twitter, Reddit, Instagram, Threads, Facebook, OnlyFans, YouTube, Pinterest, PornHub, XHamster, XVIDEOS, ThisVid etc.

system-design-primer - Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

instaloader - Download pictures (or videos) along with their captions and other metadata from Instagram.

TheAlgorithms - All Algorithms implemented in Python