library vs mergerfs

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

library		mergerfs
	Project
16	Mentions	164
166	Stars	3,881
-	Growth	-
9.7	Activity	7.7
3 days ago	Latest Commit	22 days ago
Python	Language	C++
GNU General Public License v3.0 or later	License	GNU General Public License v3.0 or later

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

library

Posts with mentions or reviews of library. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-21.

Show HN: Find similar folders based on folder name, folder size, and count
1 project | news.ycombinator.com | 29 Apr 2024
Ask HN: Anyone looking for contributors for their open source projects
13 projects | news.ycombinator.com | 21 Mar 2024

Sure, I write small python CLI utils that help me solve media organization, media consumption, and sometimes data analysis. I use this every day on Linux and Android but I haven't tested it on other platforms. There are a lot of different subcommands and, although the CLI package will always be opinionated to some extent, there is a lot of niche functionality which might not need to exist. So I'm open to things being refactored or new subcommands being added. [1]
I have a lot of ideas for new ones, for example, I want a CLI that can take an artist name like "Theodor Kittelsen" and fetch highest quality public domain images--but I realize any implementation that does this well will be somewhat fragile so I haven't really attempted that yet. Other ideas that I have are often solved by piping output from one of my existing commands to another.
1. https://github.com/chapmanjacobd/library
Show HN: I built an open-source data copy tool called ingestr
3 projects | news.ycombinator.com | 27 Feb 2024

I used sqlite-utils to create a tool that can merge SQLITE files and split them:
https://github.com/chapmanjacobd/library?tab=readme-ov-file#...
CurlyQ: Command line helper for curl and web scraping
2 projects | news.ycombinator.com | 11 Jan 2024

I've created a few similar tools for link scraping: https://github.com/chapmanjacobd/library#usage
- library links-extract: extract inner links from pages (stdin, local files, or remote sites)
Show HN: Merge folders and simulate merging–count of conflicts, trumps, and new
2 projects | news.ycombinator.com | 7 Dec 2023
FileTrove: A file indexer
5 projects | /r/DataHoarder | 7 Dec 2023

okay https://github.com/chapmanjacobd/library
Sunday Daily Thread: What's everyone working on this week?
9 projects | /r/Python | 25 Jun 2023

I'm adding more unit tests: https://github.com/chapmanjacobd/library/commit/bd2e138897fdf41b8d8eade89bcdb34fee2b6abd
Show HN: Trogon – An automatic TUI for command line apps
11 projects | news.ycombinator.com | 21 May 2023

I would also[0] be interested in an argparse equivalent of this for my tool Library[1]
[0] https://github.com/Textualize/textual/discussions/228
[1] https://github.com/chapmanjacobd/library
After over 15 years of ripping and downloading, my music library just reached 20TB. AMA
4 projects | /r/DataHoarder | 26 Mar 2023

I have a little over 2 million songs as well and I wrote my own media management system to deal with it all. I save everything as Opus so the size is relatively small but still high-quality.
Those of you with 100TB+, what do you do for backups?
3 projects | /r/DataHoarder | 25 Feb 2023

Losing the data from the hard drive AND the internet is less likely than just one of those events happening. Recently I accidentally deleted 12 TB of media and I was able to redownload 80% of it using a script that I wrote. 20% of it I had to manually redownload but everything was still there.

mergerfs

Posts with mentions or reviews of mergerfs. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-10.

Mergerfs – A Featureful Union Filesystem
1 project | news.ycombinator.com | 26 Apr 2024
How do I use multiple hard drives on Kubuntu for steam?
2 projects | /r/Kubuntu | 10 Dec 2023

Have a look at mergerfs.
mergerfs v2.38.0 released
1 project | /r/mergerfs | 1 Nov 2023
Mergerfs and Snapraid installation
1 project | /r/DataHoarder | 10 Oct 2023

I am planning to use ubuntu server, and I would like to ask an advice: according to snapraid's download page and mergerfs' github page, it seems to be suggested to download directly their source instead of using ubuntu's package manager.
The Next Gen Database Servers Powering Let's Encrypt(2021)
5 projects | news.ycombinator.com | 17 Sep 2023

Like most people on r/homelab, it started out with Plex. Rough timeline/services below:
0. Got a Synology DS413 with 4x WD Red 3TB drives. Use Playstation Media Server to stream videos from it. Eventually find some Busybox stuff to add various functionality to the NAS, but it had a habit of undoing them periodically, which was frustrating. I also experienced my first and (knock on wood) only drive failure during this time, which concluded without fanfare once the faulty drive was replaced, and the array repaired itself.
1. While teaching self Python as an Electrical Distribution Engineer at a utility, I befriended the IT head, who gave me an ancient (I think Nehalem? Quad-core Xeon) Dell T310. Promptly got more drives, totaling 7, and tried various OS / NAS platforms. I had OpenMediaVault for a while, but got tired of the UI fighting me when I knew how to do things in shell, so I switched to Debian (which it's based on anyway). Moved to MergerFS [0] + SnapRAID [1] for storage management, and Plex for media. I was also tinkering with various Linux stuff on it constantly.
1.1 Got tired of my tinkering breaking things and requiring troubleshooting/fixing (in retrospect, this provided excellent learning), so I installed Proxmox, reinstalled Debian, and made a golden image with everything set up as desired so I could easily revert.
1.2 A friend told me about Docker. I promptly moved Plex over to it, and probably around this time also got the *Arr Stack [2] going.
2. Got a Supermicro X9DRi-LN4F+ in a 2U chassis w/ 12x 3.5" bays. Got faster/bigger CPUs (E5-2680v2), more RAM, more drives, etc. Shifted container management to Docker Compose. Modded the BIOS to allow it to boot from a NVMe drive on a PCIe adapter.
2.1 Shifted to ZFS on Debian. Other than DKMS occasionally losing its mind during kernel upgrades, this worked well.
2.2 Forked [3] some [4] Packer/Ansible projects to suit my needs, made a VM for everything. NAS, Dev, Webserver, Docker host, etc. Other than outgrowing (IMO) MergerFS/SnapRAID, honestly at this point I could have easily stopped, and could to this day revert back to this setup. It was dead reliable and worked extremely well. IIRC I was also playing with Terraform at this time.
2.3 Successfully broke into tech (Associate SRE) as a mid-career shift, due largely (according to the hiring manager) to what I had done with my homelab. Hooray for hobbies paying off.
3. Got a single Dell R620. I think the idea was to install either pfSense or VyOS on it, but that never came to fruition. Networking was from a Unifi USG (their tiny router + firewall + switch) and 8-port switch, with some AC Pro APs.
4. Got two more R620s. Kubernetes all the things. Each one runs Proxmox in a 3-node cluster with two VMs - a control plane, and worker.
4.0.1 Perhaps worth noting here that I thoroughly tested my migration plan via spinning up some VMs in, IIRC, Digital Ocean that mimicked my home setup. I successfully ran it twice, which was good enough for me.
4.1 Played with Ceph via Rook, but a. disliked (and still to this day) running storage for everything out of K8s b. kept getting clock skew between nodes. Someone on Reddit mentioned it was my low-power C-state settings, but since that was saving me something like ~50 watts/node, I didn't want to deal with the higher power/heat. I landed on Longhorn [5] for cluster storage (i.e. anything that wasn't being handled by the ZFS pool), which was fine, but slow. SATA SSDs (used Intel enterprise drives with PLP, if you're wondering) over GBe aren't super fast, but they should be able to exceed 30 MBps.
4.1.1 Again, worth noting that I spent literally a week poring over every bit of Ceph documentation I could find, from the Red Hat stuff to random Wikis and blog posts. It's not something you just jump into, IMO, and most of the horror stories I read boiled down to "you didn't follow the recommended practices."
5. Got a newer Supermicro, an X11SSH-F, thinking that it would save power consumption over the older dual-socket I had for the NAS. It turned out to not make a big difference. For some reason I don't recall, I had a second X9DRi-LN4F+ mobo, so I sold the other one with the faster CPUs, bought some cheaper CPUs for the other one, and bought more drives for it. It's now a backup target that boots up daily to ingest ZFS snapshots. I have 100% on-site backups for everything. Important things (i.e. anything that I can't get from a torrent) are also off-site.
6. Got some Samsung PM863 NVMe SSDs mounted on PCIe adapters for the Dells, and set up Ceph, but this time handled by Proxmox. It's dead easy, and for whatever reason isn't troubled by the same clock skew issues as I had previously. Still in the process of shifting cluster storage from Longhorn, but I have been successfully using Ceph block storage as fast (1 GBe, anyway - a 10G switch is on the horizon) storage for databases.
So specifically, you asked what I do with the hardware. What I do, as far as my family is concerned, is block ads and serve media. On a more useful level, I try things out related to my job, most recently database-related (I moved from SRE to DBRE a year ago). I have MySQL and Postgres running, and am constantly playing with them. Can you actually do a live buffer pool resize in MySQL? (yes) Is XFS actually faster than ext4 for large DROP TABLE operations? (yes, but not by much) Is it faster to shut down a MySQL server and roll back to a previous ZFS snapshot than to rollback a big transaction? (often yes, although obviously a full shutdown has its own problems) Does Postgres suffer from the same write performance issue as MySQL with random PKs like UUIDv4, despite not clustering by default? (yes, but not to the same extent - still enough to matter, and you should use UUIDv7 if you absolutely need them)
I legitimately love this stuff. I could quite easily make do without a fancy enclosed rack and multiple servers, but I like them, so I have them. The fact that it tends to help my professional growth out at the same time is a bonus.
[0]: https://github.com/trapexit/mergerfs
[1]: https://www.snapraid.it
[2]: https://wiki.servarr.com
[3]: https://github.com/stephanGarland/packer-proxmox-templates
[4]: https://github.com/stephanGarland/ansible-initial-server
[5]: https://longhorn.io
EXT4 corrupted on a Seagate Drive several times. Any help appreciated
1 project | /r/DataHoarder | 12 Jul 2023
Merge/Raid HDD documentation
2 projects | /r/CasaOS | 29 Jun 2023

it seems similar to mergerfs https://github.com/trapexit/mergerfs . I havent gone through any code to verify but this is what it seems like
Can Rclone be reliably used as a R/W cache or is there something better suited to that task?
1 project | /r/rclone | 28 Jun 2023

Something else to try is mergerfs, https://github.com/trapexit/mergerfs
Drive Spin Up
1 project | /r/mergerfs | 25 Jun 2023
Looking for a solution to merge storage accross WAN
2 projects | /r/selfhosted | 28 May 2023

I use mergerfs for my Google drive, Dropbox and local drives to appear as a single folder structure on my server so my plex doesn't require multiple mappings.

What are some alternatives?

When comparing library and mergerfs you can also consider the following projects:

ffmpy - Pythonic interface for FFmpeg/FFprobe command line

OpenMediaVault - openmediavault is the next generation network attached storage (NAS) solution based on Debian Linux. Thanks to the modular design of the framework it can be enhanced via plugins. openmediavault is primarily designed to be used in home environments or small home offices.

yark - YouTube archiving made simple.

Greyhole - Greyhole uses Samba to create a storage pool of all your available hard drives, and allows you to create redundant copies of the files you store.

keenwrite-themes - Document typesetting configurations using ConTeXt

mergerfs-tools - Optional tools to help manage data in a mergerfs pool

cookwherever - Cook Wherever is an open source project to attempt to making cooking more accessible and engaging for everyone.

Seaweed File System - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. [Moved to: https://github.com/seaweedfs/seaweedfs]

squid-dl - a massively parallel yt-dlp-based YouTube downloader

chia-plotter-deployment - A Bunch of Scripts to setup a Chia Farm. Focusing on, but not limited to, the MadMax Plotter, and HPool.

jExifToolGUI - jExifToolGUI is a multi-platform java/Swing graphical frontend for the excellent command-line ExifTool application by Phil Harvey

cloudplow - Automatic rclone remote uploader, with support for multiple remote/folder pairings. UnionFS Cleaner functionality: Deletion of UnionFS whiteout files and their corresponding files on rclone remotes. Automatic remote syncer: Sync between different remotes via a Scaleway server instance, that is created and destroyed at every sync.

library vs ffmpy mergerfs vs OpenMediaVault library vs yark mergerfs vs Greyhole library vs keenwrite-themes mergerfs vs mergerfs-tools library vs cookwherever mergerfs vs Seaweed File System library vs squid-dl mergerfs vs chia-plotter-deployment library vs jExifToolGUI mergerfs vs cloudplow

Compare library vs mergerfs and see what are their differences.

library

mergerfs

library

mergerfs

What are some alternatives?