extfuse vs mindcastle.io

extfuse

Extension Framework for FUSE (by extfuse)

extfuse.github.io

mindcastle.io

Massively scalable, cloud-backed distributed block device for Linux and VMs (by jacobgorm)

Project	extfuse	mindcastle.io
Mentions	2	2
Stars	200	21
Growth	-	-
Activity	2.9	10.0
Latest Commit	about 1 year ago	over 1 year ago
Language	C	C
License	GNU Lesser General Public License v3.0 only	GNU General Public License v3.0 only

Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

extfuse

Posts with mentions or reviews of extfuse. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-06.

Gcsfuse: A user-space file system for interacting with Google Cloud Storage
15 projects | news.ycombinator.com | 6 Sep 2023

FUSE does not work well with a large number of small files, because they generate a high volume of metadata operations such as inode and dentry lookups.
ExtFUSE (FUSE optimized with eBPF) [1] can offer higher performance. It caches metadata in the kernel to avoid lookups in user space.
1. https://github.com/extfuse/extfuse
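
To illustrate the caching idea in the post above, here is a minimal sketch in plain Python. It is only an analogy: ExtFUSE itself runs eBPF code inside the kernel, and the class name MetadataCache, the slow_lookup callback, and the (parent inode, name) key are assumptions made up for this example, not part of ExtFUSE's API.

```python
from typing import Callable, Dict, Optional, Tuple

Key = Tuple[int, str]  # (parent inode number, entry name); hypothetical key layout


class MetadataCache:
    """Toy stand-in for an in-kernel lookup cache (not ExtFUSE's real API)."""

    def __init__(self, slow_lookup: Callable[[int, str], Optional[int]]):
        # slow_lookup models the expensive round trip to the user-space daemon.
        self._slow_lookup = slow_lookup
        self._entries: Dict[Key, int] = {}
        self.slow_calls = 0

    def lookup(self, parent: int, name: str) -> Optional[int]:
        key = (parent, name)
        if key in self._entries:
            # Fast path: answered from the cache, no user-space round trip.
            return self._entries[key]
        # Slow path: ask the user-space handler and remember a positive answer.
        self.slow_calls += 1
        inode = self._slow_lookup(parent, name)
        if inode is not None:
            self._entries[key] = inode
        return inode

    def invalidate(self, parent: int, name: str) -> None:
        """Drop an entry (e.g. after unlink or rename) so stale data is not served."""
        self._entries.pop((parent, name), None)
```

With many small files, repeated lookups of the same names hit the fast path, which is the effect the post attributes to caching metadata in the kernel.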
Google Cloud Storage FUSE
17 projects | news.ycombinator.com | 2 May 2023

mindcastle.io

Posts with mentions or reviews of mindcastle.io. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-09.

Prolly Trees
1 project | news.ycombinator.com | 9 Mar 2024

I don’t know who came first, but https://github.com/jacobgorm/mindcastle.io also uses the rsync/LBFS rolling-hash trick to split the tree data into chunks. I presented the idea at USENIX Vault 2019: https://m.youtube.com/watch?v=QgOkDiP0C4c&embeds_referring_e...
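
The rolling-hash trick mentioned above can be sketched as content-defined chunking: a hash over a small sliding window decides where a chunk ends, so boundaries depend on nearby content rather than on absolute position. The sketch below uses a simple polynomial rolling hash rather than the Rabin fingerprints used by LBFS, and the constants WINDOW, MASK, BASE, and MOD are illustrative assumptions, not values taken from mindcastle.io.

```python
from typing import List

WINDOW = 16          # bytes covered by the rolling hash (assumed value)
MASK = (1 << 6) - 1  # cut when the low 6 bits of the hash are zero (assumed value)
BASE = 257
MOD = (1 << 31) - 1


def chunk_boundaries(data: bytes) -> List[int]:
    """Return the end offset of each content-defined chunk in data."""
    top = pow(BASE, WINDOW - 1, MOD)  # weight of the byte that leaves the window
    h = 0
    cuts: List[int] = []
    for i, b in enumerate(data):
        if i >= WINDOW:
            h = (h - data[i - WINDOW] * top) % MOD  # slide: drop the oldest byte
        h = (h * BASE + b) % MOD                    # slide: add the newest byte
        if i + 1 >= WINDOW and (h & MASK) == 0:
            cuts.append(i + 1)
    if data and (not cuts or cuts[-1] != len(data)):
        cuts.append(len(data))                      # final chunk ends at end of data
    return cuts


def chunks(data: bytes) -> List[bytes]:
    """Split data at the boundaries chosen by the rolling hash."""
    out, start = [], 0
    for end in chunk_boundaries(data):
        out.append(data[start:end])
        start = end
    return out
```

Because each boundary depends only on the last WINDOW bytes, inserting a byte near the start of the data changes the first chunk but leaves the later chunks identical, which is what makes the chunks reusable across versions.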
Gcsfuse: A user-space file system for interacting with Google Cloud Storage
15 projects | news.ycombinator.com | 6 Sep 2023

This is not how you would want to do it for a typical ML workload, where the samples have to be randomly permuted each epoch.
Instead, tar up the files in some random order and put the tar file on a web server or in a bucket. Stream the samples in during the first epoch while keeping track of their byte offsets in the tar file, which you cache locally, assuming ample local flash storage. Then permute the list of offsets and use those when reading samples for the next epoch.
If you only have a local HDD, then you will need a more advanced data structure like the one provided by https://github.com/jacobgorm/mindcastle.io, which will allow you to write out permuted samples at close to the disk's sequential write bandwidth. See my talk at USENIX Vault 2019 for a full explanation, linked from https://vertigo.ai/mindcastle/
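
The offset-tracking approach described above could look roughly like the following sketch, assuming the tar file is uncompressed and has already been cached on local storage. The function names build_index and epoch are made up for this example; the sequential pass in build_index stands in for the first, streamed epoch.

```python
import random
import tarfile
from typing import BinaryIO, Iterator, List, Tuple

Index = List[Tuple[int, int]]  # (byte offset of the sample data, size in bytes)


def build_index(tar_file: BinaryIO) -> Index:
    """One sequential pass over an uncompressed tar, recording where each sample lives."""
    index: Index = []
    with tarfile.open(fileobj=tar_file, mode="r:") as tar:
        for member in tar:
            if member.isfile():
                index.append((member.offset_data, member.size))
    return index


def epoch(tar_file: BinaryIO, index: Index, rng: random.Random) -> Iterator[bytes]:
    """Yield every sample once, in a fresh random order, using direct seeks."""
    order = list(index)
    rng.shuffle(order)            # new permutation of the offsets for this epoch
    for offset, size in order:
        tar_file.seek(offset)     # random access: cheap on flash, slow on HDD
        yield tar_file.read(size)
```

Each call to epoch performs one random seek per sample, which is why the post assumes flash storage for this scheme and points to a different data structure for HDDs.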

What are some alternatives?

When comparing extfuse and mindcastle.io you can also consider the following projects:

geesefs - Finally, a good FUSE FS implementation over S3

seafowl - Analytical database for data-driven Web applications 🪶

gcs-fuse-csi-driver - The Google Cloud Storage FUSE Container Storage Interface (CSI) Plugin.

thumbhash - A very compact representation of an image placeholder

csi-gcs - Kubernetes CSI driver for Google Cloud Storage

azure-storage-fuse-aur - AUR package for Azure Storage Blobfuse

gcp-filestore-csi-driver - The Google Cloud Filestore Container Storage Interface (CSI) Plugin.

azurefs - Mount Microsoft Azure Blob Storage as local filesystem in Linux (inactive)

curve - Curve is a sandbox project hosted by the CNCF Foundation. It's cloud-native, high-performance, and easy to operate. Curve is an open-source distributed storage system for block and shared file storage.

gcsfuse - A user-space file system for interacting with Google Cloud Storage

rclone - "rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files

s3fs-fuse - FUSE-based file system backed by Amazon S3
