juicefs
hdfs
Our great sponsors
juicefs | hdfs | |
---|---|---|
42 | 3 | |
9,791 | 1,342 | |
2.8% | - | |
9.7 | 4.2 | |
about 23 hours ago | about 2 months ago | |
Go | Go | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
juicefs
-
South Korea's No.1 Search Engine Chose JuiceFS over Alluxio for AI Storage
Support for Kerberos keytab files
-
5 Open Source tools written in Golang that you should know about
JuiceFS under the Apache License 2.0, is a high-performance POSIX file system optimized for cloud-native environments. It stores data in Object Storage (e.g., Amazon S3) and metadata in databases like Redis, MySQL, or TiKV. JuiceFS integrates massive cloud storage with big data, machine learning, and AI applications efficiently, akin to local storage. It features full POSIX and Hadoop compatibility, S3 interface, Kubernetes support, and shared file storage for numerous clients. Some cool features are - strong consistency, scalable performance, data encryption, global file locks, and compression with LZ4 or Zstandard.
-
How to Build a Ceph Cluster and Integrate with the JuiceFS File System
To improve the handling process of capacity overrun, the JuiceFS client supports deletion operations in the case of Ceph cluster fullness (see related code changes in JuiceFS Community Edition). Therefore, for newer client versions, there is no need to use set-full-ratio for temporary adjustments.
-
A Deep Dive into the Design of Directory Quotas in JuiceFS
If you have any questions or would like to learn more, feel free to join discussions about JuiceFS on GitHub and the JuiceFS community on Slack.
- JuiceFS 1.1 - Distributed File System written in Go
-
Gcsfuse: A user-space file system for interacting with Google Cloud Storage
The architecture image shows GCS and others, so I suspect it does.
https://github.com/juicedata/juicefs#architecture
-
Google Cloud Storage FUSE
See also: JuiceFS: https://juicefs.com/
Adds a DBMS or key-value store for metadata, making the filesystem much faster (POSIX, small overwrites don't have to replace a full object in the GCS/S3 backend).
Almost certainly a better solution if you want to turn your object storage into a mountable filesystem, with the (big) caveat that you can't access the files directly in the bucket (they are not stored transparently).
- Using S3 as shared storage
-
s3fs-fuse VS juicefs - a user suggested alternative
2 projects | 19 Feb 2023
JuiceFS can do the same thing as s3fs-fuse, but better. Because it supports robust data consistency and caching policies to improve performance.
- JuiceFS: Turn Cloud Blob Storage into Local Posix Filesystems
hdfs
-
MIT 6.824 MapReduce: Having trouble in connecting Hadoop file system with Golang
I have completed the part for MapReduce in Go on a local machine with multiple processes. Now, I want to run it over different machines for which I'll need a distributed file system like Hadoop. I am using HDFS Client for making queries to HDFS, but I am not able to set the appropriate file path in Go for input files. It is giving me an error similar to this issue.
-
Hadoop with Golang?
I find Go and Hadoop combination good when it comes loading loading data to Hadoop. For example this library and client is way faster that the Java based hdfs cli. https://github.com/colinmarc/hdfs
-
Read / weite HDFS file using java or python sdk in go sdk
This is a library and a cli to connect to hdfs. I have used it from inside docker containers. https://github.com/colinmarc/hdfs
What are some alternatives?
cubefs - cloud-native file store
seaweedfs - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
goofys - a high-performance, POSIX-ish Amazon S3 file system written in Go
Seaweed File System - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. [Moved to: https://github.com/seaweedfs/seaweedfs]
s3-benchmark - Measure Amazon S3's performance from any location.
gotests - Automatically generate Go test boilerplate from your source code.
gcsfuse - A user-space file system for interacting with Google Cloud Storage
gossamr - Run Hadoop programs with Go
Golang-PDF-to-Image-Converter - This project will help you to convert PDF file to IMAGE using golang.
barkfetch - Alternative to neofetch, written in go
containers-roadmap - This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).
MapReduce