hdfs
juicefs
hdfs | juicefs | |
---|---|---|
3 | 43 | |
1,347 | 9,836 | |
- | 1.5% | |
3.6 | 9.8 | |
5 days ago | 1 day ago | |
Go | Go | |
MIT License | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
hdfs
-
MIT 6.824 MapReduce: Having trouble in connecting Hadoop file system with Golang
I have completed the part for MapReduce in Go on a local machine with multiple processes. Now, I want to run it over different machines for which I'll need a distributed file system like Hadoop. I am using HDFS Client for making queries to HDFS, but I am not able to set the appropriate file path in Go for input files. It is giving me an error similar to this issue.
-
Hadoop with Golang?
I find Go and Hadoop combination good when it comes loading loading data to Hadoop. For example this library and client is way faster that the Java based hdfs cli. https://github.com/colinmarc/hdfs
-
Read / weite HDFS file using java or python sdk in go sdk
This is a library and a cli to connect to hdfs. I have used it from inside docker containers. https://github.com/colinmarc/hdfs
juicefs
-
JuiceFS 1.2 Beta 1: Gateway Upgrade, Enhanced Multi-User Permission Management
Feel free to download and try JuiceFS 1.2-beta1 here. If you have any questions, join JuiceFS discussions on GitHub and our community on Slack.
-
South Korea's No.1 Search Engine Chose JuiceFS over Alluxio for AI Storage
Support for Kerberos keytab files
-
5 Open Source tools written in Golang that you should know about
JuiceFS under the Apache License 2.0, is a high-performance POSIX file system optimized for cloud-native environments. It stores data in Object Storage (e.g., Amazon S3) and metadata in databases like Redis, MySQL, or TiKV. JuiceFS integrates massive cloud storage with big data, machine learning, and AI applications efficiently, akin to local storage. It features full POSIX and Hadoop compatibility, S3 interface, Kubernetes support, and shared file storage for numerous clients. Some cool features are - strong consistency, scalable performance, data encryption, global file locks, and compression with LZ4 or Zstandard.
-
How to Build a Ceph Cluster and Integrate with the JuiceFS File System
To improve the handling process of capacity overrun, the JuiceFS client supports deletion operations in the case of Ceph cluster fullness (see related code changes in JuiceFS Community Edition). Therefore, for newer client versions, there is no need to use set-full-ratio for temporary adjustments.
-
A Deep Dive into the Design of Directory Quotas in JuiceFS
If you have any questions or would like to learn more, feel free to join discussions about JuiceFS on GitHub and the JuiceFS community on Slack.
- JuiceFS 1.1 - Distributed File System written in Go
-
Gcsfuse: A user-space file system for interacting with Google Cloud Storage
The architecture image shows GCS and others, so I suspect it does.
https://github.com/juicedata/juicefs#architecture
-
Google Cloud Storage FUSE
See also: JuiceFS: https://juicefs.com/
Adds a DBMS or key-value store for metadata, making the filesystem much faster (POSIX, small overwrites don't have to replace a full object in the GCS/S3 backend).
Almost certainly a better solution if you want to turn your object storage into a mountable filesystem, with the (big) caveat that you can't access the files directly in the bucket (they are not stored transparently).
- Using S3 as shared storage
-
s3fs-fuse VS juicefs - a user suggested alternative
2 projects | 19 Feb 2023
JuiceFS can do the same thing as s3fs-fuse, but better. Because it supports robust data consistency and caching policies to improve performance.
What are some alternatives?
seaweedfs - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding.
cubefs - cloud-native file store
Seaweed File System - SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC active-active replication, Kubernetes, POSIX FUSE mount, S3 API, S3 Gateway, Hadoop, WebDAV, encryption, Erasure Coding. [Moved to: https://github.com/seaweedfs/seaweedfs]
goofys - a high-performance, POSIX-ish Amazon S3 file system written in Go
gotests - Automatically generate Go test boilerplate from your source code.
s3-benchmark - Measure Amazon S3's performance from any location.
barkfetch - Alternative to neofetch, written in go
gcsfuse - A user-space file system for interacting with Google Cloud Storage
gossamr - Run Hadoop programs with Go
Golang-PDF-to-Image-Converter - This project will help you to convert PDF file to IMAGE using golang.
MapReduce
containers-roadmap - This is the public roadmap for AWS container services (ECS, ECR, Fargate, and EKS).