s3parcp
Faster than s3cp (by chanzuckerberg)
s5cmd
Parallel S3 and local filesystem execution tool. (by peak)
 | s3parcp | s5cmd
---|---|---
Mentions | 2 | 11
Stars | 37 | 2,324
Growth | - | 1.6%
Activity | 2.6 | 7.3
Last commit | about 2 years ago | about 2 months ago
Language | Go | Go
License | MIT License | MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
s3parcp
Posts with mentions or reviews of s3parcp.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2021-04-10.
- Downloading files from S3 with multithreading and Boto3
Yes, you would. The CPU and memory overhead of multiprocessing for this application is why we ended up migrating away from boto3 and to the AWS Go SDK for this specific purpose (https://github.com/chanzuckerberg/s3parcp as I mentioned in another comment). We still use boto3 in other areas, but for maxing out the network connection, golang is far more scalable.
s5cmd
Posts with mentions or reviews of s5cmd.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-01-15.
- GitHub issues from top Open Source Golang Repositories that you should contribute to
s5cmd - Extended character support for s3 compatible backend
- Migrate 5 TB S3 bucket from one AWS account to another
I've used a tool in the past called s5cmd to copy millions of objects, and it was strikingly fast: https://github.com/peak/s5cmd
- Those using AWS, have you ever tried to use AWS Transfer Family to transfer files into an S3 bucket? Can I use python to make these uploads, and if so how do I set it up in aws?
Some folks say https://github.com/peak/s5cmd is faster than the two options above.
- Gcloud storage: up to 94% faster data transfers for Cloud Storage
- Faster way to empty S3 buckets?
- A Dockerfile for Perl 5.36 / Alpine, with working SSL
RUN mkdir /tmp/output && cd /tmp/output
RUN wget --no-check-certificate https://github.com/peak/s5cmd/releases/download/v1.2.1/s5cmd_1.2.1_Linux-64bit.tar.gz
RUN tar xvzf s5cmd_1.2.1_Linux-64bit.tar.gz && mv s5cmd /usr/bin/s5cmd && rm -rf /tmp/output && rm s5cmd_1.2.1_Linux-64bit.tar.gz
- DataSync Vs AWS S3 sync?
Not that I’ve seen, but you might check out https://github.com/peak/s5cmd
- S3/100gbps question
I like to use https://github.com/peak/s5cmd
- Downloading files from S3 with multithreading and Boto3
Excellent walkthrough, love boto. We’ve recently been using s5cmd which we’ve found is ridiculously faster than boto without any extra boto tricks.
https://github.com/peak/s5cmd
- How to download millions of files from S3? (AWS CLI stops working after 1st million)