fpart
criu
fpart | criu | |
---|---|---|
5 | 14 | |
216 | 2,663 | |
- | 1.7% | |
7.9 | 8.9 | |
3 months ago | 9 days ago | |
C | C | |
BSD 2-clause "Simplified" License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
fpart
-
Rsync extremely slow on two ZFS local pools
Native rsync is terrible for lots of small file as it copies each file one by one sequentially. If you have lots of cores to work with, use the fpsync utility that comes with the fpart command to run parallel rsync's. You can easily saturate a 10Gb link with multiple rsync processes in parallel
-
Am I crazy to expect 100gbps across the pacific ocean?
You should probably use something like fpsync and multiple rsync jobs to get the most bandwidth.
-
Advice on 100gbps WAN?
My favorite free solution is fpsync/fpart from https://github.com/martymac/fpart -- basically that is a highly optimized filesystem crawler and indexer that can spit out balanced lists of files to transfer to a waiting army of parallel rsync workers. Tools are provided to manage the rsync fleet. Combining fpsync/fpart with an army of parallel rsync workers is a great design pattern especially for HPC as you can farm the rsync workers out to compute nodes
-
zfs replication vs multithreaded rsync
I've migrated data from our Isilon to zfs hostA using the fpsync tool that comes with the fpart utility. I get reasonably good throughput from this. 15TB in 5 and 1/2 hours
- How to back up 100TB NAS to USB HDDs??
criu
-
When "letting it crash" is not enough
Checkpoint/Restore I feel is a bigger concept than just saving state. At the zeroth level it's a system that can correctly stop and serialize a running process (as criu https://github.com/checkpoint-restore/criu has shown is a huge pain in the ass to still not be perfect) in a way that can initiated from within the process itself.
The 1st level more-work-but-easier way to do this is to build or use a heavily constrained VM/language you run from within your main application that doesn't allow for most of the hard problems to even exist.
I can't find any ready-made tools to do this that I wouldn't consider an endeavor.
- CRIU – Checkpoint/restore Linux tasks
-
Live Switching Pods to another Node on Resource Limits
That being said the Checkpoint Restore In Userspace project has been around for a number of years and is the closest thing to what you are talking about: taking a linux process on one machine and moving it to another. It is messy but can be done in some cases. There are folks looking at how to integrate CRIU with k8s but it’s all research at this point.
- Criu: Checkpoint/Restore Functionality for Linux
- checkpoint-restore/criu: Checkpoint/Restore tool
- checkpoint-restore/criu: Linux Checkpoint/Restore tool
-
The intersection of shadow stacks and CRIU
I would love to make more use of CRIU. E.g. I considered to use CRIU for my Python preloaded logic (https://github.com/albertz/python-preloaded). Unfortunately, at that point in time, CRIU must be used with root access, which was not an option. However, I see that the PR was merged now, so maybe it works now? (https://github.com/checkpoint-restore/criu/pull/1930)
There is also DMTCP (https://github.com/dmtcp/dmtcp/) but this might have other problems for my use case.
My solution was to use a fork server instead, which works almost equally well. There are not really much downsides with this approach. And this is actually quite simple, and also quite cross-platform (except Windows).
-
Python Preloaded
CRIU currently needs root access for dump/restore. However, there is ongoing work to support a non-root option in https://github.com/checkpoint-restore/criu/pull/1930.
-
How-to "freeze" a process to disk?
There have been multiple checkpointing attempts over the years. Criu is the only one I know of that's still kicking. That's probably your best and only bet.
- I made a plugin to suspend games and apps similar to how consoles do (Deck Suspender)
What are some alternatives?
TDengine - TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
nyrna - Suspend games and applications.
pgBackRest - Reliable PostgreSQL Backup & Restore
FitM - FitM, the Fuzzer in the Middle, can fuzz client and server binaries at the same time using userspace snapshot-fuzzing and network emulation. It's fast and comparably easy to set up.
libarchive - Multi-format archive and compression library
Regshot-Advanced - This is a fork of Regshot (original found at https://sourceforge.net/projects/regshot/) with very enhanced functionality.
sanoid - These are policy-driven snapshot management and replication tools which use OpenZFS for underlying next-gen storage. (Btrfs support plans are shelved unless and until btrfs becomes reliable.)
DashLoader - Launch at the speed of light.
stm32-usart-uart-dma-rx-tx - STM32 examples for USART using DMA for efficient RX and TX transmission
nginx-link-function - It is a NGINX module that provides dynamic linking to your application in server context and call the function of your application in location directive
sha1 - SHA-1 Hashing
crun - A fast and lightweight fully featured OCI runtime and C library for running containers