casync
datacurator-filetree
casync | datacurator-filetree | |
---|---|---|
17 | 36 | |
1,462 | 1,426 | |
0.4% | - | |
2.4 | 2.0 | |
4 months ago | 10 months ago | |
C | Makefile | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
casync
-
We reduced conda’s index fetch bandwidth by 99%
For arbitrary state changes however, it's better to use something like casync. Note that there are a lot of tunables, implicit and explicit; for package indexing I would particularly think about "how is the index sorted" and "what is the desired chunk size".
-
Intro to Content Defined Chunking
If you just want something practical to play with, see casync. Even if it doesn't fit your workflow, or if you think you can do better, chances are you're best off building on top of it or adding patches to it, not starting from scratch.
-
Tool to clone file structure without the large files themselves?
You probably want casync.
-
A Nibble of Content-Defined Chunking - How de-duplicated, incremental file transfer works
Obligatory link to casync, which implements this better than most alternatives.
-
LibSQL – a fork of SQLite that is both Open Source, and Open Contributions
(personally, I think more people need to be aware of casync for the update storage/distribution problem. It isn't perfect for every use case, but it's good enough that you're probably better off wrapping/forking it rather than reimplementing it badly from scratch)
-
improving download infra
Does something like casync (https://github.com/systemd/casync or https://github.com/folbricht/desync) serve any purpose or provide any advantage to propagating rpm changes over rsync?
-
Are there any true alternatives to Seafile? (Nextcloud is not an alternative in this context)
Software that comes to mind for syncing lots of small files: git (and other source versioning tools), casync (https://github.com/systemd/casync) and a go implementation (https://github.com/folbricht/desync). Not really an answer and I can't think of a way to shoehorn that into your workflow, but maybe it leads you down a useful road.
- Casync – A Content-Addressable Data Synchronization Tool
-
Hacker News top posts: Apr 23, 2022
Casync – A Content-Addressable Data Synchronization Tool\ (15 comments)
datacurator-filetree
-
How do you store interest-based content? Do I store that content in separate filetype folders or a single folder with sub-directories for each media type?
For the most part I follow this file tree. However when it comes to some of my intererests, like electronics, I am unsure if I should keep splitting these interest-based files by file type, for example:
-
Where should I put my product "mockups" folder
I have redesigned my entire computer to follow the datacurator methodology: https://github.com/roboyoshi/datacurator-filetree/tree/main/root
-
Share your folder structure
P.S. I've been lurking this sub and have considered this particular problem for a long time and have read maybe everything Karl, Nayuki, Reddit, and Hacker News have had to say on the subject. Running into this post is a treat. If tags don't work out for you roboyoshi and contributors have made a really nice unified file tree https://github.com/roboyoshi/datacurator-filetree
-
I have created an Automated Screenshot Sorting in bash that moves screenshots from a folder into named subfolders in the screenshot's folder of Roboyoshi`s Datacurator Filetree.
As always, credit to u/Roboyoshi for the Datacurator filetree.
- What is your folder tree in Google Drive looks like?
-
Dataset Organisation.. Need Inspiration!
But it will obviously depend on the use case. As example you have JohnnyDecimal or a more simple approach
-
Tool to clone file structure without the large files themselves?
This tool will be useful to generate repos like these and sharing them with friends without actually needing to share them TB of data.
-
Tried to combine a few posts i saw on here
back in the days I started with this structure tho: https://github.com/roboyoshi/datacurator-filetree
- Beste Methode(n) zum organisieren von Dateien ?
-
My organisation structure; feedback appreciated
This is a mix of this post and https://github.com/roboyoshi/datacurator-filetree. Im still having trouble with a few things:
What are some alternatives?
kopia - Cross-platform backup tool for Windows, macOS & Linux with fast, incremental backups, client-side end-to-end encryption, compression and data deduplication. CLI and GUI included.
filetags - Management of simple tags within file names
tarsnap - Command-line client code for Tarsnap.
czkawka - Multi functional app to find duplicates, empty folders, similar images etc.
desync - Alternative casync implementation
pyShelf - A simple terminal based ebook server
zstd - Zstandard - Fast real-time compression algorithm
album-splitter - Split single-file MP3 albums into separate tracks. Downloads from YouTube supported.
magic-trace - magic-trace collects and displays high-resolution traces of what a process is doing
appendfilename - Intelligent appending text to file names, considering file extensions and file tags
BorgBackup - Deduplicating archiver with compression and authenticated encryption.
koreader - An ebook reader application supporting PDF, DjVu, EPUB, FB2 and many more formats, running on Cervantes, Kindle, Kobo, PocketBook and Android devices