bdg-formats vs uvfs
| | bdg-formats | uvfs |
|---|---|---|
| Mentions | 1 | 3 |
| Stars | 38 | 5 |
| Growth | - | - |
| Activity | 5.4 | 0.0 |
| Last commit | 4 months ago | almost 2 years ago |
| Language | Shell | C++ |
| License | Apache License 2.0 | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
bdg-formats
- Advanced Scientific Data Format
We presented on using Parquet formats for bioinformatics around 2012/13 at the Bioinformatics Open Source Conference (BOSC) and got laughed out of the place.
While using Apache Spark for bioinformatics [0] never really took off, I still think Parquet formats for bioinformatics [1] are a good idea, especially with DuckDB, Apache Arrow, etc. supporting Parquet out of the box.
0 - https://github.com/bigdatagenomics/adam
1 - https://github.com/bigdatagenomics/bdg-formats
uvfs
- C++ Show and Tell - October 2022
Recently I needed an archive format whose contents could be mmap'd, with very fast random access to the contained files. Not sure if I managed, but in case it's useful to anyone: https://github.com/celtera/uvfs. Ideally I'd like to investigate how to serialize the hash map directly so that it could just be mapped too, instead of having to be recreated on load.
- Advanced Scientific Data Format
I had started a little bit of work towards that recently: https://github.com/celtera/uvfs
It's very optimized towards my specific needs but could be a basis for what you mention
- DwarFS: The SquashFS successor has arrived
Ended up biting the bullet and started https://github.com/celtera/uvfs
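The archive idea from the first uvfs mention above (one memory-mapped file plus a hash map for random access to its entries) can be sketched roughly as follows. This is an illustration only, not uvfs's actual on-disk format or API: the layout, `write_archive`, and `Archive` are hypothetical names invented for this example, and it is written in Python rather than C++ for brevity. As in the post, the hash map is rebuilt on load rather than mapped directly.

```python
import mmap
import os
import struct
import tempfile

# Hypothetical layout (NOT the real uvfs format):
#   [8-byte index offset][file payloads...][4-byte entry count][index entries]
# Each index entry: name length (uint16), name bytes, payload offset and size (uint64 each).

def write_archive(path, files):
    """Pack a {name: bytes} mapping into a single archive file."""
    index = []
    with open(path, "wb") as out:
        out.write(struct.pack("<Q", 0))  # placeholder, patched once the index offset is known
        for name, data in files.items():
            index.append((name.encode("utf-8"), out.tell(), len(data)))
            out.write(data)
        index_offset = out.tell()
        out.write(struct.pack("<I", len(index)))
        for name, offset, size in index:
            out.write(struct.pack("<H", len(name)))
            out.write(name)
            out.write(struct.pack("<QQ", offset, size))
        out.seek(0)
        out.write(struct.pack("<Q", index_offset))


class Archive:
    """Memory-map an archive and look entries up through an in-memory dict."""

    def __init__(self, path):
        self._file = open(path, "rb")
        self._map = mmap.mmap(self._file.fileno(), 0, access=mmap.ACCESS_READ)
        self._index = {}
        # The hash map is recreated on every load by walking the index entries.
        (pos,) = struct.unpack_from("<Q", self._map, 0)
        (count,) = struct.unpack_from("<I", self._map, pos)
        pos += 4
        for _ in range(count):
            (name_len,) = struct.unpack_from("<H", self._map, pos)
            pos += 2
            name = self._map[pos:pos + name_len].decode("utf-8")
            pos += name_len
            offset, size = struct.unpack_from("<QQ", self._map, pos)
            pos += 16
            self._index[name] = (offset, size)

    def read(self, name):
        """Return one entry's bytes (a copy, for simplicity) via a single dict lookup."""
        offset, size = self._index[name]
        return self._map[offset:offset + size]

    def close(self):
        self._map.close()
        self._file.close()


# Usage: build a small archive in a temporary directory and read one entry back.
with tempfile.TemporaryDirectory() as tmp:
    archive_path = os.path.join(tmp, "demo.bin")
    write_archive(archive_path, {"a.txt": b"hello", "b.bin": b"\x00\x01\x02"})
    archive = Archive(archive_path)
    data = archive.read("b.bin")
    archive.close()
```

Looking an entry up costs one dictionary access plus a slice of the mapping, so no other entry is touched. Storing the index in a form that can itself be used straight from the mapping, as the post suggests, would remove the rebuild loop in `__init__`.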
What are some alternatives?
asdf - ASDF (Advanced Scientific Data Format) is a next generation interchange format for scientific data
adam - ADAM is a genomics analysis platform with specialized file formats built using Apache Avro, Apache Spark, and Apache Parquet. Apache 2 licensed.
hwinfo - cross platform C++ library for hardware information (CPU, RAM, GPU, ...)
DataContainer
BFScript - A compiler backend paired with a proof of concept programming language that compiles to Brainfuck.
Pepper - PE32 (x86) and PE32+ (x64) binaries analysis tool, resources viewer/extractor.
CustomKeyboard - A swiss knife for myself - automotive development tools and a plenty of other things
asdf - Extendable version manager with support for Ruby, Node.js, Elixir, Erlang & more
simple_units - A C++20 header-only library for strongly typed units
event-bus - Event Bus utility