featurebase
A crazy fast analytical database, built on bitmaps. Perfect for ML applications. Learn more at: http://docs.featurebase.com/. Start a Docker instance: https://hub.docker.com/r/featurebasedb/featurebase (by FeatureBaseDB)
RoaringBitmap
A better compressed bitset in Java: used by Apache Spark, Netflix Atlas, Apache Pinot, Tablesaw, and many others (by RoaringBitmap)
Metric | featurebase | RoaringBitmap
---|---|---
Mentions | 13 | 24
Stars | 2,515 | 3,390
Growth | - | 0.9%
Activity | 0.0 | 8.5
Last commit | 7 months ago | 13 days ago
Language | Go | Java
License | Apache License 2.0 | Apache License 2.0
The number of mentions is the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
featurebase
Posts with mentions or reviews of featurebase.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-07-18.
- Feature store for real time inference
You can try https://www.featurebase.com/, they have an open source version but they are focusing on the cloud offering
- Roaring bitmaps are compressed bitmaps, can be 100x faster
Featurebase[1] is a database based on bitmaps (formerly known as Pilosa). I'm not sure if they actually use Roaring Bitmaps [2] or if they implemented it themselves. Either way, it's based on the idea of compressed bitmaps.
[1] https://github.com/FeatureBaseDB/featurebase
- FeatureBase v3.34.0: The fast Database and Analytics Engine
- FeatureBase v3.33.0 is Released with SQL B-tree Roaring Bitmaps
- FeatureBase v3.32.0 released B-tree Roaring Bitmaps SQL support
- FeatureBase v3.27.0 Is Released
- FeatureBase
- FeatureBase Community v1.3.0 is Released
- Bitmap Indexes in Go: Search Speed
- TIL What a Bitmap DB is, 2 months after getting hired to work on one
If this still sounds complex, or if you want to see what happens when some smart people make it even faster, then give it a try with FeatureBase.
RoaringBitmap
Posts with mentions or reviews of RoaringBitmap.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-07-18.
- Iterating over Bit Sets Quickly
I was recently reading about Roaring https://roaringbitmap.org/ which is a highly optimized compressed bitset implementation. I recommend reading about it if you are interested in this sort of thing. The talk at https://roaringbitmap.org/talks/ is especially good.
- Roaring Bitmaps
- Roaring bitmaps are compressed bitmaps, can be 100x faster
- What feature would you like to remove in C++26?
However, I would love compressed (not just packed) bitsets too, which is something different to me. I would make it another class with a similar interface, based on something like Roaring. It doesn't need to be in the standard, but it would be nice if the API were such that one could easily swap implementations.
- Jaccard Index
As an aside, if you find yourself having to compute them on the fly, know that the Roaring Bitmaps libraries are the way to go [1]. The bitmaps are compressed, and can be streamed directly into SIMD computations (batching XORs and popcnts 256 bits wide!). The Jaccard index is just intersection_len / union_len [2] away.
[1] https://roaringbitmap.org/
[2] https://roaringbitmap.readthedocs.io/en/latest/#roaringbitma...
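The intersection_len / union_len formula from the post above can be sketched in a few lines. This is only an illustration: it uses plain, uncompressed Python integers as bitmasks rather than a Roaring Bitmaps library, and the helper names `to_bitmask` and `jaccard` are made up for the example.

```python
def to_bitmask(items) -> int:
    """Encode a collection of small non-negative ints as a bitmask (bit i set = i present)."""
    mask = 0
    for i in items:
        mask |= 1 << i
    return mask


def jaccard(a: int, b: int) -> float:
    """Jaccard index of two bitmask-encoded sets: |A AND B| / |A OR B|."""
    union_len = bin(a | b).count("1")  # population count of the union
    if union_len == 0:
        # Convention chosen for this sketch: two empty sets count as identical.
        return 1.0
    intersection_len = bin(a & b).count("1")  # population count of the intersection
    return intersection_len / union_len


a = to_bitmask([1, 2, 3, 5, 8])
b = to_bitmask([2, 3, 5, 7])
print(jaccard(a, b))  # intersection {2, 3, 5}, union {1, 2, 3, 5, 7, 8} -> 0.5
```

A compressed bitmap library would do the same two population counts, only over compressed containers instead of one large integer.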
- Looking for fast, space-efficient key-lookup
Use a two-stage approach, with a bloom/cuckoo filter stored as a https://roaringbitmap.org/ in memory. Then a secondary key/value store on disk (bolt or anything else).
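A minimal sketch of the two-stage idea from the post above, under several assumptions: the Bloom filter here is backed by a plain `bytearray` rather than a Roaring bitmap, a Python `dict` stands in for the on-disk key/value store, and the class names, sizes, and hash scheme are all hypothetical choices for illustration.

```python
import hashlib


class BloomFilter:
    """In-memory Bloom filter: may report false positives, never false negatives."""

    def __init__(self, num_bits: int = 1 << 16, num_hashes: int = 4):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, key: str):
        # Derive several bit positions by salting a single hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: str) -> bool:
        return all(self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(key))


class TwoStageStore:
    """Stage 1: in-memory filter. Stage 2: slower key/value store (a dict stand-in here)."""

    def __init__(self):
        self.filter = BloomFilter()
        self.disk = {}       # assumption: stands in for an on-disk store
        self.disk_reads = 0  # counts how often stage 2 is actually consulted

    def put(self, key: str, value) -> None:
        self.filter.add(key)
        self.disk[key] = value

    def get(self, key: str):
        if not self.filter.might_contain(key):
            return None  # definitely absent, so the disk read is skipped
        self.disk_reads += 1
        return self.disk.get(key)  # may still be None on a filter false positive


store = TwoStageStore()
store.put("user:1", "alice")
print(store.get("user:1"))
print(store.get("user:999"))
```

The point of the design is that most lookups for absent keys are answered by the in-memory filter alone, so the slower store is only touched when a key is probably present.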
- BitSet Vs BigInteger
As an aside, if you're dealing with large bit sets, you might also want to evaluate Roaring Bitmaps.
- Negative Incentives in Academic Research
Sidetracking the conversation a bit: what a coincidence that the author (Lemire) is also represented in today's #1, "Ask HN: What are some cool but obscure data structures you know about?", as he is the main contributor to RoaringBitmap https://github.com/RoaringBitmap/RoaringBitmap and one of the main authors of the data structure.
- Ask HN: What are some 'cool' but obscure data structures you know about?
- Roaring bitmaps: A better compressed bitset