multiversion-concurrency-control vs Folly

multiversion-concurrency-control

Implementation of multiversion concurrency control, Raft, Left Right concurrency Hashmaps and a multi consumer multi producer Ringbuffer, concurrent and parallel load-balanced loops, parallel actors implementation in Main.java, Actor2.java and a parallel interpreter (by samsquire)

Suggest topics

Source Code

Suggest alternative

Edit details

Folly

An open-source C++ library developed and used at Facebook. (by facebook)

Frameworks

Source Code

groups.google.com

Suggest alternative

Edit details

Our great sponsors

WorkOS - The modern identity platform for B2B SaaS

InfluxDB - Power Real-Time Data Analytics at Scale

SaaSHub - Software Alternatives and Reviews

Our great sponsors

multiversion-concurrency-control		Folly
	Project
19	Mentions	90
66	Stars	27,034
-	Growth	0.7%
7.3	Activity	9.8
3 months ago	Latest Commit	about 13 hours ago
Java	Language	C++
-	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

multiversion-concurrency-control

Posts with mentions or reviews of multiversion-concurrency-control. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.

Statelines - an idea for representing asynchronicity elegantly
1 project | /r/ProgrammingLanguages | 6 Jun 2023

The code is in this repository https://github.com/samsquire/multiversion-concurrency-control in MultiplexingThread.java and MultiplexProgramParser.java
CRDT-richtext: Rust implementation of Peritext and Fugue
17 projects | news.ycombinator.com | 18 May 2023

https://github.com/samsquire/multiversion-concurrency-contro...
And I implemented a 3 way text diff with myers algorithm based on https://blog.jcoglan.com/2017/02/12/the-myers-diff-algorithm...
https://github.com/samsquire/text-diff
I implemented an eventually consistent mesh protocol that uses timestamps to provide last write wins
A collection of lock-free data structures written in standard C++11
7 projects | news.ycombinator.com | 10 May 2023

I think I lean towards per-thread sharding instead of mutex based or lock free data structures except for lockfree ringbuffers.
You can get embarassingly parallel performance if you split your data by thread and aggregate periodically.
If you need a consistent view of your entire set of data, that is slow path with sharding.
In my experiments with multithreaded software I simulate a bank where many bankaccounts are randomly withdrawn from and deposited to. https://github.com/samsquire/multiversion-concurrency-contro...
I get 700 million requests per second due to the sharding of money over accounts.
How to get started?
3 projects | /r/ProgrammingLanguages | 25 Apr 2023
The “Build Your Own Database” book is finished
5 projects | news.ycombinator.com | 22 Apr 2023

If you want some sample code to implement MVCC, I implemented MVCC in multithreaded Java as a toy example
https://github.com/samsquire/multiversion-concurrency-contro...
First read TransactionC.java then read MVCC.java
Let's write a setjmp
4 projects | news.ycombinator.com | 12 Feb 2023
I wrote an unrolled switch statement in Java to simulate eager async/await across treads.
https://github.com/samsquire/multiversion-concurrency-contro...
The goal is that a compiler should generate this for you. This code is equivalent to the following:
```
   task1:
```
Structured Concurrency Definition
2 projects | news.ycombinator.com | 6 Feb 2023

https://doc.rust-lang.org/book/ch16-00-concurrency.html
I've been working on implementing Java async/await state machine with switch statements and a scheduling loop. If the user doesn't await the async task handle, then the task's returnvalue is never handled. This is similar to the Go problem with the go statement.
https://github.com/samsquire/multiversion-concurrency-contro...
If your async call returns a handle and
Are there any languages with transactions as a first-class concept?
4 projects | /r/ProgrammingLanguages | 20 Jan 2023
Small VMs and Coroutines
3 projects | news.ycombinator.com | 19 Jan 2023

yield value2++
https://github.com/samsquire/multiversion-concurrency-contro...
I am still working on allowing multiple coroutines to be in flight in parallel at the same time. At the moment the tasks share the same background thread.
I asked this stackoverflow question regarding C++ coroutines, as I wanted to use coroutines with a thread pool.
https://stackoverflow.com/questions/74520133/how-can-i-pass-...
Hctree is an experimental high-concurrency database back end for SQLite
5 projects | news.ycombinator.com | 18 Jan 2023

This is very interesting. Thank you for submitting this and thank you for working on this.
I am highly interested in parallelism and high concurrency. I implemented multiversion concurrency control in Java.
https://github.com/samsquire/multiversion-concurrency-contro...
I am curious how to handle replication with high concurrency. I'm not sure how you detect dangerous reads+writes to the same key (tuples/fields) across different replica machines. In other words, multiple master.
I am aware Google uses truetime and some form of timestamp ordering and detection of interfering timestamps. But I'm not sure how to replicate that.
I began working on an algorithm to synchronize database records, do a sort, then a hash for each row where hash(row) = hash(previous_row.hash + row.data)
Then do a binary search on hashes matching/not matching. This is a synchronization algorithm I'm designing that requires minimal data transfer but multiple round trips.
The binary search would check the end of the data set for hash(replica_a.row[last]) == hash(replica_b.row[last]) then split the hash list in half and check the middle item, this shall tell you which row and which columns are different.

Folly

Posts with mentions or reviews of Folly. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-29.

Ask HN: How bad is the xz hack?
1 project | news.ycombinator.com | 31 Mar 2024

https://github.com/facebook/folly/commit/b1391e1c57be71c1e2a...
Backdoor in upstream xz/liblzma leading to SSH server compromise
49 projects | news.ycombinator.com | 29 Mar 2024

https://github.com/facebook/folly/pull/2153
A lock-free ring-buffer with contiguous reservations (2019)
9 projects | news.ycombinator.com | 29 Feb 2024

To set a HP on Linux, Folly just does a relaxed load of the src pointer, release store of the HP, compiler-only barrier, and acquire load. (This prevents the compiler from reordering the 2nd load before the store, right? But to my understanding does not prevent a hypothetical CPU reordering of the 2nd load before the store, which seems potentially problematic!)
Then on the GC/reclaim side of things, after protected object pointers are stored, it does a more expensive barrier[0] before acquire-loading the HPs.
I'll admit, I am not confident I understand why this works. I mean, even on x86, loads can be reordered before earlier program-order stores. So it seems like the 2nd check on the protection side could be ineffective. (The non-Linux portable version just uses an atomic_thread_fence SeqCst on both sides, which seems more obviously correct.) And if they don't need the 2nd load on Linux, I'm unclear on why they do it.
[0]: https://github.com/facebook/folly/blob/main/folly/synchroniz...
(This uses either mprotect to force a TLB flush in process-relevant CPUs, or the newer Linux membarrier syscall if available.)
Appending to an std:string character-by-character: how does the capacity grow?
2 projects | news.ycombinator.com | 26 Oct 2023

folly provides functions to resize std::string & std::vector without initialization [0].
[0] https://github.com/facebook/folly/blob/3c8829785e3ce86cb821c...
Can anyone explain feedback of a HFT firm regarding implementation of SPSC lock-free ring-buffer queue?
1 project | /r/highfreqtrading | 12 Jul 2023

My implementation was quite similar to Boost's spsc_queue and Facebook's folly/ProducerConsumerQueue.h.
A Compressed Indexable Bitset
6 projects | news.ycombinator.com | 1 Jul 2023

> How is that relevant?
Roaring bitmaps and similar data structures get their speed from decoding together consecutive groups of elements, so if you do sequential decoding or decode a large fraction of the list you get excellent performance.
EF instead excels at random skipping, so if you visit a small fraction of the list you generally get better performance. This is why it works so well for inverted indexes, as generally the queries are very selective (otherwise why do you need an index?) and if you have good intersection algorithms you can skip a large fraction of documents.
I didn't follow the rest of your comment, select is what EF is good at, every other data structure needs a lot more scanning once you land on the right chunk. With BMI2 you can also use the PDEP instruction to accelerate the final select on a 64-bit block: https://github.com/facebook/folly/blob/main/folly/experiment...
Defer for Shell
1 project | news.ycombinator.com | 20 Jun 2023

C++ with folly's SCOPE_EXIT {} construct:
https://github.com/facebook/folly/blob/main/folly/ScopeGuard...
Is there any facebook/folly community for discussion and Q&A?
1 project | /r/cpp | 19 Jun 2023

Seems like github issues taking a long time to get any response: https://github.com/facebook/folly
How a Single Line of Code Made a 24-Core Server Slower Than a Laptop
4 projects | news.ycombinator.com | 17 Jun 2023

Can't speak for abseil and tbb, but in folly there are a few solutions for the common problem of sharing state between a writer that updates it very infrequently and concurrent readers that read it very frequently (typical use case is configs).
The most performant solutions are RCU (https://github.com/facebook/folly/blob/main/folly/synchroniz...) and hazard pointers (https://github.com/facebook/folly/blob/main/folly/synchroniz...), but they're not quite as easy to use as a shared_ptr [1].
Then there is simil-shared_ptr implemented with thread-local counters (https://github.com/facebook/folly/blob/main/folly/experiment...).
If you absolutely need a std::shared_ptr (which can be the case if you're working with pre-existing interfaces) there is CoreCachedSharedPtr (https://github.com/facebook/folly/blob/main/folly/concurrenc...), which uses an aliasing trick to transparently maintain per-core reference counts, and scales linearly, but it works only when acquiring the shared_ptr, any subsequent copies of that would still cause contention if passed around in threads.
[1] Google has a proposal to make a smart pointer based on RCU/hazptr, but I'm not a fan of it because generally RCU/hazptr guards need to be released in the same thread that acquired them, and hiding them in a freely movable object looks like a recipe for disaster to me, especially if paired with coroutines https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2020/p05...
Ask HN: What are some of the most elegant codebases in your favorite language?
37 projects | news.ycombinator.com | 17 Jun 2023

Not sure if it's still the case but about 6 years ago Facebook's folly C++ library was something I'd point to for my junior engineers to get a sense of "good" C++ https://github.com/facebook/folly

What are some alternatives?

When comparing multiversion-concurrency-control and Folly you can also consider the following projects:

electric - Local-first sync layer for web and mobile apps. Build reactive, realtime, local-first apps directly on Postgres.

abseil-cpp - Abseil Common Libraries (C++)

marisa-trie - MARISA: Matching Algorithm with Recursively Implemented StorAge

Boost - Super-project for modularized Boost

pybktree - Python BK-tree data structure to allow fast querying of "close" matches

Seastar - High performance server-side application framework

glibc - GNU Libc

parallel-hashmap - A family of header-only, very fast and memory-friendly hashmap and btree containers.

EASTL - Obsolete repo, please go to: https://github.com/electronicarts/EASTL

tree-flat - TreeFlat is the simplest way to build & traverse a pre-order Tree in Rust

OpenFrameworks - openFrameworks is a community-developed cross platform toolkit for creative coding in C++.

multiversion-concurrency-control vs electric Folly vs abseil-cpp multiversion-concurrency-control vs marisa-trie Folly vs Boost multiversion-concurrency-control vs pybktree Folly vs Seastar multiversion-concurrency-control vs glibc Folly vs parallel-hashmap multiversion-concurrency-control vs abseil-cpp Folly vs EASTL multiversion-concurrency-control vs tree-flat Folly vs OpenFrameworks

Compare multiversion-concurrency-control vs Folly and see what are their differences.

multiversion-concurrency-control

Folly

multiversion-concurrency-control

Folly

What are some alternatives?