hash-db vs multiversion-concurrency-control

hash-db

Experimental distributed pseudomultimodel keyvalue database (it uses python dictionaries) imitating dynamodb querying with join only SQL support, distributed joins and simple Cypher graph support and document storage (by samsquire)

Suggest topics

Source Code

Suggest alternative

Edit details

multiversion-concurrency-control

Implementation of multiversion concurrency control, Raft, Left Right concurrency Hashmaps and a multi consumer multi producer Ringbuffer, concurrent and parallel load-balanced loops, parallel actors implementation in Main.java, Actor2.java and a parallel interpreter (by samsquire)

Suggest topics

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

hash-db		multiversion-concurrency-control
	Project
5	Mentions	19
50	Stars	67
-	Growth	-
0.0	Activity	7.3
over 1 year ago	Latest Commit	4 months ago
Python	Language	Java
-	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

hash-db

Posts with mentions or reviews of hash-db. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.

CRDT-richtext: Rust implementation of Peritext and Fugue
17 projects | news.ycombinator.com | 18 May 2023

https://github.com/samsquire/hash-db
I need to combine the ideas in each of these projects into a cohesive solution.
I did some work on trying to implement the YATA algorithm, poorly.
Ask HN: How do you test SQL?
18 projects | news.ycombinator.com | 31 Jan 2023

From an SQL database implementation perspective, in my toy Python barebones SQL database that barely supports inner joins (https://github.com/samsquire/hash-db) I tested by testing on postgresql and seeing if my query with two joins produces the same results.
I ought to produce unit tests that prove that tuples from each join operation produces the correct dataset.
For a user perspective, I guess you could write some tooling that loads example data into a database and does an incremental join with each part of the join statement added.
Bullshit Graph Database Performance Benchmarks
10 projects | news.ycombinator.com | 12 Jan 2023

I wrote a toy dynamodb, SQL, Cypher graph and document storage database engine in Python for the learning.
https://github.com/samsquire/hash-db
Experimental distributed keyvalue database (it uses python dictionaries) imitating dynamodb querying with join only SQL support, distributed joins and simple Cypher graph support
1 project | /r/databasedevelopment | 8 Aug 2022
How necessary are the programming fundamentals?
2 projects | news.ycombinator.com | 19 May 2021

I am interested in database internals. Btrees come up with regard to designing database systems that are efficient to query on disk. Postgres uses them for its indexes. Radix trees are memory efficient tries which are useful for answering prefix queries. They're also called prefix trees. I use them to get a list of prefixes of a string. Useful for simple intellisense style forms or dynamodb style querying. I've also been studying LSM trees which are used in Leveldb and RocksDB.
I experiment with database technology in my experimental project hash-db https://github.com/samsquire/hash-db The code should be readable.
I need to change my search tree to be self balancing currently it grows to the left or right without balancing. I think I need to use tree rotation depending on which branch has the highest height.

multiversion-concurrency-control

Posts with mentions or reviews of multiversion-concurrency-control. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-05-18.

Statelines - an idea for representing asynchronicity elegantly
1 project | /r/ProgrammingLanguages | 6 Jun 2023

The code is in this repository https://github.com/samsquire/multiversion-concurrency-control in MultiplexingThread.java and MultiplexProgramParser.java
CRDT-richtext: Rust implementation of Peritext and Fugue
17 projects | news.ycombinator.com | 18 May 2023

https://github.com/samsquire/multiversion-concurrency-contro...
And I implemented a 3 way text diff with myers algorithm based on https://blog.jcoglan.com/2017/02/12/the-myers-diff-algorithm...
https://github.com/samsquire/text-diff
I implemented an eventually consistent mesh protocol that uses timestamps to provide last write wins
A collection of lock-free data structures written in standard C++11
7 projects | news.ycombinator.com | 10 May 2023

I think I lean towards per-thread sharding instead of mutex based or lock free data structures except for lockfree ringbuffers.
You can get embarassingly parallel performance if you split your data by thread and aggregate periodically.
If you need a consistent view of your entire set of data, that is slow path with sharding.
In my experiments with multithreaded software I simulate a bank where many bankaccounts are randomly withdrawn from and deposited to. https://github.com/samsquire/multiversion-concurrency-contro...
I get 700 million requests per second due to the sharding of money over accounts.
How to get started?
3 projects | /r/ProgrammingLanguages | 25 Apr 2023
The “Build Your Own Database” book is finished
5 projects | news.ycombinator.com | 22 Apr 2023

If you want some sample code to implement MVCC, I implemented MVCC in multithreaded Java as a toy example
https://github.com/samsquire/multiversion-concurrency-contro...
First read TransactionC.java then read MVCC.java
Let's write a setjmp
4 projects | news.ycombinator.com | 12 Feb 2023
I wrote an unrolled switch statement in Java to simulate eager async/await across treads.
https://github.com/samsquire/multiversion-concurrency-contro...
The goal is that a compiler should generate this for you. This code is equivalent to the following:
```
   task1:
```
Structured Concurrency Definition
2 projects | news.ycombinator.com | 6 Feb 2023

https://doc.rust-lang.org/book/ch16-00-concurrency.html
I've been working on implementing Java async/await state machine with switch statements and a scheduling loop. If the user doesn't await the async task handle, then the task's returnvalue is never handled. This is similar to the Go problem with the go statement.
https://github.com/samsquire/multiversion-concurrency-contro...
If your async call returns a handle and
Are there any languages with transactions as a first-class concept?
4 projects | /r/ProgrammingLanguages | 20 Jan 2023
Small VMs and Coroutines
3 projects | news.ycombinator.com | 19 Jan 2023

yield value2++
https://github.com/samsquire/multiversion-concurrency-contro...
I am still working on allowing multiple coroutines to be in flight in parallel at the same time. At the moment the tasks share the same background thread.
I asked this stackoverflow question regarding C++ coroutines, as I wanted to use coroutines with a thread pool.
https://stackoverflow.com/questions/74520133/how-can-i-pass-...
Hctree is an experimental high-concurrency database back end for SQLite
5 projects | news.ycombinator.com | 18 Jan 2023

This is very interesting. Thank you for submitting this and thank you for working on this.
I am highly interested in parallelism and high concurrency. I implemented multiversion concurrency control in Java.
https://github.com/samsquire/multiversion-concurrency-contro...
I am curious how to handle replication with high concurrency. I'm not sure how you detect dangerous reads+writes to the same key (tuples/fields) across different replica machines. In other words, multiple master.
I am aware Google uses truetime and some form of timestamp ordering and detection of interfering timestamps. But I'm not sure how to replicate that.
I began working on an algorithm to synchronize database records, do a sort, then a hash for each row where hash(row) = hash(previous_row.hash + row.data)
Then do a binary search on hashes matching/not matching. This is a synchronization algorithm I'm designing that requires minimal data transfer but multiple round trips.
The binary search would check the end of the data set for hash(replica_a.row[last]) == hash(replica_b.row[last]) then split the hash list in half and check the middle item, this shall tell you which row and which columns are different.

What are some alternatives?

When comparing hash-db and multiversion-concurrency-control you can also consider the following projects:

electric - Local-first sync layer for web and mobile apps. Build reactive, realtime, local-first apps directly on Postgres.

kuzu - Embeddable property graph database management system built for query speed and scalability. Implements Cypher.

glibc - GNU Libc

dbt-unit-testing - This dbt package contains macros to support unit testing that can be (re)used across dbt projects.

tree-flat - TreeFlat is the simplest way to build & traverse a pre-order Tree in Rust

ustore - Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

marisa-trie - MARISA: Matching Algorithm with Recursively Implemented StorAge

pg_crdt - POC CRDT support in Postgres

pybktree - Python BK-tree data structure to allow fast querying of "close" matches

data-diff - Compare tables within or across databases

abseil-cpp - Abseil Common Libraries (C++)