Open-source projects categorized as NoSQL
Related topics: #Database #MongoDB #Driver #JSON #SQL

Top 23 NoSQL Open-Source Projects

  • GitHub repo RethinkDB

    The open-source database for the realtime web.

    Project mention: Backend Developer Cheatsheet | reddit.com/r/learnprogramming | 2021-03-21

    RethinkDB open source project. showcase it working live project with announcement blog Announcing RethinkDB 2.4.1 dated August 13 2020

  • GitHub repo MongoDB

    The MongoDB Database

    Project mention: Sails development with Docker and Docker Compose | dev.to | 2021-04-14

    [1] https://www.docker.com/resources/what-container [2] https://www.docker.com/why-docker [3] https://hub.docker.com/_/node [4] https://sailsjs.com/documentation/concepts/extending-sails/adapters/available-adapters [5] https://www.mongodb.com [6] https://hub.docker.com/_/mongo

  • GitHub repo RxDB

    🔄 A realtime Database for JavaScript Applications

    Project mention: The Web After Tomorrow (2015) | news.ycombinator.com | 2021-02-09

    You can mix this with a client-server replication and voila, you have a website that always shows the correct data at each point in time.


  • GitHub repo ArangoDB

    🥑 ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.

    Project mention: R/FANTASYPL CUP GROUPS LIVE! | reddit.com/r/FantasyPL | 2021-04-01

    Second reason, never never use www.arangodb.com. This database is terrible. It broke when I attempted to do something fancy to ensure there would never be downtime. Im back to a stable method.

  • GitHub repo Neo4j

    Graphs for Everyone

    Project mention: Looking for visual to show moves from group to group | reddit.com/r/PowerBI | 2021-04-13

    Unfortunately, I think you will need to roll up your sleeves and do some coding, e.g., in NetworkX. Alternatively, you can look into loading your data into a professional tool, e.g., Neo4J, that comes bundled with some viz components.

  • GitHub repo Scylla

    NoSQL data store using the seastar framework, compatible with Apache Cassandra

    Project mention: How to Efficiently Choose the Right Database for Your Applications | news.ycombinator.com | 2021-03-02

    Someone from ScyllaDB here (Technical Marketing Manager). Forgive, if you can, the length of my reply.

    AGPL was chosen to prevent people from taking the software and making it an -as-a-Service (-aaS) offering without contributing anything back to it. Which, if you look at other open source products, can cause them to wither in the vine as people reap the benefits without having to sustain and enhance the base code.

    We now have plenty of folks using our Scylla Open Source product across spaces from cybersecurity to IIoT. No one who is just using Scylla internally really needs to worry about AGPL. Though I do admit that many people are allergic to it for lawyerly reasons. But it's also helped prevent other not-so-fine people from utterly vulching the code.

    Scylla Open Source is often used under JanusGraph, which is the open source fork of TitanGraph now supported by the CNCF (folks familiar with the history know what happened to TitanGraph, so yes, your concerns are warranted). We use open source Prometheus and Grafana for our monitoring, rather than a proprietary offerings.

    We're also taking your first point seriously (long-term direction). We see ourselves as stewards of the software; we don't want to bottleneck or freeze out contributions. For example, open source contributor @Fastio began adding the Redis API into Scylla Open Source! I remember when I learned he was planning on doing it, beginning with a Redis on Seastar implementation called "Pedis." Now it's there in the open source code base. Pretty amazing work, and you have to just thank amazing contributors like that.



    Apache Cassandra is also an awesome project, and ScyllaDB definitely owes a lot of our success to the groundbreaking work done there. Anyone working on it gets nothing but big props from me.

    We therefore also want to ensure that what we do stays pretty much compatible with Cassandra (CQL v4, murmur3). Like the new Rust driver we wrote as part of our internal hackathon:


    While the rivalry with the Cassandra community remains pretty heated in some parts with some parties, you'll get none of that from me. Personally I just hope that end user developers just get better code, better features, better choices.

    In 2018, the head-to-head rivalry seemed pretty fierce. But now there are soooo many closed source CQL offerings out there: DataStax, Amazon Keyspaces, Azure CosmosDB, Scylla Enterprise (separate from our open source). There's also other open source offerings like Scylla Open Source and Yugabyte. Of all of those, we hope to show up as the "most open" of the competing offerings.

    Also as of 2021 Scylla has broadened who we can please (or, I suppose, be mad at us) by offering other APIs. We support a CQL interface for Cassandra compatibility, a DynamoDB-compatible API, and, still under development, the aforementioned Redis API.

    Each of those different NoSQL communities and constituencies bring high expectations for excellence, and their own high standards for what they want from an open source vendor. We definitely take their criticisms to heart.

    And yes, our DynamoDB implementation, Alternator, is fully 100% open source. You can totally run your workloads where you want. On premise, on any cloud, or even still on AWS. We take that aspect of open source very seriously. We could have made it simply an enterprise feature. But we opened it up.

    I know my title is "Marketing" and some people see that as a license to lie on behalf of a vendor, but I have never been more proud to see the open source commitment and contributions of any company I've worked for to date.

    Thanks for the mention and for reading this far. And best wishes to anyone working on hard big data problems these days, regardless of your database-of-choice.

  • GitHub repo Predis

    A flexible and feature-complete Redis client for PHP.

  • GitHub repo LiteDB

    LiteDB - A .NET NoSQL Document Store in a single data file - https://www.litedb.org

  • GitHub repo Zeppelin

    Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

    Project mention: Is there a way to collaborate in real-time for Jupyter Notebooks? | reddit.com/r/learnpython | 2021-03-21

    Check out Zeppelin. It's similar to Jupyter and allows real-time editing by multiple users. https://zeppelin.apache.org/

  • GitHub repo CouchDB

    Seamless multi-master syncing database with an intuitive HTTP/JSON API, designed for reliability

    Project mention: CouchDB and C#. EFCore-like queries! | dev.to | 2021-03-09

    Apache CouchDB is a distributed document database and its replication protocol is available from servers to mobile phones and web browsers.

  • GitHub repo OrientDB

    OrientDB is the most versatile DBMS supporting Graph, Document, Reactive, Full-Text, Geospatial and Key-Value models in one Multi-Model product. OrientDB can run distributed (Multi-Master), supports SQL, ACID Transactions, Full-Text indexing and Reactive Queries. OrientDB Community Edition is Open Source using a liberal Apache 2 license.

  • GitHub repo TinyDB

    TinyDB is a lightweight document oriented database optimized for your happiness :)

    Project mention: Why not use PostgreSQL? | reddit.com/r/PHP | 2021-04-09

    Things like https://sleekdb.github.io/ and https://github.com/msiemens/tinydb come to mind in terms of document storage, for instance.

  • GitHub repo Apache Solr

    Apache Lucene and Solr open-source search software

    Project mention: Implementing auto-complete functionality in Elasticsearch - Part I: Prefix queries | dev.to | 2021-04-17

    A technology that makes it easy to implement such features is Elasticsearch - a search and analytics engine built on top of Apache Lucene library. Elasticsearch has distributed, multi-tenant architecture with built-in routing and re-balancing, making it easy to scale. It's a widely used data store for storing, searching, and analyzing large volumes of data.

  • GitHub repo awesome-system-design

    A curated list of awesome System Design (A.K.A. Distributed Systems) resources.

    Project mention: Any review on AlgoExpert.io or educative.io? | reddit.com/r/developersIndia | 2021-03-23

    A Curated List pointing to in depth information on most topics (This is more of a reference, not a study material)

  • GitHub repo ObjectBox Java (Kotlin, Android)

    ObjectBox is a superfast lightweight database for objects

    Project mention: Persistent Local Database with ObjectBox on Flutter | dev.to | 2021-02-19

    There are a lot of persistent database solutions available for Flutter applications. One of them is ObjectBox which is a NoSQL-based high-performance-oriented database solution written in the native dart language. You can use local as well as server features for syncing data as you like.

  • GitHub repo Riak

    Riak is a decentralized datastore from Basho Technologies.

  • GitHub repo FlockDB

    A distributed, fault-tolerant graph database

  • GitHub repo RavenDB

    ACID Document Database

  • GitHub repo tiedot

    A rudimentary implementation of a basic document (NoSQL) database in Go

  • GitHub repo upper.io/db

    Data access layer for PostgreSQL, CockroachDB, MySQL, SQLite and MongoDB with ORM-like features.

  • GitHub repo octosql

    OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

    Project mention: Minimally Invasive (and More Accurate) Analytics: GoAccess and Athena/SQL | news.ycombinator.com | 2021-02-16

    With OctoSQL[0], as I wanted to see how people are using it, I literally just set up an http endpoint which received a JSON request on each CLI invocation (you can see the data sent in the code, it's open source) and appended it to an on-disk JSON file.

    Then I used... OctoSQL to analyze it!

    Nit: The project may seem dead for a few months, but I'm just in midst of a rewrite (on a branch) which gets rid of wrong decisions and makes it easier to embed in existing applications.


  • GitHub repo Hive

    Lightweight and blazing fast key-value database written in pure Dart. (by hivedb)

    Project mention: Trying out Moor for the first time (using .moor files) Building an offline first app. [LIVESTREAM] | reddit.com/r/FlutterDev | 2021-01-28

    Moor is a well-maintained ORM for sqlite3, a relational database management system. Hive is a fast but buggy key-value store. It's not an apple to apple comparison, they serve different purposes. Make the choice based on the type of data you want to store and how you need to access it.

  • GitHub repo gocql

    Package gocql implements a fast and robust Cassandra client for the Go programming language.

    Project mention: Web Development in Go: Middleware, Templating, Databases & Beyond | dev.to | 2021-01-27

    https://github.com/gocql/gocql (Cassandra driver).

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020). The latest post mention was on 2021-04-17.


What are some of the best open-source NoSQL projects? This list will help you:

Project Stars
1 RethinkDB 24,611
2 MongoDB 19,639
3 RxDB 15,454
4 ArangoDB 11,078
5 Neo4j 8,798
6 Scylla 6,760
7 Predis 6,747
8 LiteDB 5,671
9 Zeppelin 5,208
10 CouchDB 4,898
11 OrientDB 4,281
12 TinyDB 4,159
13 Apache Solr 4,128
14 awesome-system-design 3,887
15 ObjectBox Java (Kotlin, Android) 3,794
16 Riak 3,529
17 FlockDB 3,257
18 RavenDB 2,626
19 tiedot 2,582
20 upper.io/db 2,532
21 octosql 2,396
22 Hive 2,202
23 gocql 2,064