marmot vs sqlite-utils

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

marmot		sqlite-utils
	Project
33	Mentions	35
1,628	Stars	1,510
-	Growth	-
8.6	Activity	8.1
3 months ago	Latest Commit	22 days ago
Go	Language	Python
MIT License	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

marmot

Posts with mentions or reviews of marmot. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-12-11.

Distributed SQLite: Paradigm shift or hype?
1 project | news.ycombinator.com | 9 Apr 2024

If you're willing to accept eventual consistency (a big ask, but acceptable in some scenarios) then there are options like marmot [1] that replicate cdc over nats.
[1]: https://github.com/maxpert/marmot
Marmot: Multi-writer distributed SQLite based on NATS
1 project | /r/hypeurls | 11 Dec 2023

4 projects | news.ycombinator.com | 11 Dec 2023
Why you should probably be using SQLite
8 projects | news.ycombinator.com | 27 Oct 2023
The Raft Consensus Algorithm
5 projects | news.ycombinator.com | 3 Sep 2023

I've written a whole SQLite replication system that works on top of RAFT ( https://github.com/maxpert/marmot ). Best part is RAFT has a well understood and strong library ecosystem as well. I started of with libraries and when I noticed I am reimplementing distributed streams, I just took off the shelf implementation (https://docs.nats.io/nats-concepts/jetstream) and embedded it in system. I love the simplicity and reasoning that comes with RAFT. However I am playing with epaxos these days (https://www.cs.cmu.edu/~dga/papers/epaxos-sosp2013.pdf), because then I can truly decentralize the implementation for truly masterless implementation. Right now I've added sharding mechanism on various streams so that in high load cases masters can be distributed across nodes too.
SQLedge: Replicate Postgres to SQLite on the Edge
9 projects | news.ycombinator.com | 9 Aug 2023

Very interesting! I have question ( out of my experience in https://github.com/maxpert/marmot ) how do get around the boot time, specially when a change log of table is pretty large in Postgres? I've implemented snapshotting mechanism in Marmot as part of quickly getting up to speed. At some level I wonder if we can just feed this PG replication log into NATS cluster and Marmot can just replicate it across the board.
Show HN: Blueprint for a distributed multi-region IAM with Go and CockroachDB
4 projects | news.ycombinator.com | 8 Aug 2023

One of the reasons I started writing Marmot (https://maxpert.github.io/marmot/) was for replicating bunch of tables across regions that were read heavy. I even used it for cache replication (because who cares if it’s a cache miss, but a hit will save me time and money). It’s hard to make such blue prints in early days of product, and by the time you hit a true growth almost everyone builds a custom solution for multi-region IAM.
Stalwart All-in-One Mail Server (IMAP, JMAP, SMTP)
4 projects | news.ycombinator.com | 18 Jul 2023

Amazing I was just looking for a good mail server to configure for my demo. Which reminds me since you folks have mentioned LiteStream, have you tried Marmot (https://github.com/maxpert/marmot); I recently configured Isso with Marmot to scale it out horizontally (https://maxpert.github.io/marmot/demo). I am super curious what kind of write workload on a sub thousand people organization will have and if Marmot can help scale it horizontally without Foundation DB. I always find the the convenience of SQLite amazing.
Marmot: A distributed SQLite replicator built on top of NATS
1 project | news.ycombinator.com | 5 Jul 2023
LiteFS Cloud: Distributed SQLite with Managed Backups
9 projects | news.ycombinator.com | 5 Jul 2023

Great that you brought it up. I will fill in the perspective of what I am doing for solving this in Marmot (https://github.com/maxpert/marmot). Today Marmot already records changes via installing triggers to record changes of a table, hence all the offline changes (while Marmot is not running) are never lost. Today when Marmot comes up after a long offline (depending upon max_log_size configuration), it realizes that and tries to catch up changes via restoring a snapshot and then applying rest of logs from NATS (JetStream) change logs. I am working on change that will be publishing those change logs to NATS before it restores snapshots, and once it reapplies those changes after restoring snapshot everyone will have your changes + your DB will be up to date. Now in this case one of the things that bothers people is the fact that if two nodes coming up with conflicting rows the last writer wins.
For that I am also exploring on SQLite-Y-CRDT (https://github.com/maxpert/sqlite-y-crdt) which can help me treat each row as document, and then try to merge them. I personally think CRDT gets harder to reason sometimes, and might not be explainable to an entry level developers. Usually when something is hard to reason and explain, I prefer sticking to simplicity. People IMO will be much more comfortable knowing they can't use auto incrementing IDs for particular tables (because two independent nodes can increment counter to same values) vs here is a magical way to merge that will mess up your data.

sqlite-utils

Posts with mentions or reviews of sqlite-utils. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-19.

Ask HN: High quality Python scripts or small libraries to learn from
12 projects | news.ycombinator.com | 19 Apr 2024

https://github.com/simonw/sqlite-utils
So, his code might not be a good place to find best patterns (for ex, I don't think they are fully typed), but his repos are very pragmatic, and his development process is super insightful (well documented PRs for personal repos!). Best part, he blogs about every non-trivial update, so you get all the context!
Why you should probably be using SQLite
8 projects | news.ycombinator.com | 27 Oct 2023

Sounds like your problem is with SQLAlchemy, not with SQLite.
My https://sqlite-utils.datasette.io library might be a better fit for you. It's a much thinner abstraction than SQLAlchemy.
Welcome to Datasette Cloud
6 projects | news.ycombinator.com | 20 Aug 2023

There are a few things you can do here.
SQLite is great at JSON - so I often dump JSON structures in a TEXT column and query them using https://www.sqlite.org/json1.html
I also have plugins for running jq() functions directly in SQL queries - https://datasette.io/plugins/datasette-jq and https://github.com/simonw/sqlite-utils-jq
I've been trying to drive the cost of turning semi-structured data into structured SQL queries down as much as possible with https://sqlite-utils.datasette.io - see this tutorial for more: https://datasette.io/tutorials/clean-data
This is also an area that I'm starting to explore with LLMs. I love the idea that you could take a bunch of messy data, tell Datasette Cloud "I want this imported into a table with this schema"... and it does that.
I have a prototype of this working now, I hope to turn it into an open source plugin (and Datasette Cloud feature) pretty soon. It's using this trick: https://til.simonwillison.net/gpt3/openai-python-functions-d...
SQLite Functions for Working with JSON
10 projects | news.ycombinator.com | 10 Aug 2023

I've baked a ton of different SQLite tricks - including things like full-text indexing support and advanced alter table methods - into my sqlite-utils CLI tool and Python library: https://sqlite-utils.datasette.io
My Datasette project provides tools for exploring, analyzing and publishing SQLite databases, plus ways to expose them via a JSON API: https://datasette.io
I've also written a ton of stuff about SQLite on my two blogs:
- https://simonwillison.net/tags/sqlite/
- https://til.simonwillison.net/sqlite
Show HN: Trogon – An automatic TUI for command line apps
11 projects | news.ycombinator.com | 21 May 2023

This is really fun. I have an experimental branch of my sqlite-utils CLI tool (which has dozens of sub-commands) running with this now and it really did only take 4 lines of code - I'm treating Trogon as an optional dependency because people using my package as a Python library rather than a CLI tool may not want the extra installed components:
https://github.com/simonw/sqlite-utils/commit/ec12b780d5dcd6...
There's an animated GIF demo of the result here: https://github.com/simonw/sqlite-utils/issues/545#issuecomme...
I'm sure I'm being stupid.. Copying data from an API and making a database
2 projects | /r/Database | 19 Jan 2023

My project https://datasette.io/ is ideal for this kind of thing. You can use https://sqlite-utils.datasette.io/ to load JSON data into a SQLite database, then publish it with Datasette.
Just: A Command Runner
27 projects | news.ycombinator.com | 9 Jan 2023

I've been using this for about six months now and I absolutely love it.
Make never stuck for me - I couldn't quite get it to fit inside my head.
Just has the exact set of features I want.
Here's one example of one of my Justfiles: https://github.com/simonw/sqlite-utils/blob/fc221f9b62ed8624... - documented here: https://sqlite-utils.datasette.io/en/stable/contributing.htm...
I also wrote about using Just with Django in this TIL: https://til.simonwillison.net/django/just-with-django
Ask HN: What Do You Use for a Personal Database
4 projects | news.ycombinator.com | 16 Nov 2022

SQLite with the open source toolchain I've been building over the past five years:
https://datasette.io as the interface for running queries against (and visualizing) my data.
https://sqlite-utils.datasette.io/ as a set of tools for creating and modifying my databases (inserting JSON or CSV data, enabling full text search text)
https://dogsheep.github.io as a suite of tools for importing my personal data - see also this talk I gave about that project: https://simonwillison.net/2020/Nov/14/personal-data-warehous...
The Perfect Commit
1 project | /r/programming | 30 Oct 2022

Here's an example: https://github.com/simonw/sqlite-utils/pull/468

3 projects | news.ycombinator.com | 29 Oct 2022

> After identifying about 7 commits (with pretty basic/useless messages, and no PR link!), I then had to find the corresponding PRs based on timestamps, and search the PR history for PRs merged around those timestamps.
Not sure if this would save any time, but it is possible to search PRs by commit. For example, say git blame led me to this commit: https://github.com/simonw/sqlite-utils/commit/129141572f249e...
I could have found PR #373 via this search: https://github.com/simonw/sqlite-utils/pulls?q=bb16f52681b6d...
> I thus treat PRs as ephemeral
I think I see what you're saying but as others have pointed out, sometimes you want to add screenshots etc to the context, and you can't capture this kind of info in commit messages. So then you have two choices: issues or PRs.
> Then any review comments are preferably not addressed directly in the PR
I would think that sometimes you really do want to have a back and forth conversation in the PR, rather than just a "make this change" -> "ok done" type of feedback loop.
I view the PR as an decent place for all of this because it's basically a commit of commits, capturing the related changes/conversation/context all in a single place at the point of merge.

What are some alternatives?

When comparing marmot and sqlite-utils you can also consider the following projects:

pocketbase - Open Source realtime backend in 1 file

sqlmodel - SQL databases in Python, designed for simplicity, compatibility, and robustness.

cr-sqlite - Convergent, Replicated SQLite. Multi-writer and CRDT support for SQLite

sqliteviz - Instant offline SQL-powered data visualisation in your browser

litefs - FUSE-based file system for replicating SQLite databases across a cluster of machines

ImportExcel - PowerShell module to import/export Excel spreadsheets, without Excel

wordpress-playground - Run WordPress in the browser via WebAssembly PHP

octosql - OctoSQL is a query tool that allows you to join, analyse and transform data from multiple databases and file formats using SQL.

mssql-changefeed

q - q - Run SQL directly on delimited files and multi-file sqlite databases

rqlite - The lightweight, distributed relational database built on SQLite.

Scoop - A command-line installer for Windows.

marmot vs pocketbase sqlite-utils vs sqlmodel marmot vs cr-sqlite sqlite-utils vs sqliteviz marmot vs litefs sqlite-utils vs ImportExcel marmot vs wordpress-playground sqlite-utils vs octosql marmot vs mssql-changefeed sqlite-utils vs q marmot vs rqlite sqlite-utils vs Scoop

Compare marmot vs sqlite-utils and see what are their differences.

marmot

sqlite-utils

marmot

sqlite-utils

What are some alternatives?