go-mysql-server vs datafusion

go-mysql-server

A MySQL-compatible relational database with a storage agnostic query engine. Implemented in pure Go. (by dolthub)

Source Code

Suggest alternative

Edit details

datafusion

Apache DataFusion SQL Query Engine (by apache)

Arrow Big Data Dataframe datafusion Olap Python query-engine Rust SQL

Source Code

datafusion.apache.org

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

go-mysql-server		datafusion
	Project
23	Mentions	55
2,202	Stars	5,145
37.7%	Growth	5.2%
9.9	Activity	9.9
about 11 hours ago	Latest Commit	about 7 hours ago
Go	Language	Rust
Apache License 2.0	License	Apache License 2.0

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

go-mysql-server

Posts with mentions or reviews of go-mysql-server. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-09.

A MySQL compatible database engine written in pure Go
10 projects | news.ycombinator.com | 9 Apr 2024
What I Talk About When I Talk About Query Optimizer (Part 1): IR Design
7 projects | news.ycombinator.com | 29 Jan 2024

We implemented a query optimizer with a flexible intermediate representation in pure Go:
https://github.com/dolthub/go-mysql-server
Getting the IR correct so that it's both easy to use and flexible enough to be useful is a really interesting design challenge. Our primary abstraction in the query plan is called a Node, and is way more general than the IR type described in the article from OP. This has probably hurt us: we only recently separated the responsibility to fetch rows into its own part of the runtime, out of the IR -- originally row fetching was coupled to the Node type directly.
This is also the query engine that Dolt uses:
https://github.com/dolthub/dolt
But it has a plug-in architecture, so you can use the engine on any data source that implements a handful of Go interface.
I created an in-memory SQL database called MemSQL as a learning project
3 projects | /r/golang | 30 Mar 2023

Might be interested in https://github.com/dolthub/go-mysql-server, which also does this
Implementing the MySQL server protocol for fun and profit
2 projects | news.ycombinator.com | 22 Dec 2022

https://github.com/dolthub/go-mysql-server
One item under "Scope of this project":
Provide a runnable server speaking the MySQL wire protocol, connected to data sources of your choice.
MySQL-mimic - Python implementation of the MySQL server wire protocol.
4 projects | /r/Python | 31 Oct 2022
Parsing SQL
7 projects | news.ycombinator.com | 22 Aug 2022
Litetree – SQLite with Branches
3 projects | news.ycombinator.com | 22 Jul 2022

I just wanted to say thanks for https://github.com/dolthub/go-mysql-server
This is incredibly useful for anyone who wants to build their own DB or wrap another datasource so it's queryable via MySQL protocol.
Dolt Is Git for Data
5 projects | news.ycombinator.com | 23 Jun 2022

a very cool project they also maintain is a MySQL server framework for arbitrary backends (in Go): https://github.com/dolthub/go-mysql-server
You can define a "virtual" table (schema, how to retrieve rows/columns) and then a MySQL client can connect and execute arbitrary queries on your table (which could just be an API or other source)
A Golang library and interface that allows querying anything with SQL
1 project | news.ycombinator.com | 14 Feb 2022
The world of PostgreSQL wire compatibility
3 projects | news.ycombinator.com | 10 Feb 2022

Thanks for this write up! I've been really interested in postgres compatibility in the context of a tool I maintain (https://github.com/mergestat/mergestat) that uses SQLite. I've been looking for a way to expose the SQLite capabilities over a more commonly used wire-protocol like postgres (or mysql) so that existing BI and visualization tools can access the data.
This project is an interesting one: https://github.com/dolthub/go-mysql-server that provides a MySQL interface (wire and SQL) to arbitrary "backends" implemented in go.
It's really interesting how compatibility with existing protocols has become an important feature of new databases - there's so much existing tooling that already speaks postgres (or mysql), being able to leverage that is a huge advantage IMO

datafusion

Posts with mentions or reviews of datafusion. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-25.

Velox: Meta's Unified Execution Engine [pdf]
2 projects | news.ycombinator.com | 25 Mar 2024

Python's Substrait seems like the biggest/most-used competitor-ish out there. I'd love some compare & contrast; my sense is that Substrait has a smaller ambition, and more wants to be a language for talking about execution rather than a full on execution engine. https://github.com/substrait-io/substrait
We can also see from the DataFusion discussion that they too see themselves as a bit of a Velox competitor. https://github.com/apache/arrow-datafusion/discussions/6441
What I Talk About When I Talk About Query Optimizer (Part 1): IR Design
7 projects | news.ycombinator.com | 29 Jan 2024

Agree, substrait is a really cool project! Related: if you like substrait you might want to check out datafusion too. The project is a query execution engine built on top of Apache Arrow (with SQL parser, query planner & optimizer, execution engine, extensible user defined functions, among others) and it implements a substrait provider and consumer: https://github.com/apache/arrow-datafusion/tree/main/datafus...
DuckDB performance improvements with the latest release
8 projects | news.ycombinator.com | 6 Nov 2023

The draft contains some preliminary benchmark results, comparing it to DuckDB.
https://github.com/apache/arrow-datafusion/issues/6782
Apache Arrow DataFusion
1 project | news.ycombinator.com | 1 Oct 2023
GlareDB: An open source SQL database to query and analyze distributed data
4 projects | /r/dataengineering | 8 Jun 2023

Apache Arrow is a pretty common memory structure these days. Datafusion is an open query engine built in Rust started by Andy Grove.
DuckDB 0.8.0
5 projects | news.ycombinator.com | 17 May 2023

DuckDB is a great piece of software if you are
If you are looking for a query engine implemented in a safe language (Rust) I definitely suggest checking out DataFusion. It is comparable to DuckDB in performance, has all the standard built in SQL functionality, and is extensible in pretty much all areas (query language, data formats, catalogs, user defined functions, etc)
https://arrow.apache.org/datafusion/
Disclaimer I am a maintainer of DataFusion
Data Engineering with Rust
5 projects | /r/rust | 9 May 2023

https://github.com/jorgecarleitao/arrow2 https://github.com/apache/arrow-datafusion https://github.com/apache/arrow-ballista https://github.com/pola-rs/polars https://github.com/duckdb/duckdb
Polars: Computing a new column from multiple columns - there must be a better way
1 project | /r/rust | 4 May 2023
Bridging Async and Sync Rust Code - A lesson learned while working with Tokio
3 projects | /r/rust | 10 Mar 2023

Problem comes when you want to do this inside an async context since we couldn't block an async task. https://users.rust-lang.org/t/sync-function-invoking-async/43364/6 You might need to do it in another runtime/thread. It is not recommended to do this, but sometimes it is unavoidable while implementing a third-party trait. https://github.com/apache/arrow-datafusion/issues/3777 However, I believe this isn't a problem particular to tokio, or any specific runtime.
Using Rust to write a Data Pipeline. Thoughts. Musings.
5 projects | /r/rust | 14 Jan 2023

What are some alternatives?

When comparing go-mysql-server and datafusion you can also consider the following projects:

vitess-sqlparser - simply SQL Parser for Go ( powered by vitess and TiDB )

polars - Dataframes powered by a multithreaded, vectorized query engine, written in Rust

alasql - AlaSQL.js - JavaScript SQL database for browser and Node.js. Handles both traditional relational tables and nested JSON data (NoSQL). Export, store, and import data from localStorage, IndexedDB, or Excel.

ClickHouse - ClickHouse® is a free analytics DBMS for big data

sqlite-parser - JavaScript implentation of SQLite 3 query parser

databend - 𝗗𝗮𝘁𝗮, 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 & 𝗔𝗜. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

grammars-v4 - Grammars written for ANTLR v4; expectation that the grammars are free of actions.

db-benchmark - reproducible benchmark of database-like ops

zetasql - ZetaSQL - Analyzer Framework for SQL

duckdb - DuckDB is an in-process SQL OLAP Database Management System

lakeFS - lakeFS - Data version control for your data lake | Git for data

nushell - A new type of shell

go-mysql-server vs vitess-sqlparser datafusion vs polars go-mysql-server vs alasql datafusion vs ClickHouse go-mysql-server vs sqlite-parser datafusion vs databend go-mysql-server vs grammars-v4 datafusion vs db-benchmark go-mysql-server vs zetasql datafusion vs duckdb go-mysql-server vs lakeFS datafusion vs nushell

Compare go-mysql-server vs datafusion and see what are their differences.

go-mysql-server

datafusion

go-mysql-server

datafusion

What are some alternatives?