1brc vs 1brc

1brc

1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java (by gunnarmorling)

Suggest topics

Source Code

morling.dev

Suggest alternative

Edit details

1brc

C99 implementation of the 1 Billion Rows Challenge. 1️⃣🐝🏎️ Runs in ~1.6 seconds on my not-so-fast laptop CPU w/ 16GB RAM. (by dannyvankooten)

1brc

Source Code

Suggest alternative

Edit details

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

1brc		1brc
	Project
28	Mentions	5
5,246	Stars	66
-	Growth	-
9.8	Activity	7.2
23 days ago	Latest Commit	20 days ago
Java	Language	C
Apache License 2.0	License	-

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

1brc

Posts with mentions or reviews of 1brc. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-13.

The One Billion Row Challenge in CUDA: from 17 minutes to 17 seconds
5 projects | news.ycombinator.com | 13 Apr 2024

This would be the code to beat. Ideally with only 8 cores but any number of cores is also very interesting.
https://github.com/gunnarmorling/1brc/discussions/710
One Billion Row Challenge in Golang - From 95s to 1.96s
2 projects | dev.to | 17 Mar 2024

Given that 1-billion-line-file is approximately 13GB, instead of providing a fixed database, the official repository offers a script to generate synthetic data with random readings. Just follow the instructions to create your own database.
1BRC Merykitty's Magic SWAR: 8 Lines of Code Explained in 3k Words
4 projects | news.ycombinator.com | 9 Mar 2024

Local disk I/O is no longer the bottleneck on modern systems: https://benhoyt.com/writings/io-is-no-longer-the-bottleneck/
In addition, the official 1BRC explicitly evaluated results on a RAM disk to avoid I/O speed entirely: https://github.com/gunnarmorling/1brc?tab=readme-ov-file#eva... "Programs are run from a RAM disk (i.o. the IO overhead for loading the file from disk is not relevant)"
Processing One Billion Rows in PHP!
3 projects | dev.to | 8 Mar 2024

You may have heard of the "The One Billion Row Challenge" (1brc) and in case you don't, go checkout Gunnar Morlings's 1brc repo.
The One Billion Row Challenge in Go: from 1m45s to 4s in nine solutions
15 projects | news.ycombinator.com | 2 Mar 2024

Here’s a thread on results with duckdb, I don’t mean to discourage you taking a shot at all though: https://github.com/gunnarmorling/1brc/discussions/39
Ask HN: How can I learn about performance optimization?
6 projects | news.ycombinator.com | 2 Mar 2024

If you are in “javaland” look at billion row challenge, you will learn a lot - https://github.com/gunnarmorling/1brc
Lessons Learned from Doing the One Billion Row Challenge
2 projects | news.ycombinator.com | 26 Feb 2024
1B Row Challenge Shows Java Can Process 1B Rows File in 2 Seconds
7 projects | news.ycombinator.com | 29 Jan 2024
From slow to SIMD: A Go optimization story
10 projects | news.ycombinator.com | 23 Jan 2024

Even manual vectorization is pain...writing ASM, really?
Rust has unstable portable SIMD and a few third-party crates, C++ has that as well, C# has stable portable SIMD and a very small BLAS-like library on top of it (hell it even exercises PackedSIMD when ran in a browser) and Java is getting stable Panama vectors some time in the future (though the question of codegen quality stands open given planned changes to unsafe API).
Go among these is uniquely disadvantaged. And if that's not enough, you may want to visit 1Brc's challenge discussions and see that Go struggles get anywhere close to 2s mark with both C# and C++ are blazing past it:
https://hotforknowledge.com/2024/01/13/1brc-in-dotnet-among-...
https://github.com/gunnarmorling/1brc/discussions/67
JEP Draft: Deprecate Memory-Access Methods in Sun.misc.Unsafe for Removal
3 projects | news.ycombinator.com | 16 Jan 2024

In terms of performance: I realize that this is a somewhat "toy" issue, and it's a sample size of 1, but for the currently ongoing "One Billion Row Challenge"[1] (an ongoing Java performance competition related to parsing and aggregating a 13 GB file), all of the current top-performers are using Unsafe. More specifically, the use of Unsafe appears to have been the change for a few entries that allowed getting below the 3-second barrier in the test.
1. https://github.com/gunnarmorling/1brc

1brc

Posts with mentions or reviews of 1brc. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-13.

The One Billion Row Challenge in CUDA: from 17 minutes to 17 seconds
5 projects | news.ycombinator.com | 13 Apr 2024

There are some good ideas for this type of problem here: https://github.com/dannyvankooten/1brc
After you deal with parsing and hashes, basically you are IO limited so mmap helps. A reasonable guess is that even for the optimal CUDA implementation, because there is no compute to speak of other than a hashmap, the starting of kernels and transfer of data to the GPU would likely add a noticeable bottleneck and make the optimal CUDA code slower than this pure C code.
The One Billion Row Challenge in Go: from 1m45s to 4s in nine solutions
15 projects | news.ycombinator.com | 2 Mar 2024

c dominates every other language again...https://github.com/dannyvankooten/1brc#submitting
The One Billion Row Challenge
10 projects | news.ycombinator.com | 3 Jan 2024

You can run the bin/create-sample program from this C implementation here: https://github.com/dannyvankooten/1brc
It’s just the city names + averages from the official repository using a normal distribution to generate 1B random rows.

What are some alternatives?

When comparing 1brc and 1brc you can also consider the following projects:

yolov7-object-tracking - YOLOv7 Object Tracking Using PyTorch, OpenCV and Sort Tracking

nodejs - 1️⃣🐝🏎️ The One Billion Row Challenge with Node.js -- A fun exploration of how quickly 1B rows from a text file can be aggregated with different languages.

csvlens - Command line csv viewer

JDK - JDK main-line development https://openjdk.org/projects/jdk

1brc - 1BRC in .NET among fastest on Linux

pocketbase - Open Source realtime backend in 1 file

Apache Arrow - Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing

java - Java bindings for TensorFlow

MindsDB - The platform for customizing AI from enterprise data

highway - Performance-portable, length-agnostic SIMD with runtime dispatch

Tribuo - Tribuo - A Java machine learning library

1brc vs yolov7-object-tracking 1brc vs nodejs 1brc vs csvlens 1brc vs JDK 1brc vs nodejs 1brc vs 1brc 1brc vs pocketbase 1brc vs Apache Arrow 1brc vs java 1brc vs MindsDB 1brc vs highway 1brc vs Tribuo

Compare 1brc vs 1brc and see what are their differences.

1brc

1brc

1brc

1brc

What are some alternatives?