1brc
1brc
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
1brc
-
The One Billion Row Challenge in CUDA: from 17 minutes to 17 seconds
There are some good ideas for this type of problem here: https://github.com/dannyvankooten/1brc
After you deal with parsing and hashes, basically you are IO limited so mmap helps. A reasonable guess is that even for the optimal CUDA implementation, because there is no compute to speak of other than a hashmap, the starting of kernels and transfer of data to the GPU would likely add a noticeable bottleneck and make the optimal CUDA code slower than this pure C code.
-
The One Billion Row Challenge in Go: from 1m45s to 4s in nine solutions
c dominates every other language again...https://github.com/dannyvankooten/1brc#submitting
-
The One Billion Row Challenge
You can run the bin/create-sample program from this C implementation here: https://github.com/dannyvankooten/1brc
Itβs just the city names + averages from the official repository using a normal distribution to generate 1B random rows.
1brc
-
Arena-Based Parsers
It may seem unexpected given all the hype around Go, but it is a surprisingly poor choice for this. If you want a more convenient language than C++ or Rust but retain the ability to reach optimal performance, C# could serve you much better.
Go underperforms at trivial XML parsing: https://news.ycombinator.com/item?id=40283721
If you push it, C# can extract optimal HW utilization when parsing text, beating C++: https://hotforknowledge.com/2024/01/13/1brc-in-dotnet-among-... (Go was not on the list because it was that much slower)
-
The One Billion Row Challenge in Go: from 1m45s to 4s in nine solutions
The more accurate statement would be is Go incapable of optimizations performed by Java and then Java is incapable of optimizations performed by C# and C++ implementations.
See https://hotforknowledge.com/2024/01/13/1brc-in-dotnet-among-...
- The One Billion Row Challenge β .NET Edition
- The One Billion Row Challenge
What are some alternatives?
1brc - 1οΈβ£πποΈ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
plb2 - A programming language benchmark
nodejs - 1οΈβ£πποΈ The One Billion Row Challenge with Node.js -- A fun exploration of how quickly 1B rows from a text file can be aggregated with different languages.
JDK - JDK main-line development https://openjdk.org/projects/jdk