Building the Perfect Memory Bandwidth Beast

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

rotate

4 143 10.0 C

A collection of array rotation algorithms.

Memory bandwidth is 1000x lower than CPU bandwidth, so as a rule of thumb any algorithm whose work scales linearly in the amount of data being processed will be memory bandwidth bound, and also any algorithm which can't be structured to do a lot of work on one memory region at once before moving onto the next one.
Examples (for large enough inputs that it's relevant) include shuffling, sorting, kmeans clustering, branch and bound sudoku solving, vector addition, dot products, and so on.
Moreover, writing a particular piece of code is often easier if you ignore memory bandwidth as a constraint. The classic example is matrix multiplication -- it can be structured such that even disk bandwidth isn't relevant compared to CPU bandwidth, but doing so is a little fiddly compared to the naive n^2 dot products approach, so writing it yourself usually results in a memory bandwidth bound solution for large matrices.
Similarly, writing two passes over your data rather than doing a mega-loop, the choice to use classic kmeans rather than one of its approximations (when it would be appropriate to do so), or not enforcing sortedness at some reasonable boundary and having to do additional passes over your data. It's easy to write code that hoovers up way more bandwidth than it needs to, and often faster algorithms that come out don't do anything different than access the right data at the right time to reduce that pressure, like a trinity rotation [0].
Caveat: Benchmark everything, especially as you're building intuition. Trying to fix what you think is a memory bandwidth issue can result in pipeline stalls and all sorts of fun things, especially when your server has more faster caches than your dev machine, when data in prod doesn't match your micro benchmark, ....
[0] https://github.com/scandum/rotate

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Is there a more efficient way to write this C program?

1 project | /r/C_Programming | 2 Mar 2023
10 000 Lower Kurast runs - my results!

1 project | /r/diablo2 | 16 Jun 2023
2000 Trav Runs

1 project | /r/diablo2 | 27 Mar 2023
Is there some handy run-counter for D2R?

1 project | /r/diablo2 | 20 Jan 2023
Making copies of online characters in single player PD2 + Plugy

1 project | /r/ProjectDiablo2 | 19 Jan 2023

Building the Perfect Memory Bandwidth Beast

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
trinity reversal gries-mills Grail juggling
Post date: 26 Jan 2023

rotate

InfluxDB

Related posts

Is there a more efficient way to write this C program?

10 000 Lower Kurast runs - my results!

2000 Trav Runs

Is there some handy run-counter for D2R?

Making copies of online characters in single player PD2 + Plugy

Building the Perfect Memory Bandwidth Beast

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com trinity reversal gries-mills Grail juggling Post date: 26 Jan 2023

rotate

InfluxDB

Related posts

Is there a more efficient way to write this C program?

10 000 Lower Kurast runs - my results!

2000 Trav Runs

Is there some handy run-counter for D2R?

Making copies of online characters in single player PD2 + Plugy

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
trinity reversal gries-mills Grail juggling
Post date: 26 Jan 2023