There might be something already written in Rust for actually splitting the lines with SIMD, but I'm not sure. Then you'll want to actually parse each CSV line. I think if you go far down this road, you'll end up with something like simdjson for CSV. Oh, hey, there's a simdcsv project, in C++, from the same authors. I can't tell at first glance how far along it is. But it was written (started?) by two very smart people famous for pushing the state of the art with SIMD. You could try calling into it via cxx, or (if you're feeling extremely ambitious) porting it to Rust.
I think memchr only finds the needle, so I then had to take a slice from the last needle to this needle in a loop. I might have this wrong? When I timed it, it ended up slower on my machine. I looked at the memchr source, and it doesn't support NEON SIMD :( https://github.com/BurntSushi/memchr/issues/76
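For reference, here is a minimal sketch of the "slice from the last needle to this needle" loop described above. It is std-only so it runs without extra crates: `find_byte` is a hypothetical stand-in with the same shape as `memchr::memchr(needle, haystack)`, and `split_lines` is a made-up helper name, not anything from the memchr crate. It only splits on `\n` and does not handle quoted CSV fields or `\r\n`.

```rust
/// Hypothetical stand-in for `memchr::memchr`: returns the index of the
/// first occurrence of `needle` in `haystack`, if any.
fn find_byte(needle: u8, haystack: &[u8]) -> Option<usize> {
    haystack.iter().position(|&b| b == needle)
}

/// Splits `data` on b'\n' by repeatedly searching the unscanned tail and
/// slicing from the previous needle to the one just found.
/// A trailing fragment without a final newline is kept as the last line.
fn split_lines(data: &[u8]) -> Vec<&[u8]> {
    let mut lines = Vec::new();
    let mut start = 0;
    // `offset` is relative to `start`, because only the tail is searched.
    while let Some(offset) = find_byte(b'\n', &data[start..]) {
        lines.push(&data[start..start + offset]);
        start += offset + 1; // skip past the newline itself
    }
    if start < data.len() {
        lines.push(&data[start..]);
    }
    lines
}
```

Calling `split_lines(b"a,b\nc,d\ne")` yields the three slices `a,b`, `c,d`, and `e`. With the real crate you would swap `find_byte` for `memchr::memchr`, or iterate newline positions with `memchr::memchr_iter` instead of re-slicing the tail each time. Whether that beats a plain loop on an ARM machine is something you'd have to measure.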