-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
yEnc encoding/decoding (used in Usenet binary transfers): performance is upwards of 6GB/s on one core of a modern CPU. Even if we assume a GPU has infinite performance, just the transfer to+from over a PCI-e link likely isn't much faster. yEnc also operates on bytes (which GPUs suck at relative to CPUs) and involves a fair bit of conditional data shuffling
PAR2 creation (file-based error correction): the core routine here is GF(2^16) matrix multiplication. AVX512 has the GF2P8AFFINEQB instruction, which isn't present on any GPU that I know of, but massively accelerates GF multiplication. Without this instruction, you have to resort to emulating it via bitwise operations or lookup tables, which are much slower. I actually have an OpenCL implementation, but its performance is rather meh compared to CPU