Our great sponsors
-
decuda
Discontinued Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVidia G80 GPUs.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
If you are worried about Intel vs Arm vs whatever, use https://github.com/VcDevel/std-simd
This is false as well. It's the hardware taking care of the serialization, and that's the fundamental difference. Like I said, the machine instructions are scalar. People actually reverse engineered things enough to confirm that (https://github.com/laanwj/decuda). The blob you DMA into the GPU looks no different than a single-threaded program and would work as one if you had a scalar processor with the same instruction set.