Faster_SGEMM_CUDA

FP32 matrix multiplication of large square matrices, in some cases faster than cuBLAS. (by arekpaterek)

Faster_SGEMM_CUDA Alternatives

Similar projects and alternatives to Faster_SGEMM_CUDA

  1. maxas

    Assembler for NVIDIA Maxwell architecture (discontinued)

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better Faster_SGEMM_CUDA alternative or higher similarity.

Faster_SGEMM_CUDA reviews and mentions

Posts with mentions or reviews of Faster_SGEMM_CUDA. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-07-26.

Stats

Basic Faster_SGEMM_CUDA repo stats
4
0
3.3
11 months ago

arekpaterek/Faster_SGEMM_CUDA is an open source project licensed under the MIT License, which is an OSI-approved license.

The primary programming language of Faster_SGEMM_CUDA is Cuda.
