ROCm VS KernelAbstractions.jl

Compare ROCm vs KernelAbstractions.jl and see what are their differences.

ROCm

AMD ROCmâ„¢ Software - GitHub Home [Moved to: https://github.com/ROCm/ROCm] (by RadeonOpenCompute)
Our great sponsors
  • WorkOS - The modern identity platform for B2B SaaS
  • InfluxDB - Power Real-Time Data Analytics at Scale
  • SaaSHub - Software Alternatives and Reviews
ROCm KernelAbstractions.jl
198 4
3,637 330
- 2.7%
0.0 8.0
4 months ago 5 days ago
Python Julia
MIT License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

ROCm

Posts with mentions or reviews of ROCm. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-06.

KernelAbstractions.jl

Posts with mentions or reviews of KernelAbstractions.jl. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-04-12.

What are some alternatives?

When comparing ROCm and KernelAbstractions.jl you can also consider the following projects:

tensorflow-directml - Fork of TensorFlow accelerated by DirectML

Pytorch - Tensors and Dynamic neural networks in Python with strong GPU acceleration

rocm-arch - A collection of Arch Linux PKGBUILDS for the ROCm platform

oneAPI.jl - Julia support for the oneAPI programming toolkit.

SHARK - SHARK - High Performance Machine Learning Distribution

plaidml - PlaidML is a framework for making deep learning work everywhere.

llama.cpp - LLM inference in C/C++

exllama - A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

tensorflow-upstream - TensorFlow ROCm port

ROCm-OpenCL-Runtime - ROCm OpenOpenCL Runtime

AdaptiveCpp - Implementation of SYCL and C++ standard parallelism for CPUs and GPUs from all vendors: The independent, community-driven compiler for C++-based heterogeneous programming models. Lets applications adapt themselves to all the hardware in the system - even at runtime!

kompute - General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.