SaaSHub helps you find the best software and product alternatives Learn more →
RyzenAI-SW Alternatives
Similar projects and alternatives to RyzenAI-SW
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
-
-
-
-
-
-
-
-
-
lemonade
Lemonade helps users discover and run local AI apps by serving optimized LLMs right from their own GPUs and NPUs. Join our discord: https://discord.gg/5xXzkMu8Zk
-
-
-
-
FastFlowLM
Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.
-
-
-
RyzenAI-SW discussion
RyzenAI-SW reviews and mentions
-
Lemonade by AMD: a fast and open source local LLM server using GPU and NPU
AMD/Xilinx's software support for the NPU is fully open, it's only FFLM's models that are proprietary. See https://github.com/amd/iron https://github.com/Xilinx/mlir-aie https://github.com/amd/RyzenAI-SW/ . It would be nice to explore whether one can simply develop kernels for these NPU's using Vulkan Compute and drive them that way; that would provide unification with the existing cross-platform support for GPU's.
-
AMD Inference
You should have ROCm/HIP support on the iGPU as well, be sure to compile llama.cpp w/ the LLAMA_HIP_UMA=1 flag. If you take a look at https://github.com/amd/RyzenAI-SW you can see there's a fair amount of software to play with on the NPU now, but Phoenix is only 16 TOPS, so I've never bothered testing it.
- An Interview with AMD CEO Lisa Su About Solving Hard Problems
-
AMD unveils Ryzen Pro 8000-series processors
In the benchmark you have linked, you clearly see that the performance of the CPU only implementation and the NPU implementation are identical.
https://github.com/amd/RyzenAI-SW/blob/main/example/transfor...
What this should tell you is that "15 TOPs" is an irrelevant number in this benchmark. There are exactly two FLOPs per parameter. Loading the parameters takes more time than processing them.
There are people with less than 8GB of VRAM and they can't load these models into their GPU and end up with the exact same performance as on CPU. The 12tflops of the 3060 Ti 8GB are "no good" for LLMs, because the bottleneck for token generation is memory bandwidth.
My Ryzen 2700 gets 7 tokens per second at 50 GFLOPs. What does this tell you? The NPU can saturate the memory bandwidth of the system.
Now here is the gotcha: Have you tried inputting very large prompts? Because that is where the speedup is going to be extremely noticeable. Instead of waiting minutes on a 2000 token prompt, it will be just as fast as on GPUs, because the initial prompt processing is compute bound.
Also, before calling something subpar, you're going to have to tell me how you are going to put larger models like Goliath 70b or 120b models on your GPU.
-
AMD Unveils Ryzen 8000G Series Processors: Zen 4 APUs for Desktop with Ryzen AI
Unfortunately, Ryzen AI, while neat, remains Windows-exclusive
https://github.com/amd/RyzenAI-SW/issues/2
- AMD announces Ryzen 8045HS, 8040HS and 8040U "Hawk Point" series powered by Zen4, RDNA3 and XDNA - VideoCardz.com
- AMD Wants To Know If You'd Like Ryzen AI Support On Linux - Please upvote here to have a AMD AI Linux driver
- AMD wants to know if you would like Ryzen AI support for Linux
-
AMD Wants to Know If You'd Like Ryzen AI Support on Linux
I mean, it barely even exists or seems to be acknowledged in Windows either. Somehow, they made it into a feature that OEMs can decide to disable , even with compatible CPUs. This issue (and the complete lack of info, save from the input of a very helpful employee) to me shows the state of ryzenai:
https://github.com/amd/RyzenAI-SW/issues/5#issuecomment-1726...
-
A note from our sponsor - SaaSHub
www.saashub.com | 13 Jun 2026
Stats
amd/RyzenAI-SW is an open source project licensed under MIT License which is an OSI approved license.
The primary programming language of RyzenAI-SW is Python.