ncnn
rocm-build
Our great sponsors
ncnn | rocm-build | |
---|---|---|
12 | 7 | |
19,176 | 167 | |
1.8% | - | |
9.4 | 0.0 | |
7 days ago | 4 months ago | |
C++ | C++ | |
GNU General Public License v3.0 or later | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ncnn
-
AMD Funded a Drop-In CUDA Implementation Built on ROCm: It's Open-Source
ncnn uses Vulkan for GPU acceleration, I've seen it used in a few projects to get AMD hardware support.
https://github.com/Tencent/ncnn
-
[D] Best way to package Pytorch models as a standalone application
They're using NCNN to package the model. Have a look. https://github.com/Tencent/NCNN
-
Realtime object detection android app
Hi. Here is my prefered android app for realtime objet detection: https://github.com/nihui/ncnn-android-nanodet ; https://github.com/Tencent/ncnn contains a lot of android demo app for a lot of models.
- ncnn: High-performance neural network inference framework optimized for mobile
-
Esp32 tensorflow lite
ncnn home page: https://github.com/Tencent/ncnn
-
MMDeploy: Deploy All the Algorithms of OpenMMLab
ncnn
-
Draw Things, Stable Diffusion in your pocket, 100% offline and free
Yes, Android devices tend to have bigger RAMs, making running 1024x1024 possible (this is not possible at all on iPhones, which could peak around 5GiB memory with my current implementation, some serious engineering required to bring that down on iPhone devices). The problem is I am not sure about speed. I would likely switch to NCNN (https://github.com/Tencent/ncnn) as the backend which have a decent Vulkan computing kernel support. It is definitely a possibility and there is a path to do that.
- What’s New in TensorFlow 2.10?
-
[Technical Article] OCR Upgrade
As the leading open-source inference framework in China and in the world, what we like are its almost zero cost cross-platform capability, high inference speed, and minimal deployment volume. (Project address: https://github.com/Tencent/ncnn)
-
Is there a functioning neural netowork or backbone written in pure C language only?
If you’re not planning on training the neural net on an embedded device and just do inference, this might interest you: https://github.com/Tencent/ncnn
rocm-build
- AMD's Hidden $100 Stable Diffusion Beast!
-
AMD GPU driver not installed correctly
Scripts to help with building rocm and hip. It will also help work out dependencies. You will need to modify the scripts for them to work and not all are required. https://github.com/xuhuisheng/rocm-build
-
Stable Diffusion on AMD RDNA3
Short answer no. Long answer "in theory" yes. I tried this [1] but gave up as building rocm + deps takes up to 6h :/ Official statement [2]
[1] https://github.com/xuhuisheng/rocm-build
-
Show HN: InvokeAI, an open source Stable Diffusion toolkit and WebUI
I am in the same boat with a gfx03 card. What patch did you use? The ones here? https://github.com/xuhuisheng/rocm-build
I also tried to compile pytorch with its Vulkan backend, but ended throwing the towel as LDFLAGS are a mess to get right (I successfully compiled it, but that was only part of the build chain, and decided I had better things to spend time on). I wonder how that would perform; ncnn works pretty decently.
-
How do I run Stable Diffusion and sharing FAQs
Unofficial black magic is available: https://github.com/xuhuisheng/rocm-build/tree/master/navi10 (pytorch 1.12.0 is outdated but can run SD)
- Deep Learning options on Radeon RX 6800
-
Which version of ROCm and Tensorflow should I use?
also have an RX570, currently running latest Tensorflow and ROCm 4.1. had to recompile some parts of ROCm 4.1 libraries to get tensorflow to work. mostly followed this guide: https://github.com/xuhuisheng/rocm-build/tree/master/gfx803
What are some alternatives?
XNNPACK - High-efficiency floating-point neural network inference operators for mobile, server, and Web
stable-diffusion-webui - Stable Diffusion web UI [Moved to: https://github.com/Sygil-Dev/sygil-webui]
rife-ncnn-vulkan - RIFE, Real-Time Intermediate Flow Estimation for Video Frame Interpolation implemented with ncnn library
stable-diffusion-webui - Stable Diffusion web UI
deepdetect - Deep Learning API and Server in C++14 support for Caffe, PyTorch,TensorRT, Dlib, NCNN, Tensorflow, XGBoost and TSNE
tensorflow-upstream - TensorFlow ROCm port
netron - Visualizer for neural network, deep learning and machine learning models
stable-diffusion - Optimized Stable Diffusion modified to run on lower GPU VRAM
darknet - Convolutional Neural Networks
stable-diffusion-webui - Stable Diffusion web UI [Moved to: https://github.com/sd-webui/stable-diffusion-webui]
RPi_64-bit_Zero-2-image - Raspberry Pi Zero 2 W 64-bit OS image with OpenCV, TensorFlow Lite and ncnn Framework.
stable-diffusion - A latent text-to-image diffusion model