VkFFT
ocl
Our great sponsors
VkFFT | ocl | |
---|---|---|
37 | 7 | |
1,441 | 694 | |
- | 3.0% | |
8.1 | 5.9 | |
about 1 month ago | 18 days ago | |
C++ | Rust | |
MIT License | GNU General Public License v3.0 or later |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
VkFFT
-
[P] - VkFFT now supports quad precision (double-double) FFT computation on GPU
Hello, I am the creator of the VkFFT - GPU Fast Fourier Transform library for Vulkan/CUDA/HIP/OpenCL/Level Zero and Metal. In the latest update, I have added support for quad-precision double-double emulation for FFT calculation on most modern GPUs. I understand that modern ML is going in the opposite low-precision direction, but I still think that it may be useful to have this functionality at least for some prototyping and development of concepts.
- VkFFT now supports quad precision (double-double) FFT computation on GPU
-
VkFFT: Vulkan/CUDA/Hip/OpenCL/Level Zero/Metal Fast Fourier Transform Library
Not quite what I asked for, but close enough for now...
https://github.com/DTolm/VkFFT/discussions/126
-
Implementing complex numbers (and FFT) elegantly with just algebraic datatypes (no machine floats)
Source - I have made a somewhat functional programming-like FFT library (https://github.com/DTolm/VkFFT/tree/develop) which also operates on abstract data containers. Maybe it can be interesting to you from the algorithmic point of view.
-
how does Vulkan compare to CUDA?
VkFFT is a use-case I've heard of where Vulkan-Compute is faster than its Cuda and OpenCL counter-part: https://github.com/DTolm/VkFFT
-
VkFFT now supports Apple Metal API - M1 Pro GPU FFT benchmarking
Hello, I am the creator of the VkFFT - GPU Fast Fourier Transform library for Vulkan/CUDA/HIP/OpenCL and Level Zero. In the latest update, I have added support for Apple Metal API, which will allow VkFFT to run natively on modern Apple SoC. I have tested it on MacBook Pro with an M1 Pro 8c CPU/14c GPU SoC single precision on 1D batched FFT test of all systems from 2 to 4096. Achieved bandwidth is calculated as 2*system size divided by the time taken per FFT - minimum memory that has to be transferred between DRAM and GPU:
-
Any good compute shader tutorials?
Another possible project to look at is https://github.com/DTolm/VkFFT
- VkFFT now supports Rader's algorithm - A100 and MI250 benchmarks: Part 2
- VkFFT now supports Rader's algorithm - A100 and MI250 benchmarks
ocl
-
An example for OpenCL 3.0?
Please note that OpenCL consists of two parts: host API and a separate language which is used to write kernels (code which is going to be offloaded to devices). OpenCL specification describes host APIs as C-style APIs and that is what implementors has to provide. However, there are number of various libraries which provides bindings for other languages: - C++ - Python - Go - Rust
- Any OpenCL + Rust Guides
-
Non graphical computing on GPU
ocl
-
Resources for Vulkan GPGPU searched
I don't know a lot about Rust, but this looks like a valid set of OpenCL bindings for Rust: https://github.com/cogciprocate/ocl
-
What's the current state of GPU compute in rust?
If you prefer an open alternative to CUDA, there are complete, easy to use und well documented bindings for opencl: https://github.com/cogciprocate/ocl/
-
Trying to install something using rust and really stuck, any help at all appreciated.
- https://github.com/cogciprocate/ocl/issues/202
-
Making an algorithmic trading bot in Rust?
I use Rust with OpenCL (ocl). And I am still in college studying CS. It takes a while to setup OpenCL depending on what you want to do with it. But performance benefits are well worth it. On average I can backtest 4 years of data with 1 minute candles in about 8.745 ms for typical RSI indicator. This is done on i5-3320m CPU (not iGPU). Took me a year to build it. Was also learning rust with it. My project has many features so you probably can do it in half amount of time or even less. Currently the project has 21k in Rust and 2k lines in OpenCL.
What are some alternatives?
wgpu - Cross-platform, safe, pure-rust graphics api.
nvfancontrol - NVidia dynamic fan control for Linux and Windows
kompute - General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
rust-gpu - 🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧
GLSL - GLSL Shading Language Issue Tracker
cuda-samples - Samples for CUDA Developers which demonstrates features in CUDA Toolkit
vuh - Vulkan compute for people
rocFFT - Next generation FFT implementation for ROCm
autograph - Machine Learning Library for Rust
xNVMe - Portable and high-performance libraries and tools for NVMe devices as well as support for traditional/legacy storage devices/interfaces.
RustaCUDA - Rusty wrapper for the CUDA Driver API