VkFFT
GLSL
Our great sponsors
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
VkFFT
-
[P] - VkFFT now supports quad precision (double-double) FFT computation on GPU
Hello, I am the creator of the VkFFT - GPU Fast Fourier Transform library for Vulkan/CUDA/HIP/OpenCL/Level Zero and Metal. In the latest update, I have added support for quad-precision double-double emulation for FFT calculation on most modern GPUs. I understand that modern ML is going in the opposite low-precision direction, but I still think that it may be useful to have this functionality at least for some prototyping and development of concepts.
- VkFFT now supports quad precision (double-double) FFT computation on GPU
-
VkFFT: Vulkan/CUDA/Hip/OpenCL/Level Zero/Metal Fast Fourier Transform Library
Not quite what I asked for, but close enough for now...
https://github.com/DTolm/VkFFT/discussions/126
-
Implementing complex numbers (and FFT) elegantly with just algebraic datatypes (no machine floats)
Source - I have made a somewhat functional programming-like FFT library (https://github.com/DTolm/VkFFT/tree/develop) which also operates on abstract data containers. Maybe it can be interesting to you from the algorithmic point of view.
-
how does Vulkan compare to CUDA?
VkFFT is a use-case I've heard of where Vulkan-Compute is faster than its Cuda and OpenCL counter-part: https://github.com/DTolm/VkFFT
-
VkFFT now supports Apple Metal API - M1 Pro GPU FFT benchmarking
Hello, I am the creator of the VkFFT - GPU Fast Fourier Transform library for Vulkan/CUDA/HIP/OpenCL and Level Zero. In the latest update, I have added support for Apple Metal API, which will allow VkFFT to run natively on modern Apple SoC. I have tested it on MacBook Pro with an M1 Pro 8c CPU/14c GPU SoC single precision on 1D batched FFT test of all systems from 2 to 4096. Achieved bandwidth is calculated as 2*system size divided by the time taken per FFT - minimum memory that has to be transferred between DRAM and GPU:
-
Any good compute shader tutorials?
Another possible project to look at is https://github.com/DTolm/VkFFT
- VkFFT now supports Rader's algorithm - A100 and MI250 benchmarks: Part 2
- VkFFT now supports Rader's algorithm - A100 and MI250 benchmarks
GLSL
-
Is there a good resource for mapping GLSL extensions to Vulkan extensions/features
You're looking for this repo
-
GLSL shaders resources
You have access to physical (u64) global memory pointers through buffer_reference in GLSL, note you should be using buffer_reference2
-
New Blog: Introducing Ray Tracing Position Fetch Extension
On April 27, 2023 the Vulkan® Ray Tracing TSG released the VK_KHR_ray_tracing_position_fetch extension, which exposes the ability to fetch vertex positions from an acceleration structure hit when tracing rays. The SPIR-V SPV_KHR_ray_tracing_position_fetch and GLSL GL_EXT_ray_tracing_position_fetch extensions have also been released to provide SPIR-V and GLSL support for this functionality.
- Where can I find and what is with there being practically no documentation for GL_EXT_spirv_intrinsics and spirv_by_reference?
-
Thoughts?
You can also have general extensions that are only supported on some hardware, for instance NVIDIA supports all of these extensions on OpenGL that add support for Vulkan's thread subgroup operations while AMD only supports these extensions and a few others that add support for similar features to the extension that NVIDIA supports, but with way more restrictions.
-
You Can Use Vulkan Without Pipelines Today - Khronos Blog
GL_NV_ray_tracing is a vendor extension, but it does exist.
-
Porting from DXR/HLSL to Vulkan Ray Tracing Extension/GLSL
According to this document, one should be able to extract those values through
-
Learning Modern 3D Graphics Programming
The reports of my death are greatly exaggerated
https://github.com/KhronosGroup/GLSL/blob/master/extensions/...
-
Is there any way to index an array of bindless textures based on gl_InstanceID?
I don't think there's away. For Vulkan, there's GL_EXT_nonuniform_qualifier which allows you to index with gl_InstanceIndex but AFAIK it's not available for OpenGL.
-
Drawing multiple separate textures in one draw call
AMD hardware works a little differently, and for that you have to write a waterfall loop with subgroup instructions to get the correct behavior (since GL_EXT_nonuniform_qualifier doesn't exist in OpenGL GLSL). The loop might look like this (you would use it the same way as NonUniformEXT in Vulkan GLSL).
What are some alternatives?
wgpu - Cross-platform, safe, pure-rust graphics api.
clspv - Clspv is a compiler for OpenCL C to Vulkan compute shaders
kompute - General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.
Rust-CUDA - Ecosystem of libraries and tools for writing and executing fast GPU code fully in Rust.
rust-gpu - 🐉 Making Rust a first-class language and ecosystem for GPU shaders 🚧
vuh - Vulkan compute for people
cuda-samples - Samples for CUDA Developers which demonstrates features in CUDA Toolkit
ocl - OpenCL for Rust
rocFFT - Next generation FFT implementation for ROCm
build_all - GO HERE FIRST: nvpro-samples overview
xNVMe - Portable and high-performance libraries and tools for NVMe devices as well as support for traditional/legacy storage devices/interfaces.
alpaka - Abstraction Library for Parallel Kernel Acceleration :llama: