flash-attention-minimal
Vulkan-Samples
flash-attention-minimal | Vulkan-Samples | |
---|---|---|
2 | 44 | |
464 | 4,011 | |
- | 1.9% | |
5.7 | 9.1 | |
2 months ago | 8 days ago | |
Cuda | C++ | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
flash-attention-minimal
-
Google's First Tensor Processing Unit: Architecture
Vulcan is a driver-level API. It competes with DirectX and OpenGL.
CUDA is a language you write kernels. It competes with OpenAI's Triton language.
Here's what CUDA looks like: https://github.com/tspeterkim/flash-attention-minimal/blob/m...
This is what Triton looks like: https://triton-lang.org/main/getting-started/tutorials/06-fu...
By contrast Vulcan looks like this: https://github.com/KhronosGroup/Vulkan-Samples/blob/main/sam...
(It's true to some extent that maybe you could use Vulcan shaders to write deep learning kernels, maybe? I'm not aware of anyone doing it though)
- Show HN: Flash Attention in ~100 lines of CUDA
Vulkan-Samples
-
Google's First Tensor Processing Unit: Architecture
Vulcan is a driver-level API. It competes with DirectX and OpenGL.
CUDA is a language you write kernels. It competes with OpenAI's Triton language.
Here's what CUDA looks like: https://github.com/tspeterkim/flash-attention-minimal/blob/m...
This is what Triton looks like: https://triton-lang.org/main/getting-started/tutorials/06-fu...
By contrast Vulcan looks like this: https://github.com/KhronosGroup/Vulkan-Samples/blob/main/sam...
(It's true to some extent that maybe you could use Vulcan shaders to write deep learning kernels, maybe? I'm not aware of anyone doing it though)
-
Question regarding descriptor set binding invalidation/disturbing over pipeline boundaries
Hi, i'm looking into descriptor indexing and was looking at the Vulkan-Samples repo. I finally thought i understood how the set binding invalidation works (like a stack?). In the code below both the non_uniform_indexing pipeline and the update_after_bind pipeline use the immutable sampler descriptor in their fragment shader. But since the descriptor set = 0 is rebound, won't the immutable sampler in set = 1 from the "non_uniform_indexing" pipeline be "disturbed" and have to be rebound?
- New Vulkan Sample - VK_EXT_extended_dynamic_state2 Extension
-
Show HN: A tutorial for building a 2D game engine with Go and OpenGL
and see if it works.
[1] https://github.com/KhronosGroup/Vulkan-Samples/blob/main/sam...
- New Vulkan sample - VK_EXT_full_screen_exclusive
-
Mesh Shader Sample
In this new sample, we see how to incorporate the Vulkan extension VK_EXT_mesh_shader and introduce per primitive culling in a mesh shader. https://github.com/KhronosGroup/Vulkan-Samples/tree/main/samples/extensions/mesh_shader_culling
- New Extension Sample: VK_EXT_swapchain_maintenance1
-
Looking for resources / mentor
Also beginner here, I think you can learn other's codes to get a better understanding of the API by learning how others can abstract these concept into higher level. They're many great projects out there like Google's pbr renderer filament https://github.com/google/filament, AMD's gltf sample https://github.com/GPUOpen-LibrariesAndSDKs/glTFSample and also as many suggested, the Sascha Willems's repos https://github.com/SaschaWillems and the official sample https://github.com/KhronosGroup/Vulkan-Samples
-
Recommendations on how to start a small Vulkan project
Another way I saw some projects start was by using example base classes (either Sascha Willems' implementations or the framework used by Khronos Samples).
- Can anyone give valid reasons why game engines are any easier to use than just programming with something like Vulkan?
What are some alternatives?
Vulkan - C++ examples for the Vulkan graphics API
renderdoc - RenderDoc is a stand-alone graphics debugging tool.
vulkano - Safe and rich Rust wrapper around the Vulkan API
bgfx - Cross-platform, graphics API agnostic, "Bring Your Own Engine/Framework" style rendering library.
MoltenVK - MoltenVK is a Vulkan Portability implementation. It layers a subset of the high-performance, industry-standard Vulkan graphics and compute API over Apple's Metal graphics framework, enabling Vulkan applications to run on macOS, iOS and tvOS.
VulkanHelper - A simple helper interface between Vulkan C API and C++ containers
glTF-Sample-Models - glTF Sample Models
vulkan-tutorial-rs - Rust version of https://github.com/Overv/VulkanTutorial
paradigm - C++20 Vulkan and GLes rendering engine
Vulkan-Profiles - Vulkan Profiles Tools
ascii-graphics - 3D graphics in the terminal using ASCII characters
Hazel - Hazel Engine