clspv vs Cgml

clspv

Clspv is a compiler for OpenCL C to Vulkan compute shaders (by google)

Source Code

Suggest alternative

Edit details

Cgml

GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation. (by Const-me)

Suggest topics

Source Code

Suggest alternative

Edit details

Our great sponsors

InfluxDB - Power Real-Time Data Analytics at Scale

WorkOS - The modern identity platform for B2B SaaS

SaaSHub - Software Alternatives and Reviews

Our great sponsors

clspv		Cgml
	Project
8	Mentions	21
575	Stars	37
2.4%	Growth	-
8.9	Activity	8.6
4 days ago	Latest Commit	3 months ago
LLVM	Language	C++
Apache License 2.0	License	GNU Lesser General Public License v3.0 only

The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

clspv

Posts with mentions or reviews of clspv. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-09.

Vcc – The Vulkan Clang Compiler
9 projects | news.ycombinator.com | 9 Jan 2024

See https://github.com/google/clspv for an OpenCL implementation on Vulkan Compute. There are plenty of quirks involved because the two standards use different varieties of SPIR-V ("kernels" vs. "shaders") and provide different guarantees (Vulkan Compute doesn't care much about numerical accuracy). The Mesa folks are also looking into this as part of their RustiCL (a modern OpenCL implementation) and Zink (implementing OpenGL and perhaps OpenCL itself on Vulkan) projects.
AMD's CDNA 3 Compute Architecture
7 projects | news.ycombinator.com | 17 Dec 2023

Vulkan Compute backends for numerical compute (as typified by both OpenCL and SYCL) are challenging, you can look at Google's cspv https://github.com/google/clspv project for the nitty gritty details. The lowest-effort path is actually via some combination of Rocm (for hardware that AMD bothers to support themselves) and the Mesa project's Rusticl backend (for everything else).
WSL with CUDA Support
2 projects | news.ycombinator.com | 22 Jan 2022

D3D12 has more compute features than Vulkan has. It works out for DXVK because games often don’t use those, but it’ll cause much more issues with CLon12.
By the way, if you are ready to have a _limited_ implementation without a full feature set because of Vulkan API limitations, clvk is a thing. The list of limitations of that approach is at https://github.com/google/clspv/blob/master/docs/OpenCLCOnVu...
tldr: Vulkan and OpenCL SPIR-V dialects are different, and the former has significant limitations affecting this use case
Resources for Vulkan GPGPU searched
6 projects | /r/vulkan | 12 Jan 2022
Low overhead C++ interface for Apple's Metal API
8 projects | news.ycombinator.com | 22 Nov 2021

For OpenCL on DX12, the test suite doesn't pass yet. Every Khronos OpenCL 1.2 CTS test passes on at least one hardware driver, but there's none that pass them all. That is why CLon12 isn't submitted to Khronos's compliant products list yet.
The pointer semantics that Vulkan has aren't very amenable to implementing a compliant OpenCL implementation on top of. There are also some other limitatons: https://github.com/google/clspv/blob/master/docs/OpenCLCOnVu....
[Hardware Unboxed] - Apple M1 Pro Review - Is It Really Faster than Intel/AMD?
2 projects | /r/hardware | 10 Nov 2021

Vulkan is much more limited, notably because of Vulkan's SPIR-V dialect limitations. That makes a compliant OpenCL 1.2 impl on top of Vulkan impossible. (see: https://github.com/google/clspv/blob/master/docs/OpenCLCOnVulkan.md)
Cross Platform GPU-Capable Framework?
6 projects | /r/gpgpu | 1 Aug 2021

OpenCL really is your best bet for a cross-platform GPU-capable framework. OpenCL 3.0 cleared out a lot of the cruft from OpenCL 2.x so it's seeing a lot more adoption. The most cross-platform solution is still OpenCL 1.2, largely for MacOS, but OpenCL 3.0 is becoming more and more common for Windows and Linux and multiple devices. Even on platforms without native OpenCL support there are compatibility layers that implement OpenCL on top of DirectX (OpenCLOn12) or Vulkan (clvk and clspv).

Cgml

Posts with mentions or reviews of Cgml. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-07.

Groq CEO: 'We No Longer Sell Hardware'
2 projects | news.ycombinator.com | 7 Apr 2024

> If there is a future with this idea, its gotta be just shipping the LLM with game right?
That might be a nice application for this library of mine: https://github.com/Const-me/Cgml/
That’s an open source Mistral ML model implementation which runs on GPUs (all of them, not just nVidia), takes 4.5GB on disk, uses under 6GB of VRAM, and optimized for interactive single-user use case. Probably fast enough for that application.
You wouldn’t want in-game dialogues with the original model though. Game developers would need to finetune, retrain and/or do something else with these weights and/or my implementation.
Ask HN: How to get started with local language models?
6 projects | news.ycombinator.com | 17 Mar 2024

If you just want to run Mistral on Windows, you could try my port: https://github.com/Const-me/Cgml/tree/master/Mistral/Mistral...
The setup is relatively easy: install .NET runtime, download 4.5 GB model file from BitTorrent, unpack a small ZIP file and run the EXE.
OpenAI postmortem – Unexpected responses from ChatGPT
1 project | news.ycombinator.com | 22 Feb 2024

Speaking about random sampling during inference, most ML models are doing it rather inefficiently.
Here’s a better way: https://github.com/Const-me/Cgml/blob/master/Readme.md#rando...
My HLSL is easily portable to CUDA, which has `__syncthreads` and `atomicInc` intrinsics.
Nvidia's Chat with RTX is a promising AI chatbot that runs locally on your PC
7 projects | news.ycombinator.com | 13 Feb 2024
AMD Funded a Drop-In CUDA Implementation Built on ROCm: It's Open-Source
23 projects | news.ycombinator.com | 12 Feb 2024

I did a few times with Direct3D 11 compute shaders. Here’s an open-source example: https://github.com/Const-me/Cgml
Pretty sure Vulkan gonna work equally well, at the very least there’s an open source DXVK project which implements D3D11 on top of Vulkan.
Brave Leo now uses Mixtral 8x7B as default
7 projects | news.ycombinator.com | 27 Jan 2024

Here’s an example of a custom 4 bits/weight codec for ML weights:
https://github.com/Const-me/Cgml/blob/master/Readme.md#bcml1...
llama.cpp does it slightly differently but still, AFAIK their quantized data formats are conceptually similar to my codec.
Efficient LLM inference solution on Intel GPU
3 projects | news.ycombinator.com | 20 Jan 2024
Vcc – The Vulkan Clang Compiler
9 projects | news.ycombinator.com | 9 Jan 2024

> the API was high-friction due to the shader language, and the glue between shader and CPU
Direct3D 11 compute shaders share these things with Vulkan, yet D3D11 is relatively easy to use. For example, see that library which implements ML-targeted compute shaders for C# with minimal friction: https://github.com/Const-me/Cgml The backend implemented in C++ is rather simple, just binds resources and dispatches these shaders.
I think the main usability issue with Vulkan is API design. Vulkan was only designed with AAA game engines in mind. The developers of these game engines have borderline unlimited budgets, and their requirements are very different from ordinary folks who want to leverage GPU hardware.
I made an app that runs Mistral 7B 0.2 LLM locally on iPhone Pros
12 projects | news.ycombinator.com | 7 Jan 2024

Minor update https://github.com/Const-me/Cgml/releases/tag/1.1a Can’t edit that comment anymore, too late.
Ask HN: Best way to learn GPU programming?
2 projects | news.ycombinator.com | 1 Jan 2024

For implementing stuff from scratch, if you use Windows you could try my C# based library for that: https://github.com/Const-me/Cgml/
It’s vendor agnostic, so HLSL instead of CUDA or Triton. Here’s the compute shaders implementing inference of Mistral-7B model: https://github.com/Const-me/Cgml/tree/master/Mistral/Mistral...

What are some alternatives?

When comparing clspv and Cgml you can also consider the following projects:

OpenCLOn12 - The OpenCL-on-D3D12 mapping layer

PowerInfer - High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

kompute - General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized for advanced GPU data processing usecases. Backed by the Linux Foundation.

ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.

GLSL - GLSL Shading Language Issue Tracker

mlx - MLX: An array framework for Apple silicon

alpaka - Abstraction Library for Parallel Kernel Acceleration :llama:

EmotiVoice - EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

MoltenVK - MoltenVK is a Vulkan Portability implementation. It layers a subset of the high-performance, industry-standard Vulkan graphics and compute API over Apple's Metal graphics framework, enabling Vulkan applications to run on macOS, iOS and tvOS.

llamafile - Distribute and run LLMs with a single file.

SPIRV-VM - Virtual machine for executing SPIR-V

HIP - HIP: C++ Heterogeneous-Compute Interface for Portability