C++ language-model Projects
- nnl: a low-latency, high-performance inference engine for large models on low-memory GPU platforms.
There's https://github.com/saharNooby/rwkv.cpp, which is related-ish[0] to ggml/llama.cpp.
[0]: https://github.com/ggerganov/llama.cpp/issues/846
Project mention: Run 70B LLM Inference on a Single 4GB GPU with This New Technique | news.ycombinator.com | 2023-12-03
I did roughly the same thing in one of my hobby projects, https://github.com/fengwang/nnl. But instead of using an SSD, I load all the weights into host memory, and while running inference through the model layer by layer, I asynchronously copy memory from global to shared memory in the hope of better performance. However, my approach is bounded by PCIe bandwidth.
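The scheme the comment describes can be sketched in CUDA: keep every layer's weights in pinned host RAM, double-buffer them onto the GPU, and overlap the host-to-device copy of the next layer with the compute of the current one. This is a minimal illustration of that overlap, not nnl's actual API; the Layer struct and run_layer() are hypothetical placeholders, and throughput remains capped by PCIe bandwidth as the comment notes.

```cpp
#include <cuda_runtime.h>
#include <algorithm>
#include <vector>

struct Layer {
    const float* host_w;  // layer weights in page-locked host memory (cudaHostAlloc)
    size_t bytes;
};

// Hypothetical stand-in for the real per-layer kernels (matmul, attention, ...).
void run_layer(const float* /*dev_w*/, cudaStream_t /*s*/) { /* launch kernels here */ }

void infer(const std::vector<Layer>& layers) {
    cudaStream_t compute, copy;
    cudaStreamCreate(&compute);
    cudaStreamCreate(&copy);

    // Two device buffers sized for the largest layer (double buffering).
    size_t max_bytes = 0;
    for (const Layer& l : layers) max_bytes = std::max(max_bytes, l.bytes);
    float* dev[2];
    cudaEvent_t copied[2], freed[2];
    for (int b = 0; b < 2; ++b) {
        cudaMalloc(&dev[b], max_bytes);
        cudaEventCreate(&copied[b]);
        cudaEventCreate(&freed[b]);
        cudaEventRecord(freed[b], compute);  // both buffers start out free
    }

    for (size_t i = 0; i < layers.size(); ++i) {
        int buf = static_cast<int>(i & 1);
        // Don't overwrite a buffer the compute stream is still reading from.
        cudaStreamWaitEvent(copy, freed[buf], 0);
        cudaMemcpyAsync(dev[buf], layers[i].host_w, layers[i].bytes,
                        cudaMemcpyHostToDevice, copy);
        cudaEventRecord(copied[buf], copy);

        // Compute waits only for its own weights, so the next copy overlaps with it.
        cudaStreamWaitEvent(compute, copied[buf], 0);
        run_layer(dev[buf], compute);
        cudaEventRecord(freed[buf], compute);
    }
    cudaStreamSynchronize(compute);

    for (int b = 0; b < 2; ++b) cudaFree(dev[b]);
    cudaStreamDestroy(compute);
    cudaStreamDestroy(copy);
}
```

The host loop only enqueues work, so while layer i runs on the compute stream the copy stream is already transferring layer i+1; the events keep the two buffers from being overwritten while still in use.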
C++ language-model related posts
- Haystack DB – 10x faster than FAISS with binary embeddings by default
- WyGPT: Minimal mature GPT model in C++
- [D] SentencePiece, WordPiece, BPE... Which tokenizer is the best one?
- [P] wyGPT: improved small GPT model in C++ from scratch
- wangyi-fudan/wyGPT
- [P] wyGPT: Improved Small GPT In C++ From Scratch
- WyGPT: C++ GPT Language Model from Scratch