InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now. Learn more →
Top 23 C++ Machine Learning Projects
-
Project mention: None of the top 10 projects in GitHub is actually a software project 🤯 | dev.to | 2025-05-10
We see an addition to the AI community with AutoGPT. Along with Tensorflow they represent the AI community in the software category, which is getting relevant (2 out of 8). We can expect in the future to have new AI projects in the top 25 such as Transformers or Ollama (currently top 34 and 36, respectively).
-
InfluxDB
InfluxDB – Built for High-Performance Time Series Workloads. InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.
-
https://www.home-assistant.io/integrations/seven_segments/
https://www.unix-ag.uni-kl.de/~auerswal/ssocr/
https://github.com/tesseract-ocr/tesseract
https://community.home-assistant.io/t/ocr-on-camera-image-fo...
https://www.google.com/search?q=home+assistant+ocr+integrati...
https://www.google.com/search?q=esphome+ocr+sensor
https://hackaday.com/2021/02/07/an-esp-will-read-your-meter-...
...start digging around and you'll likely find something. HA has integrations which can support writing to InfluxDB (local for sure, and you can probably configure it for a remote influxdb).
You're looking at 1xRaspberry PI, 1xUSB Webcam, 1x"Power Management / humidity management / waterproof electrical box" to stuff it into, and then either YOLO and DIY to shoot over to your influxdb, or set up a Home Assistant and "attach" your frankenbox as some sort of "sensor" or "integration" which spits out metrics and yadayada...
-
-
openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
-
C-Plus-Plus
Collection of various algorithms in mathematics, machine learning, computer science and physics implemented in C++ for educational purposes.
-
Project mention: Integrating MediaPipe with DeepSeek for Enhanced AI Performance | dev.to | 2025-02-03
Code Examples: Check out the MediaPipe and LLM Integration Examples provided by Google AI Edge.
-
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
Boosting Boosting is not a separate ML model but a technique that combines multiple weak learners to create a single model that can generate highly accurate predictions. Xgboost is a common boosting model that supports distributed training, resulting in faster training. According to research by Intel, Xgboost can be more effective than a neural network-based approach for tabular data. In addition, Xgboost is faster to train and doesn’t require as much data as neural networks need.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Project mention: From Voice to Text: Exploring Speech-to-Text Tools and APIs for Developers | dev.to | 2025-05-19Setup: Install deepspeech with pip install deepspeech. Download pre-trained models from DeepSpeech Releases. Use a 16kHz mono WAV file.
-
PaddlePaddle
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
PaddlePaddle (so good they named it twice) predates Ray and supports both data parallel and model-parallel training. It is still being developed.
https://github.com/PaddlePaddle/Paddle
They have pedigry.
-
Github Source Code: CNTK
-
LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
-
Project mention: Generative AI Interview for Senior Data Scientists: 50 Key Questions and Answers | dev.to | 2025-05-06
How it works: A model trained in one framework can be converted to the ONNX format. This format can then be run on various hardware or inference engines that support it (e.g., ONNX Runtime). It facilitates easy model transfer and execution even if the development framework and deployment environment differ.
-
Project mention: Dlib: Modern C++ toolkit containing machine learning algorithms | news.ycombinator.com | 2025-03-19
-
video2x
A machine learning-based video super resolution and frame interpolation framework. Est. Hack the Valley II, 2018.
-
Project mention: Xiaomi unveils open-source AI reasoning model MiMo | news.ycombinator.com | 2025-04-30
One of the core design goals Georgi Gerganov had with GGUF was to not need other files. It's literally bullet point #1 in the specs
>Single-file deployment
>Full information: all information needed to load a model is contained in the model file, and no additional information needs to be provided by the user.
https://github.com/ggml-org/ggml/blob/master/docs/gguf.md
We literally just got rid of that multi file chaos only for ollama to add it back :/
-
-
MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/README.md)
Project mention: Alibaba mnn android app support DeepSeek R1 model | news.ycombinator.com | 2025-02-06 -
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
-
-
catboost
A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
Project mention: 🚀 Why Your ML Service Needs Rust + CatBoost: A Setup Guide That Actually Works | dev.to | 2025-01-19[package] name = "MLApp" version = "0.1.0" edition = "2021" [dependencies] catboost = { git = "https://github.com/catboost/catboost", rev = "0bfdc35"}
-
jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
-
-
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
C++ Machine Learning discussion
C++ Machine Learning related posts
-
Baby Steps into Genetic Programming
-
How to Create Vector Embeddings in Node.js
-
Dlib: Modern C++ toolkit containing machine learning algorithms
-
GPT 4.5 level for 1% of the price
-
Show HN: Txeo – A Modern C++ Wrapper for TensorFlow
-
Train a Mnist VAE with C and CUDA
-
Unlocking DuckDB from Anywhere - A Guide to Remote Access with Apache Arrow and Flight RPC (gRPC)
-
A note from our sponsor - InfluxDB
www.influxdata.com | 20 May 2025
Index
What are some of the best open-source Machine Learning projects in C++? This list will help you:
# | Project | Stars |
---|---|---|
1 | tensorflow | 189,943 |
2 | tesseract-ocr | 66,835 |
3 | Caffe | 34,356 |
4 | openpose | 32,454 |
5 | C-Plus-Plus | 31,775 |
6 | mediapipe | 29,728 |
7 | xgboost | 26,922 |
8 | DeepSpeech | 26,336 |
9 | PaddlePaddle | 22,774 |
10 | CNTK | 17,552 |
11 | LightGBM | 17,217 |
12 | onnxruntime | 16,629 |
13 | Dlib | 13,994 |
14 | video2x | 13,351 |
15 | ggml | 12,521 |
16 | Open3D | 12,300 |
17 | MNN | 10,862 |
18 | vowpal_wabbit | 8,565 |
19 | oneflow | 8,448 |
20 | catboost | 8,394 |
21 | jetson-inference | 8,287 |
22 | interpret | 6,496 |
23 | serving | 6,281 |