C++ inference-engine

Open-source C++ projects categorized as inference-engine

Top 5 C++ inference-engine Projects

inference-engine
  1. aphrodite-engine

    Large-scale LLM inference engine

    Project mention: How to run llama 405b bf16 with gh200s | dev.to | 2024-12-21

    k=3 ssh_k # building this on the third machine git clone https://github.com/PygmalionAI/aphrodite-engine.git ~/shared/aphrodite-engine cd ~/shared/aphrodite-engine pip install protobuf==3.20.2 ninja msgspec coloredlogs portalocker pytimeparse -r requirements-common.txt python setup.py bdist_wheel pip install --no-deps dist/*.whl

  2. Nutrient

    Nutrient - The #1 PDF SDK Library. Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.

    Nutrient logo
  3. yalm

    Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O

    Project mention: Fast LLM Inference From Scratch (using CUDA) | news.ycombinator.com | 2024-12-15

    Oops, you're right and it's a difference between my blog post and source code. It should be __shfl_down_sync as seen [here](https://github.com/andrewkchan/yalm/blob/8c908f23f5d8cc3f14c...)

  4. Daisykit

    Daisykit is an easy AI toolkit with face mask detection, pose detection, background matting, barcode detection, and more. With Daisykit, you don't need AI knowledge to build AI software.

  5. EasyOCR-cpp

    Custom C++ implementation of deep learning based OCR

  6. nnl

    a low-latency and high-performance inference engine for large models on low-memory GPU platform.

  7. CodeRabbit

    CodeRabbit: AI Code Reviews for Developers. Revolutionize your code reviews with AI. CodeRabbit offers PR summaries, code walkthroughs, 1-click suggestions, and AST-based analysis. Boost productivity and code quality across all major languages with each PR.

    CodeRabbit logo
NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

C++ inference-engine discussion

Log in or Post with

C++ inference-engine related posts

  • Fast LLM Inference From Scratch (using CUDA)

    5 projects | news.ycombinator.com | 15 Dec 2024
  • QUIK is a method for quantizing LLM post-training weights to 4 bit precision

    2 projects | news.ycombinator.com | 6 Nov 2023
  • Intel OpenVINO 2023.1.0 released

    1 project | /r/intel | 20 Sep 2023
  • Intel OpenVINO 2023.1.0 released, open-source toolkit for optimizing and deploying AI inference

    1 project | /r/opensource | 20 Sep 2023
  • OpenVINO 2023.1.0 released

    1 project | /r/IntelArc | 20 Sep 2023
  • [N] Intel OpenVINO 2023.1.0 released, open-source toolkit for optimizing and deploying AI inference

    1 project | /r/MachineLearning | 19 Sep 2023
  • Intel OpenVINO 2023.1.0 released, open-source toolkit for optimizing and deploying AI inference

    1 project | /r/computervision | 19 Sep 2023
  • A note from our sponsor - SaaSHub
    www.saashub.com | 16 Feb 2025
    SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source inference-engine projects in C++? This list will help you:

# Project Stars
1 aphrodite-engine 1,287
2 yalm 233
3 Daisykit 104
4 EasyOCR-cpp 51
5 nnl 5

Sponsored
Nutrient - The #1 PDF SDK Library
Bad PDFs = bad UX. Slow load times, broken annotations, clunky UX frustrates users. Nutrient’s PDF SDKs gives seamless document experiences, fast rendering, annotations, real-time collaboration, 100+ features. Used by 10K+ devs, serving ~half a billion users worldwide. Explore the SDK for free.
nutrient.io

Did you know that C++ is
the 7th most popular programming language
based on number of references?