Efficient LLM inference solution on Intel GPU

intel-extension-for-pytorch

Mentions: 14 | Stars: 1,342 | Activity: 9.7 | Language: Python

A Python package that extends official PyTorch to easily obtain better performance on Intel platforms

OK I found it. Looks like they use SYCL (which for some reason they've rebranded to DPC++): https://github.com/intel/intel-extension-for-pytorch/tree/v2...

llama.cpp

Mentions: 769 | Stars: 56,891 | Activity: 10.0 | Language: C++

LLM inference in C/C++
Cgml

Mentions: 21 | Stars: 37 | Activity: 8.6 | Language: C++

GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a more popular project.