Llama-2-Onnx Alternatives

Similar projects and alternatives to Llama-2-Onnx

llama.cpp

775 57,463 10.0 C++ Llama-2-Onnx VS llama.cpp

LLM inference in C/C++
mlc-llm

89 17,053 9.9 Python Llama-2-Onnx VS mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
SHARK

84 1,385 9.4 Python Llama-2-Onnx VS SHARK

SHARK - High Performance Machine Learning Distribution
FLiPStackWeekly

81 14 9.9 Llama-2-Onnx VS FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
pkgx

47 8,716 9.0 TypeScript Llama-2-Onnx VS pkgx

the last thing you’ll install
chatgpt-retrieval-plugin

52 20,850 6.1 Python Llama-2-Onnx VS chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
vllm

31 18,931 9.9 Python Llama-2-Onnx VS vllm

A high-throughput and memory-efficient inference and serving engine for LLMs
fluent-bit

35 5,366 9.8 C Llama-2-Onnx VS fluent-bit

Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX and Windows
axolotl

29 5,899 9.8 Python Llama-2-Onnx VS axolotl

Go ahead and axolotl questions
towhee

26 3,001 8.6 Python Llama-2-Onnx VS towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
libsql

23 7,782 9.9 C Llama-2-Onnx VS libsql

libSQL is a fork of SQLite that is both Open Source, and Open Contributions.
awesome-data-temporality

17 96 10.0 Llama-2-Onnx VS awesome-data-temporality

A curated list to help you manage temporal data across many modalities 🚀.
OpenPipe

13 2,381 9.9 TypeScript Llama-2-Onnx VS OpenPipe

Turn expensive prompts into cheap fine-tuned models
llama2.c

13 16,071 9.2 C Llama-2-Onnx VS llama2.c

Inference Llama 2 in one file of pure C
dify

13 27,030 9.9 TypeScript Llama-2-Onnx VS dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
symmetric-ds

12 693 9.5 Java Llama-2-Onnx VS symmetric-ds

SymmetricDS is database replication and file synchronization software that is platform independent, web enabled, and database agnostic. It is designed to make bi-directional data replication fast, easy, and resilient. It scales to a large number of nodes and works in near real-time across WAN and LAN networks.
pytorch-forecasting

9 3,625 8.6 Python Llama-2-Onnx VS pytorch-forecasting

Time series forecasting with PyTorch
onnx-coreml

1 378 10.0 Python Llama-2-Onnx VS onnx-coreml

Discontinued ONNX to Core ML Converter
feldera

4 256 9.9 Rust Llama-2-Onnx VS feldera

Feldera Continuous Analytics Platform
gpt-llm-trainer

4 3,811 5.4 Jupyter Notebook Llama-2-Onnx VS gpt-llm-trainer

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better Llama-2-Onnx alternative or higher similarity.

Llama-2-Onnx reviews and mentions

Posts with mentions or reviews of Llama-2-Onnx. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-09-12.

Show HN: Fine-tune your own Llama 2 to replace GPT-3.5/4
8 projects | news.ycombinator.com | 12 Sep 2023

System: Here's some docs, answer concisely in a sentence.
YMMV on cost still, depending on cloud vendor, and my intuition & viewpoint agree with yours: GPT-3.5 is priced low enough that there isn't a case where it makes sense to use another model.
It strikes me now that this is _very_ likely true, and not just our intuition: OpenAI's $/GPU hour is likely <= any other vendor's.
The next big step will come from formalizing the stuff rolling around the local LLM community. For months now it's been either one-off $X.c stunts that run on desktop or, for the vast majority of the _actual_ usage and progress, porn-y stuff, like all nascent tech.
Microsoft has LLaMa-2 ONNX available on GitHub[1]. There are budding but very small projects in different languages to wrap ONNX. Once there's a genuine cross-platform[2] ONNX wrapper that makes running LLaMa-2 easy, there will be a step change. It'll be "free"[3] to run your fine-tuned model that does as well as GPT-4.
It's not clear to me exactly when this will occur. It's "difficult" now, but only because the _actual usage_ in the local LLM community doesn't have a reason to invest in ONNX, and it's extremely intimidating to figure out how exactly to get LLaMa-2 running in ONNX. Microsoft kinda threw it up on GitHub and moved on; the sample code even still needs a PyTorch model. I see at least one very small company on HuggingFace that _may_ have figured out full ONNX.
[1] https://github.com/microsoft/Llama-2-Onnx
FLaNK Stack Weekly for 14 Aug 2023
32 projects | dev.to | 14 Aug 2023
Llama 2 on ONNX runs locally
5 projects | news.ycombinator.com | 10 Aug 2023