Llama2.c Alternatives

Similar projects and alternatives to llama2.c

llama.cpp

769 55,846 10.0 C++ llama2.c VS llama.cpp

LLM inference in C/C++
languagetool

310 11,543 10.0 Java llama2.c VS languagetool

Style and Grammar Checker for 25+ Languages
InfluxDB

www.influxdata.com sponsored

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Jailer

217 2,705 9.6 Java llama2.c VS Jailer

Database Subsetting and Relational Data Browsing Tool.
llama

184 53,053 8.1 Python llama2.c VS llama

Inference code for Llama models
FLiPStackWeekly

79 14 9.9 llama2.c VS FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
chatgpt-retrieval-plugin

52 20,836 6.1 Python llama2.c VS chatgpt-retrieval-plugin

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
pkgx

46 8,708 9.0 TypeScript llama2.c VS pkgx

the last thing you’ll install
WorkOS

workos.com sponsored

The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
fluent-bit

35 5,344 9.8 C llama2.c VS fluent-bit

Fast and Lightweight Logs and Metrics processor for Linux, BSD, OSX and Windows
llama2.c

17 1,380 9.5 C llama2.c VS llama2.c

Llama 2 Everywhere (L2E) (by trholding)
towhee

26 2,989 8.6 Python llama2.c VS towhee

Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
awesome-data-temporality

17 96 10.0 llama2.c VS awesome-data-temporality

A curated list to help you manage temporal data across many modalities 🚀.
micrograd

22 8,273 0.0 Jupyter Notebook llama2.c VS micrograd

A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
libsql

21 7,720 9.9 C llama2.c VS libsql

libSQL is a fork of SQLite that is both Open Source, and Open Contributions.
dify

11 23,073 9.9 TypeScript llama2.c VS dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
symmetric-ds

12 691 9.5 Java llama2.c VS symmetric-ds

SymmetricDS is database replication and file synchronization software that is platform independent, web enabled, and database agnostic. It is designed to make bi-directional data replication fast, easy, and resilient. It scales to a large number of nodes and works in near real-time across WAN and LAN networks.
fastGPT

3 174 7.4 Fortran llama2.c VS fastGPT

Fast GPT-2 inference written in Fortran (by certik)
pytorch-forecasting

9 3,611 8.6 Python llama2.c VS pytorch-forecasting

Time series forecasting with PyTorch
feldera

4 247 9.9 Rust llama2.c VS feldera

Feldera Continuous Analytics Platform
CML_AMP_Churn_Prediction_mlflow

1 1 1.9 Jupyter Notebook llama2.c VS CML_AMP_Churn_Prediction_mlflow

Build an scikit-learn model to predict churn using customer telco data.
api-for-open-llm

1 1,952 9.5 Python llama2.c VS api-for-open-llm

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口
SaaSHub

www.saashub.com sponsored

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a better llama2.c alternative or higher similarity.

Suggest an alternative to llama2.c

llama2.c reviews and mentions

Posts with mentions or reviews of llama2.c. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-01-01.

Stuff we figured out about AI in 2023
5 projects | news.ycombinator.com | 1 Jan 2024

FOr inference, less than 1KLOC of pure, dependency-free C is enough (if you include the tokenizer and command line parsing)[1]. This was a non-obvious fact for me, in principle, you could run a modern LLM 20 years ago with just 1000 lines of code, assuming you're fine with things potentially taking days to run of course.
Training wouldn't be that much harder, Micrograd[2] is 200LOC of pure Python, 1000 lines would probably be enough for training an (extremely slow) LLM. By "extremely slow", I mean that a training run that normally takes hours could probably take dozens of years, but the results would, in principle, be the same.
If you were writing in C instead of Python and used something like Llama CPP's optimization tricks, you could probably get somewhat acceptable training performance in 2 or 3 KLOC. You'd still be off by one or two orders of magnitude when compared to a GPU cluster, but a lot better than naive, loopy Python.
[1] https://github.com/karpathy/llama2.c
[2] https://github.com/karpathy/micrograd
Minimal neural network implementation
4 projects | /r/C_Programming | 6 Dec 2023

A bit off topic but ML-guru Mr Karpathy has implemented a state-of-art Llama2 model in a plain C with no dependencies on 3rd party/freeware libraries. See repo.
WebLLM: Llama2 in the Browser
4 projects | news.ycombinator.com | 28 Aug 2023

Related. I built karpathy’s llama2.c (https://github.com/karpathy/llama2.c) without modifications to WASM and run it in the browser. It was a fun exercise to directly compare native vs. Web perf. Getting 80% of native performance on my M1 Macbook Air and haven’t spent anytime optimizing the WASM side.
Demo: https://diegomarcos.com/llama2.c-web/
Code:
Lfortran: Modern interactive LLVM-based Fortran compiler
2 projects | news.ycombinator.com | 28 Aug 2023

Would be cool for there to be a `llama2.f`, similar to https://github.com/karpathy/llama2.c, to demo it's capabilities
Llama2.c L2E LLM – Multi OS Binary and Unikernel Release
4 projects | news.ycombinator.com | 25 Aug 2023

This is a fork of https://github.com/karpathy/llama2.c
karpathy's llama2.c is like llama.cpp but it is written in C and the python training code is available in that same repo. llama2.c's goal is to be a elegant single file C implementation of the inference and an elegant python implementation for training.
His goal is for people to understand how llama 2 and LLM's work, so he keeps it simple and sweet. As the project progresses, so will features and performance improvements added.
Currently it can infer baby (small) Story models trained by Karpathy at a fast pace. It can also infer Meta LLAMA 2 7b models, but at a very slow rate such as 1 token per second.
So currently this can be used for learning or as a tech preview.
Our friendly fork tries to make it portable, performant and more usable (bells and whistles) over time. Since we mirror upstream closely, the inference capabilities of our fork is similar but slightly faster if compiled with acceleration. What we try to do different is that we try to make this bootable (not there yet) and portable. Right now you can get binary portablity - use the same run.com on any x86_64 machine running on any OS, it will work (possible due to cosmopolitan toolchain). The other part that works is unikernels - boot this as unikernel in VM's (possible due unikraft unikernel & toolchain).
See our fork currently as a release early and release often toy tech demo. We plan to build it out into a useful product.
FLaNK Stack Weekly for 14 Aug 2023
32 projects | dev.to | 14 Aug 2023
Adding LLaMa2.c support for Web with GGML.JS
2 projects | /r/LocalLLaMA | 14 Aug 2023

In my latest release of ggml.js, I've added support for Karapathy's llama2.c model.
Beginner's Guide to Llama Models
2 projects | news.ycombinator.com | 12 Aug 2023

I really enjoyed Anrej Kaparthy's llama2.c project (https://github.com/karpathy/llama2.c), which runs through creating and running a miniature Llama2 architecture model from scratch.
How to scale LLMs better with an alternative to transformers
1 project | news.ycombinator.com | 27 Jul 2023

- https://github.com/karpathy/llama2.c
I think there may be some applications in this limited space that are worth looking into. You won’t replicate GPT-anything but it may be possible to solve some nice problems very much more efficiently that one would expect at first.
A simple guide to fine-tuning Llama 2
1 project | news.ycombinator.com | 27 Jul 2023

It does now: https://github.com/karpathy/llama2.c#metas-llama-2-models
A note from our sponsor - SaaSHub
www.saashub.com | 27 Apr 2024

SaaSHub helps you find the best software and product alternatives Learn more →