mlc-llm Alternatives

Similar projects and alternatives to mlc-llm

text-generation-webui

876 35,862 9.9 Python mlc-llm VS text-generation-webui

A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.
llama.cpp

769 55,846 10.0 C++ mlc-llm VS llama.cpp

LLM inference in C/C++
ROCm

198 3,637 0.0 Python mlc-llm VS ROCm

Discontinued AMD ROCm™ Software - GitHub Home [Moved to: https://github.com/ROCm/ROCm]
ollama

192 58,943 9.9 Go mlc-llm VS ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.
whisper.cpp

187 30,942 9.8 C mlc-llm VS whisper.cpp

Port of OpenAI's Whisper model in C/C++
koboldcpp

180 3,749 10.0 C++ mlc-llm VS koboldcpp

A simple one-file way to run various GGML and GGUF models with KoboldAI's UI
FastChat

82 33,877 9.6 Python mlc-llm VS FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FLiPStackWeekly

79 14 9.9 mlc-llm VS FLiPStackWeekly

FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...
ggml

69 9,642 9.8 C mlc-llm VS ggml

Tensor library for machine learning
GPTQ-for-LLaMa

75 2,913 8.6 Python mlc-llm VS GPTQ-for-LLaMa

4 bits quantization of LLaMA using GPTQ
exllama

64 2,582 9.0 Python mlc-llm VS exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
llama-cpp-python

54 6,378 9.9 Python mlc-llm VS llama-cpp-python

Python bindings for llama.cpp
dalai

59 13,044 6.5 CSS mlc-llm VS dalai

The simplest way to run LLaMA on your local machine
open_llama

52 7,193 5.3 mlc-llm VS open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
web-llm

42 9,018 9.0 TypeScript mlc-llm VS web-llm

Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
jsonformer

25 3,774 5.4 Jupyter Notebook mlc-llm VS jsonformer

A Bulletproof Way to Generate Structured JSON from Language Models
sparsegpt

16 620 3.2 Python mlc-llm VS sparsegpt

Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
CTranslate2

13 2,776 9.0 C++ mlc-llm VS CTranslate2

Fast inference engine for Transformer models
tvm

15 11,156 9.9 Python mlc-llm VS tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators
Open-Llama

7 637 10.0 Python mlc-llm VS Open-Llama

Discontinued The complete training code of the open-source high-performance Llama model, including the full process from pre-training to RLHF.

NOTE: The number of mentions on this list counts mentions in common posts plus user-suggested alternatives. Hence, a higher number means a better mlc-llm alternative or higher similarity.

mlc-llm reviews and mentions

Posts with mentions or reviews of mlc-llm. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-03-04.

FLaNK 04 March 2024
26 projects | dev.to | 4 Mar 2024
Ai on a android phone?
2 projects | /r/LocalLLaMA | 8 Dec 2023

This one uses the GPU, but it doesn't support Mistral yet: https://github.com/mlc-ai/mlc-llm
MLC vs llama.cpp
2 projects | /r/LocalLLaMA | 7 Nov 2023

I have tried running Mistral 7B with MLC on my M1 (Metal), and it kept crashing (git issue with description). Memory inefficiency problems.
[Project] Scaling LLama2 70B with Multi NVIDIA and AMD GPUs under 3k budget
1 project | /r/LocalLLaMA | 21 Oct 2023

Project: https://github.com/mlc-ai/mlc-llm
Scaling LLama2-70B with Multi Nvidia/AMD GPU
2 projects | news.ycombinator.com | 19 Oct 2023
AMD May Get Across the CUDA Moat
8 projects | news.ycombinator.com | 6 Oct 2023

For LLM inference, a shoutout to MLC LLM, which runs LLM models on basically any API that's widely available: https://github.com/mlc-ai/mlc-llm
ROCm Is AMD's #1 Priority, Executive Says
5 projects | news.ycombinator.com | 26 Sep 2023

One of your problems might be that gfx1032 is not supported by AMD's ROCm packages, which has a laughably short list of supported hardware: https://rocm.docs.amd.com/en/latest/release/gpu_os_support.h...
The normal workaround is to assign the closest architecture, eg gfx1030, so `HSA_OVERRIDE_GFX_VERSION=10.3.0` might help
Also, it looks like some of your tested projects are OpenCL? For me, I do something like: `yay -S rocm-hip-sdk rocm-ml-sdk rocm-opencl-sdk` to cover all the bases.
My recent interest has been LLMs and this is my general step by step for those (llama.cpp, exllama) for those interested: https://llm-tracker.info/books/howto-guides/page/amd-gpus
I didn't port the docs back in, but also here's a step-by-step w/ my adventures getting TVM/MLC working w/ an APU: https://github.com/mlc-ai/mlc-llm/issues/787
From my experience, ROCm is improving, but there's a good reason that Nvidia has 90% market share even at big price premiums.
Show HN: Ollama for Linux – Run LLMs on Linux with GPU Acceleration
14 projects | news.ycombinator.com | 26 Sep 2023

Maybe they're talking about https://github.com/mlc-ai/mlc-llm which is used for web-llm (https://github.com/mlc-ai/web-llm)? Seems to be using TVM.
Show HN: Fine-tune your own Llama 2 to replace GPT-3.5/4
8 projects | news.ycombinator.com | 12 Sep 2023

you already have TVM for the cross platform stuff
see https://tvm.apache.org/docs/how_to/deploy/android.html
or https://octoml.ai/blog/using-swift-and-apache-tvm-to-develop...
or https://github.com/mlc-ai/mlc-llm
Ask HN: Are you training and running custom LLMs and how are you doing it?
1 project | news.ycombinator.com | 14 Aug 2023