llm-mlc
DP_means
llm-mlc | DP_means | |
---|---|---|
3 | 1 | |
172 | 45 | |
- | - | |
5.1 | 1.7 | |
about 2 months ago | about 1 year ago | |
Python | C++ | |
Apache License 2.0 | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
llm-mlc
-
LLM now provides tools for working with embeddings
I'm still iterating on that. Plugins get complete control over the prompts, so they can handle the various weirdnesses of them. Here's some relevant code:
https://github.com/simonw/llm-gpt4all/blob/0046e2bf5d0a9c369...
https://github.com/simonw/llm-mlc/blob/b05eec9ba008e700ecc42...
https://github.com/simonw/llm-llama-cpp/blob/29ee8d239f5cfbf...
I'm not completely happy with this yet. Part of the problem is that different models on the same architecture may have completely different prompting styles.
I expect I'll eventually evolve the plugins to allow them to be configured in an easier and more flexible way. Ideally I'd like you to be able to run new models on existing architectures using an existing plugin.
-
Show HN: LlamaGPT – Self-hosted, offline, private AI chatbot, powered by Llama 2
What is the advantage of this versus running something like https://github.com/simonw/llm , which also gives you options to e.g. use https://github.com/simonw/llm-mlc for accelerated inference?
-
Show HN: LLMs can generate valid JSON 100% of the time
I'm quite impressed with Llama 2 13B - the more time I spend with it the more I think it might be genuinely useful for more than just playing around with local LLMs.
I'm using the MLC version (since that works with a GPU on my M2 Mac) via my https://github.com/simonw/llm-mlc plugin.
DP_means
-
LLM now provides tools for working with embeddings
I found one implementation here: https://github.com/vsmolyakov/DP_means
Alternatively, there is a Bayesian GMM in sklearn. When you restrict it to diagonal Covariance matrices, you should be fine in high dimensions
What are some alternatives?
llm-gpt4all - Plugin for LLM adding support for the GPT4All collection of models
llm-cluster - LLM plugin for clustering embeddings
can-ai-code - Self-evaluating interview for AI coders
llm-llama-cpp - LLM plugin for running models using llama.cpp
llama-gpt - A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!
datasette-faiss - Maintain a FAISS index for specified Datasette tables
outlines - Structured Text Generation
TypeChat - TypeChat is a library that makes it easy to build natural language interfaces using types.
ad-llama - Structured inference with Llama 2 in your browser
llama.cpp - LLM inference in C/C++
relm - ReLM is a Regular Expression engine for Language Models