ollama_local_rag
promptfoo
ollama_local_rag | promptfoo | |
---|---|---|
1 | 5 | |
16 | 328 | |
- | - | |
6.4 | 10.0 | |
12 days ago | 11 months ago | |
Python | TypeScript | |
- | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
ollama_local_rag
-
Ollama v0.1.33 with Llama 3, Phi 3, and Qwen 110B
I love working with Ollama, I was really surprised at how easy it is to build a simple RAG system with it. For example: https://github.com/stephen37/ollama_local_rag
promptfoo
-
Ollama v0.1.33 with Llama 3, Phi 3, and Qwen 110B
Jumping in because I'm a big believer in (1) local LLMs, and (2) evals specific to individual use cases.
[0] https://github.com/typpo/promptfoo
- Meta Llama 3
-
Launch HN: Talc AI (YC S23) – Test Sets for AI
Congrats on the launch!
I've been interested in automatic testset generation because I find that the chore of writing tests is one of the reasons people shy away from evals. Recently landed eval testset generation for promptfoo (https://github.com/typpo/promptfoo), but it is non-RAG so more simplistic than your implementation.
Was also eyeballing this paper https://arxiv.org/abs/2401.03038, which outlines a method for generating asserts from prompt version history that may also be useful for these eval tools.
-
GPT-Prompt-Engineer
Thanks for the promptfoo mention. For anyone else who might prefer deterministic, programmatic evaluation of LLM outputs, I've been building promptfoo: https://github.com/typpo/promptfoo
Example asserts include basic string checks, regex, is-json, cosine similarity, etc.
What are some alternatives?
ollama - Get up and running with Llama 3, Mistral, Gemma, and other large language models.
rebuff - LLM Prompt Injection Detector
TensorRT-LLM - TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
gpt-engineer - Specify what you want it to build, the AI asks for clarification, and then builds it.
mlx - MLX: An array framework for Apple silicon
ChainForge - An open-source visual programming environment for battle-testing prompts to LLMs.
cloudseeder - One-click install internet appliances that operate on your terms. Transform your home computer into a sovereign and secure cloud.
gateway - A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.
llama-chat - Implements a simple REPL chat with a locally running instance of Ollama.
shap-e - Generate 3D objects conditioned on text or images
sugarcane-ai - npm like package ecosystem for Prompts 🤖
plandex - AI driven development in your terminal. Designed for large, real-world tasks.