spin
langkit
spin | langkit | |
---|---|---|
2 | 5 | |
881 | 724 | |
5.0% | 4.7% | |
7.8 | 8.8 | |
3 days ago | 5 days ago | |
Shell | Jupyter Notebook | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
spin
langkit
- FLaNK Stack Weekly 22 January 2024
- LangKit: An open-source toolkit for monitoring LLMs
-
Ask HN: How are you improving your use of LLMs in production?
Would love to hear feedback and thoughts on how people approach monitoring in production in real world applications in general! It's an area that I think not enough people talk about when operating LLMs.
We spent a lot of time working with various companies with GenAI use cases before LLM was a thing and captured them in our library called LangKit - it's designed to be generic and pluggable into many different systems, including langchain: https://github.com/whylabs/langkit/. It's designed beyond prompt engineering and aims to provide automated ways to monitor LLM once deployed. Happy to answer any questions here!
- LangKit: An open-source toolkit for monitoring Language Learning Models (LLMs)
- LangKit: An open-source text metrics toolkit for monitoring LLM
What are some alternatives?
docker-mern - One-stop repo for starting up MERN containers with as little configuration
evals - Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
laravel-docker-app - A complete Laravel, PHP Docker based development environment with individual Nginx, Web app, Queue, Scheduler, Redis containers.
promptfoo - Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
fugue - A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
Mermaid - Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
realworlddevopscourse - Accompanying files for "Real world Devops project from start to finish" course
raspberrypi-homeserver - A collection of applications and tools to make awesome Raspberry Pi homerserver