speedbump
langkit
speedbump | langkit | |
---|---|---|
17 | 5 | |
1,477 | 719 | |
- | 4.0% | |
1.2 | 8.8 | |
9 months ago | 5 days ago | |
Go | Jupyter Notebook | |
Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
speedbump
langkit
- FLaNK Stack Weekly 22 January 2024
- LangKit: An open-source toolkit for monitoring LLMs
-
Ask HN: How are you improving your use of LLMs in production?
Would love to hear feedback and thoughts on how people approach monitoring in production in real world applications in general! It's an area that I think not enough people talk about when operating LLMs.
We spent a lot of time working with various companies with GenAI use cases before LLM was a thing and captured them in our library called LangKit - it's designed to be generic and pluggable into many different systems, including langchain: https://github.com/whylabs/langkit/. It's designed beyond prompt engineering and aims to provide automated ways to monitor LLM once deployed. Happy to answer any questions here!
- LangKit: An open-source toolkit for monitoring Language Learning Models (LLMs)
- LangKit: An open-source text metrics toolkit for monitoring LLM
What are some alternatives?
dejitun - De-jitter tunnel
evals - Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Comcast - Simulating shitty network connections so you can build better systems.
promptfoo - Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
toxiproxy - :alarm_clock: :fire: A TCP proxy to simulate network and system conditions for chaos and resiliency testing
Mermaid - Edit, preview and share mermaid charts/diagrams. New implementation of the live editor.
OutRun - OutRun is an open-source, privacy oriented, outdoor fitness tracker.
fugue - A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
gateway - A Blazing Fast AI Gateway. Route to 100+ LLMs with 1 fast & friendly API.
Electric_and_Utilities_System_Demo - Using CDF, CDW, CML and Data Viz, this demo is a complete Electric and Utilities Company use case to broadly leverage the CDP Data Services platform