| | openllmetry | nannyml |
|---|---|---|
| Mentions | 3 | 7 |
| Stars | 1,299 | 1,761 |
| Growth | 16.2% | 1.2% |
| Activity | 9.8 | 8.6 |
| Last commit | 6 days ago | 7 days ago |
| Language | Python | Python |
| License | Apache License 2.0 | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
openllmetry
-
Pydantic Logfire
I’m also aware of other OSS initiatives doing similar things, so I wouldn’t say no one has ever done what you’re doing.
[1] https://github.com/traceloop/openllmetry
-
Show HN: You don't need to adopt new tools for LLM observability
So why should it be different when the app you're building happens to use LLMs?
So today we're open-sourcing OpenLLMetry-JS. It's an open protocol and SDK, based on OpenTelemetry, that provides traces and metrics for LLM JS/TS applications and can be connected to any of the 15+ tools that already support OpenTelemetry. Here's the repo: https://github.com/traceloop/openllmetry-js
A few months ago we launched the Python flavor here (https://news.ycombinator.com/item?id=37843907), and we've now built a compatible one for Node.js.
Would love to hear your thoughts and opinions!
Check it out -
Docs: https://www.traceloop.com/docs/openllmetry/getting-started-t...
GitHub: https://github.com/traceloop/openllmetry-js
nannyml
-
Introduction to NannyML: Model Evaluation without labels
To address this issue, NannyML was created. NannyML is an open-source Python library designed to make it easy to monitor drift in the distributions of your model's input variables and to estimate model performance (even without labels!) thanks to the Confidence-Based Performance Estimation algorithm its developers created. But first of all, why do models need to be monitored, and why might their performance vary over time?
- Detecting silent model failure. NannyML estimates performance for regression and classification models using tabular data, and alerts you when performance changes and why. It is the only open-source library capable of fully capturing the impact of data drift on performance.
-
[D] Data drift is not a good indicator of model performance degradation
But I may have it, haha. What we propose in the blog post, instead of relying solely on data drift, is using performance estimation methods (e.g. https://github.com/NannyML); with these you can estimate the performance of the ML model without having access to ground truth.
-
[HIRING][Full Time, Part Time, Temporary, Internship, Freelance] Data Science Intern (Remote)
Description: NannyML, creators of an open-source Python library, are looking for multiple Data Science interns to help across research, prototyping, and product. GitHub: https://github.com/NannyML/nannyml About Us: NannyML is an open-source Python lib …
-
What do you think about Detecting Silent ML Failure with an Open Source Python library?
If you think this could add value to your daily life, check it out here: https://github.com/NannyML/nannyml.
-
Can I estimate the impact of data drift on performance?
I found it implemented here: https://github.com/NannyML/nannyml
-
Show HN: OSS Python library for detecting silent ML model failure
What are some alternatives?
ludwig - Low-code framework for building custom LLMs, neural networks, and other AI models
evidently - Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b
openlit - OpenLIT is an open-source GenAI and LLM observability platform native to OpenTelemetry, providing traces, metrics, and application performance monitoring (APM) in a single tool 🔥🖥
cuttle-cli - Cuttle automates the transformation of your Python notebook into deployment-ready projects (API, ML pipeline, or just a Python script)
deep-significance - Enabling easy statistical significance testing for deep neural networks.
barfi - A flow-based programming framework for Python that provides a graphical programming environment.
ydata-profiling - 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
eurybia - ⚓ Eurybia monitors model drift over time and secures model deployment with data validation
cyclops - Toolkit for evaluating and monitoring AI models in clinical settings
deepchecks - Deepchecks: tests for continuous validation of ML models and data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling you to thoroughly test your data and models from research to production.
frouros - Frouros: an open-source Python library for drift detection in machine learning systems.
model-validation-toolkit - Model Validation Toolkit is a collection of tools to assist with validating machine learning models before deploying them to production and monitoring them afterwards.