-
LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
Here, we make use of Eleuther AI’s LM evaluation harness repository (https://github.com/EleutherAI/lm-evaluation-harness) to get QA accuracy results. We also evaluate all models’ NLL metrics on their datasets, with their questions as contexts and answers as output sentences.
LMFlow: https://github.com/OptimalScale/LMFlow
This is super interesting! Thanks for sharing. We're also working on this research field from an open-source angle (https://github.com/Giskard-AI/giskard)
Related posts
-
Anomaly Detection with FiftyOne and Anomalib
-
May 8, 2024 AI, Machine Learning and Computer Vision Meetup
-
Voxel51 Is Hiring AI Researchers and Scientists — What the New Open Science Positions Mean
-
Machine Learning and AI Beyond the Basics Book
-
Show HN: Evaluate LLM-based RAG Applications with automated test set generation