DeepEval – Unit Testing for LLMs

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • deepeval

    Discontinued Unit Testing For LLMs [Moved to: https://github.com/confident-ai/deepeval] (by mr-gpt)

  • bettertest

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • agentops

    Python SDK for agent evals and observability

  • promptfoo

    Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.

  • agenta

    The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.

  • I'd add ours too, although we're trying to be an end-to-end one-stop platform.

    https://github.com/agenta-ai/agenta

  • ai-notes

    notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

  • added to my notes! https://github.com/swyxio/ai-notes/

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • RAG for Medical Research

    1 project | news.ycombinator.com | 21 Oct 2023
  • Patterns for Building LLM-Based Systems and Products

    6 projects | news.ycombinator.com | 1 Aug 2023
  • Claude AI launches on iOS (Android coming soon)

    2 projects | news.ycombinator.com | 1 May 2024
  • AgentCloud vs Google Cloud Agents

    1 project | dev.to | 29 Apr 2024
  • Insights from Finetuning LLMs for Classification Tasks

    1 project | news.ycombinator.com | 28 Apr 2024