langfuse vs trulens

| | langfuse | trulens |
|---|---|---|
| Mentions | 11 | 14 |
| Stars | 3,681 | 1,669 |
| Growth | 30.4% | 10.1% |
| Activity | 9.9 | 9.8 |
| Latest commit | 7 days ago | 5 days ago |
| Language | TypeScript | Jupyter Notebook |
| License | GNU General Public License v3.0 or later | MIT License |
Stars - the number of stars that a project has on GitHub. Growth - month-over-month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
langfuse
- Top Open Source Prompt Engineering Guides & Tools🔧🏗️🚀
Langfuse is an open-source LLM engineering platform that helps teams collaboratively debug, analyze, and iterate on their LLM applications.
- Roast My Docs
- Show HN: Open-Source LLM Observability and Export to Grafana, Datadog etc.
Congrats on the Show! How’s this different from https://github.com/langfuse/langfuse? The exports seem really interesting.
- RAG observability in 2 lines of code with Llama Index & Langfuse
Thus, we started working on Langfuse.com (GitHub) to establish an open-source LLM engineering platform with tightly integrated features for tracing, prompt management, and evaluation. In the beginning we just solved our own and our friends’ problems. Today we are at over 1,000 projects that rely on Langfuse, and 2.3k stars on GitHub. You can either self-host Langfuse or use the cloud instance maintained by us.
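For context, the "2 lines" in the post title are the callback registration. Here is a minimal sketch, assuming the v2-era `langfuse.llama_index` integration and a recent `llama_index` with the global `Settings` object; module paths have moved between versions, so treat it as illustrative:

```python
# Sketch of the two-line Langfuse/LlamaIndex integration; exact import paths
# vary across SDK versions, so this is illustrative rather than canonical.
from llama_index.core import Settings, VectorStoreIndex, SimpleDirectoryReader
from llama_index.core.callbacks import CallbackManager
from langfuse.llama_index import LlamaIndexCallbackHandler

# The "2 lines": register Langfuse as a global callback handler.
langfuse_handler = LlamaIndexCallbackHandler()  # reads LANGFUSE_* keys from env
Settings.callback_manager = CallbackManager([langfuse_handler])

# Ordinary LlamaIndex usage from here on; traces are captured automatically.
index = VectorStoreIndex.from_documents(SimpleDirectoryReader("docs").load_data())
print(index.as_query_engine().query("What does Langfuse trace?"))
```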
- langfuse VS agenta - a user suggested alternative
2 projects | 22 Nov 2023
- Ask HN: Who is hiring? (November 2023)
- We want to build the kind of tool that gets recommended here on HN: you would be building a tool you would want to use yourself.
Please see more details here: https://langfuse.com/careers or reach out directly to me: [email protected]
[1] https://github.com/langfuse/langfuse
[2] https://create.t3.gg/
- How are generative AI companies monitoring their systems in production?
We struggled with this ourselves while building LLM-based products and then open-sourced our observability/monitoring tool [1]. Many use it to track RAG and agents in production, run custom evals on the production traces (focused on hallucination), and track how metrics are different across releases or customers. Feel free to dm if there is something specific you are looking to solve, happy to help.
[1] https://github.com/langfuse/langfuse
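As a rough illustration of the workflow described above (trace production calls, then attach custom eval scores to the traces), here is a minimal sketch assuming the v2-era Langfuse Python SDK; method names like `trace`, `generation`, and `score` may differ in other releases:

```python
# Hedged sketch: trace one RAG call and attach a custom hallucination score.
from langfuse import Langfuse

langfuse = Langfuse()  # reads LANGFUSE_PUBLIC_KEY / LANGFUSE_SECRET_KEY from env

trace = langfuse.trace(name="rag-query", metadata={"release": "v0.4.1"})
trace.generation(
    name="answer",
    model="gpt-3.5-turbo",
    input=[{"role": "user", "content": "How do I rotate my API key?"}],
    output="Go to Settings -> API Keys and click Rotate.",
)

# Record the result of a custom eval (e.g. a hallucination check) on the trace,
# so metrics can be compared across releases or customers later.
trace.score(name="hallucination", value=0.92, comment="grounded in retrieved docs")
langfuse.flush()  # ensure buffered events are sent before the process exits
```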
- LLM Analytics 101 - How to Improve your LLM app
Visit us on Discord and GitHub to engage with our project.
- Ask HN: Any tools or frameworks to monitor the usage of OpenAI API keys?
Maybe try https://github.com/langfuse/langfuse
It was recently shared on HN
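For the key-usage question specifically, Langfuse's drop-in OpenAI wrapper is probably the shortest path. A minimal sketch, assuming the Python SDK's `langfuse.openai` module (behavior varies by SDK version):

```python
# Hedged sketch: import OpenAI through Langfuse so each call is traced
# automatically (model, token counts, latency, cost) with no other code changes.
from langfuse.openai import openai  # drop-in replacement for the openai module

completion = openai.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Say hello."}],
)
print(completion.choices[0].message.content)
```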
- Show HN: Langfuse – Open-source observability and analytics for LLM apps
trulens
- Why Vector Compression Matters
Retrieval using a single vector is called dense passage retrieval (DPR), because an entire passage (dozens to hundreds of tokens) is encoded as a single vector. ColBERT instead encodes a vector per token, where each vector is influenced by surrounding context. This leads to meaningfully better results; for example, here’s ColBERT running on Astra DB compared to DPR using openai-v3-small vectors, evaluated with TruLens on the Braintrust Coda Help Desk data set. ColBERT easily beats DPR on correctness, context relevance, and groundedness.
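To make the single-vector vs. vector-per-token distinction concrete, here is a toy NumPy sketch (random vectors and made-up dimensions, purely illustrative) contrasting DPR-style scoring with ColBERT-style MaxSim late interaction:

```python
# Toy contrast of the two scoring schemes described above: DPR uses one dot
# product per passage; ColBERT keeps a vector per token and sums, for each
# query token, its best match over the passage tokens (MaxSim).
import numpy as np

rng = np.random.default_rng(0)
norm = lambda m: m / np.linalg.norm(m, axis=-1, keepdims=True)

# DPR: one vector for the whole query and one for the whole passage.
q_vec, p_vec = norm(rng.normal(size=768)), norm(rng.normal(size=768))
dpr_score = float(q_vec @ p_vec)

# ColBERT: one vector per token (e.g. 8 query tokens, 120 passage tokens).
Q, P = norm(rng.normal(size=(8, 128))), norm(rng.normal(size=(120, 128)))
maxsim_score = float((Q @ P.T).max(axis=1).sum())  # best passage match per query token

print(f"DPR: {dpr_score:.3f}  MaxSim: {maxsim_score:.3f}")
```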
- FLaNK AI Weekly 18 March 2024
- First 15 Open Source Advent projects
12. TruLens by TruEra | GitHub | tutorial
- trulens VS agenta - a user suggested alternative
2 projects | 22 Nov 2023
- How are generative AI companies monitoring their systems in production?
3) Hallucination is probably the biggest problem we solve for. To do evals for hallucination, we typically see our users use a combination of groundedness (does the context support the LLM response?) and context relevance (is the retrieved context relevant to the query?). There's also a bunch more for the evaluations you mentioned (moderation models, sentiment, usefulness, etc.) and it's pretty easy to add custom evals.
Also - my hot take is that gpt-3.5 is good enough for evals (and sometimes better than gpt-4) if you give the LLM enough instructions on how to do the eval.
website: https://www.trulens.org/
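As a generic illustration of the groundedness-style eval described above (this is not the TruLens API; the rubric wording and the 0-10 scale are assumptions), an LLM-as-judge with detailed instructions can be as small as:

```python
# Generic LLM-as-judge sketch, NOT the TruLens API: grade groundedness with
# gpt-3.5 by giving it explicit, step-by-step grading instructions.
from openai import OpenAI

client = OpenAI()

RUBRIC = (
    "You are grading GROUNDEDNESS. Given CONTEXT and RESPONSE, score 0-10: "
    "10 = every claim in RESPONSE is directly supported by CONTEXT; "
    "0 = RESPONSE is unsupported by or contradicts CONTEXT. "
    "Check claim by claim, then output only the integer score."
)

def groundedness(context: str, response: str) -> int:
    out = client.chat.completions.create(
        model="gpt-3.5-turbo",
        temperature=0,
        messages=[
            {"role": "system", "content": RUBRIC},
            {"role": "user", "content": f"CONTEXT:\n{context}\n\nRESPONSE:\n{response}"},
        ],
    )
    return int(out.choices[0].message.content.strip())

print(groundedness("Refunds take 5-7 business days.", "You get a refund within a week."))
```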
- FLaNK Stack Weekly 28 August 2023
- [P] TruLens-Eval is an open source project for eval & tracking LLM experiments.
The team at TruEra recently released an open source project for evaluation & tracking of LLM applications called TruLens-Eval. We’ve specifically targeted retrieval-augmented QA as a core use case and so far we’ve seen it used for comparing different models and parameters, prompts, vector-db configurations and query planning strategies. I’d love to get your feedback on it.
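For a sense of what that looks like in practice, here is a minimal sketch following the TruLens-Eval quickstart of that era; import paths and feedback-function names changed across releases, and `chain` is an assumed pre-existing LangChain app:

```python
# Hedged sketch of TruLens-Eval usage (era-specific API; treat as illustrative).
from trulens_eval import Tru, TruChain, Feedback
from trulens_eval.feedback.provider.openai import OpenAI as OpenAIProvider

tru = Tru()
provider = OpenAIProvider()

# LLM-based feedback function: is the answer relevant to the question?
f_answer_relevance = Feedback(provider.relevance).on_input_output()

# Wrap an existing LangChain chain ("chain" is assumed to exist) so every call
# is recorded and scored for comparison across models, prompts, and configs.
recorder = TruChain(chain, app_id="rag-v1", feedbacks=[f_answer_relevance])
with recorder:
    chain("How do I reset my password?")

tru.run_dashboard()  # local dashboard for browsing records and feedback scores
```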
- [D] Hardest thing about building with LLMs?
- Stop Evaluating LLMs on Vibes
- OSS library for attribution and interpretation methods for deep nets
What are some alternatives?
llama_index - LlamaIndex is a data framework for your LLM applications
shapash - 🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
langchain - 🦜🔗 Build context-aware reasoning applications
probability - Probabilistic reasoning and statistical analysis in TensorFlow
agenta - The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
LIME - Tutorial notebooks on explainable Machine Learning with LIME (Original work: https://arxiv.org/abs/1602.04938)
opentelemetry-instrument-openai-py - OpenTelemetry instrumentation for the OpenAI Python library
embedchain - Personalizing LLM Responses
examples - Your one-stop-shop to try Xata out. From packages to apps, whatever you need to get started.
machine_learning_basics - Plain python implementations of basic machine learning algorithms
clickhouse_knowledge_base - The Tinybird ClickHouse Knowledge Base
ML-Workspace - 🛠 All-in-one web-based IDE specialized for machine learning and data science.