long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". (by google-deepmind)
trajectopy
Trajectopy - Trajectory Evaluation in Python (by gereon-t)
Metric | long-form-factuality | trajectopy |
---|---|---|
Mentions | 2 | 1 |
Stars | 447 | 21 |
Growth | 80.5% | - |
Activity | 6.3 | 8.6 |
Last commit | 6 days ago | 5 days ago |
Language | Python | Python |
License | GNU General Public License v3.0 or later | GNU General Public License v3.0 only |
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
long-form-factuality
Posts with mentions or reviews of long-form-factuality.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2024-04-06.
- An Open Source Tool for Multimodal Fact Verification
Isn't this similar to the DeepMind paper on long-form factuality posted a few days ago?
https://arxiv.org/abs/2403.18802
https://github.com/google-deepmind/long-form-factuality/tree...
- LongFact – Long-Form Factuality in Large Language Models
trajectopy
Posts with mentions or reviews of trajectopy.
We have used some of these posts to build our list of alternatives
and similar projects. The last one was on 2023-10-28.
- Trajectory Evaluation in Python - Update
The first, called trajectopy, is a full-fledged application with a PyQt6-based graphical user interface (GUI) that simplifies trajectory-related tasks and offers an intuitive user experience. For those who want a more in-depth approach, there is trajectopy-core. This backend implementation has no PyQt6 dependencies and provides the essential functionality, for example computing the absolute trajectory error (ATE) and relative pose error (RPE).
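To make the two metrics named in the post concrete, here is a minimal sketch in plain Python. It does not use trajectopy or trajectopy-core, and the function names `ate_rmse` and `rpe_translation` are hypothetical. It assumes both trajectories are already time-matched and expressed in the same frame, and it looks at positions only; a full RPE would also compare relative rotations.

```python
import math


def ate_rmse(estimated, reference):
    """RMSE of the point-to-point position error (translation-only ATE).

    Assumes the trajectories are time-matched and already aligned.
    """
    if not estimated or len(estimated) != len(reference):
        raise ValueError("trajectories must be non-empty and of equal length")
    squared = [math.dist(e, r) ** 2 for e, r in zip(estimated, reference)]
    return math.sqrt(sum(squared) / len(squared))


def rpe_translation(estimated, reference, step=1):
    """Error in relative displacement over a fixed index step.

    Simplified, translation-only variant of RPE; rotations are ignored.
    """
    if len(estimated) != len(reference):
        raise ValueError("trajectories must be of equal length")
    errors = []
    for i in range(len(estimated) - step):
        # Displacement travelled between pose i and pose i + step.
        d_est = [b - a for a, b in zip(estimated[i], estimated[i + step])]
        d_ref = [b - a for a, b in zip(reference[i], reference[i + step])]
        errors.append(math.dist(d_est, d_ref))
    return errors


# Illustrative data: the estimate is the reference shifted by 0.1 in y.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (3.0, 0.0, 0.0)]
estimated = [(x, y + 0.1, z) for x, y, z in reference]

ate = ate_rmse(estimated, reference)
rpe = rpe_translation(estimated, reference)
```

With this made-up data the constant offset shows up in the ATE but leaves every relative displacement unchanged, which is why the two metrics are usually reported together: ATE reflects global consistency, RPE reflects local drift.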
What are some alternatives?
When comparing long-form-factuality and trajectopy, you can also consider the following projects:
torch-fidelity - High-fidelity performance metrics for generative models in PyTorch
avalanche - Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
trajectopy-core - Trajectopy - Trajectory Evaluation in Python
rexmex - A general purpose recommender metrics library for fair evaluation.
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
py-motmetrics - Benchmark multiple object trackers (MOT) in Python
ZnH5MD - ZnH5MD - High Performance Interface for H5MD Trajectories