long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". (by google-deepmind)
trajectopy-core
Trajectopy - Trajectory Evaluation in Python (by gereon-t)
Metric | long-form-factuality | trajectopy-core |
---|---|---|
Mentions | 2 | 1 |
Stars | 447 | 1 |
Growth | 80.5% | - |
Activity | 6.3 | 9.2 |
Last commit | 7 days ago | 5 days ago |
Language | Python | Python |
License | GNU General Public License v3.0 or later | MIT License |
Mentions - the total number of mentions that we've tracked plus the number of user-suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
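The exact formula behind the activity number is not given here, so the following is only a minimal sketch of how such a rating could work. It assumes that commit weights decay exponentially with age and that the final rating is a percentile rank scaled to 0-10. The function names, the 30-day half-life, and the sample projects are all hypothetical.

```python
import bisect


def weighted_commit_score(commit_ages_days, half_life_days=30.0):
    """Sum of commit weights; a commit loses half its weight every half-life.

    The exponential decay and the 30-day half-life are assumptions made
    for illustration only.
    """
    return sum(0.5 ** (age / half_life_days) for age in commit_ages_days)


def activity_ratings(raw_scores):
    """Scale each project's percentile rank to the range 0-10.

    A rating of 9.0 means that 90% of the tracked projects have a strictly
    lower raw score, i.e. the project is in the top 10%.
    """
    ordered = sorted(raw_scores.values())
    total = len(ordered)
    return {
        name: round(10.0 * bisect.bisect_left(ordered, score) / total, 1)
        for name, score in raw_scores.items()
    }


# Ten hypothetical projects; project_k has k commits made today.
projects = {f"project_{k}": weighted_commit_score([0] * k) for k in range(10)}
ratings = activity_ratings(projects)
print(ratings["project_9"])  # 9.0
```

Ranking against the other tracked projects, rather than reporting the raw weighted score, is what makes the number relative: the same commit history can yield a different rating as the rest of the population changes.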
long-form-factuality
Posts with mentions or reviews of long-form-factuality.
We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-06.
- An Open Source Tool for Multimodal Fact Verification
Isn't this similar to the Deepmind paper on long form factuality posted a few days ago?
https://arxiv.org/abs/2403.18802
https://github.com/google-deepmind/long-form-factuality/tree...
- LongFact – Long-Form Factuality in Large Language Models
trajectopy-core
Posts with mentions or reviews of trajectopy-core.
We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2023-10-28.
- Trajectory Evaluation in Python - Update
The first, called trajectopy, is a full-fledged application with a PyQt6-based graphical user interface (GUI). This GUI-driven platform simplifies trajectory-related tasks and offers an intuitive user experience. For those who want a more in-depth approach, there is trajectopy-core. This backend implementation, which has no PyQt6 dependencies, provides essential functionality, e.g. for computing the absolute trajectory error (ATE) and the relative pose error (RPE).
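To make the two metrics named above concrete, here is a minimal pure-Python sketch. It does not use the trajectopy-core API; every function name is hypothetical. It assumes planar poses `(x, y, heading)` that are already time-matched and aligned to a common frame, and it reports only the translational part of the RPE.

```python
import math


def ate_rmse(estimated_xy, reference_xy):
    """ATE as the RMSE of position differences between matched samples."""
    if not estimated_xy or len(estimated_xy) != len(reference_xy):
        raise ValueError("trajectories must be non-empty and of equal length")
    squared = [
        (ex - rx) ** 2 + (ey - ry) ** 2
        for (ex, ey), (rx, ry) in zip(estimated_xy, reference_xy)
    ]
    return math.sqrt(sum(squared) / len(squared))


def relative_motion(pose_a, pose_b):
    """Translation from pose_a to pose_b, expressed in the frame of pose_a."""
    xa, ya, theta_a = pose_a
    xb, yb, _ = pose_b
    dx, dy = xb - xa, yb - ya
    c, s = math.cos(theta_a), math.sin(theta_a)
    # Rotate the world-frame displacement into the local frame of pose_a.
    return (c * dx + s * dy, -s * dx + c * dy)


def rpe_translation_rmse(estimated, reference, delta=1):
    """Translational RPE: RMSE of the mismatch between relative motions."""
    if delta < 1 or len(estimated) != len(reference) or len(estimated) <= delta:
        raise ValueError("need equal-length trajectories longer than delta >= 1")
    squared = []
    for i in range(len(estimated) - delta):
        ex, ey = relative_motion(estimated[i], estimated[i + delta])
        rx, ry = relative_motion(reference[i], reference[i + delta])
        # The norm of this difference equals the translation norm of
        # inverse(reference motion) composed with the estimated motion.
        squared.append((ex - rx) ** 2 + (ey - ry) ** 2)
    return math.sqrt(sum(squared) / len(squared))


# A straight reference path and an estimate shifted sideways by 1 unit.
reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
estimated = [(x, y + 1.0, theta) for x, y, theta in reference]

print(ate_rmse([p[:2] for p in estimated], [p[:2] for p in reference]))  # 1.0
print(rpe_translation_rmse(estimated, reference))  # 0.0
```

The example shows why the two metrics complement each other: a constant offset produces a nonzero ATE, while the RPE stays at zero because every step-to-step motion matches the reference.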
What are some alternatives?
When comparing long-form-factuality and trajectopy-core you can also consider the following projects:
torch-fidelity - High-fidelity performance metrics for generative models in PyTorch
rexmex - A general purpose recommender metrics library for fair evaluation.
py-motmetrics - Benchmark multiple object trackers (MOT) in Python
trajectopy - Trajectopy - Trajectory Evaluation in Python
avalanche - Avalanche: an End-to-End Library for Continual Learning based on PyTorch.
datasets - 🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
ZnH5MD - ZnH5MD - High Performance Interface for H5MD Trajectories