Top 8 Python language-modeling Projects
- tape
Tasks Assessing Protein Embeddings (TAPE), a set of five biologically relevant semi-supervised learning tasks spread across different domains of protein biology. (by songlab-cal)
- long-form-factuality
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".
- FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
- recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
I think of guardrails as another dimension of human preferences: whether you are training a model to answer questions more gooder or avoid saying horrifying stuff, you are teaching the model a preference. So I think it's a straightforward RLHF problem, just from a different perspective.
Project mention: An Open Source Tool for Multimodal Fact Verification | news.ycombinator.com | 2024-04-06
Isn't this similar to the Deepmind paper on long form factuality posted a few days ago?
https://arxiv.org/abs/2403.18802
https://github.com/google-deepmind/long-form-factuality/tree...
Looks like a slight modification of FActScore [1], but instead of using Wikipedia as a verification source, they use Google Search. They also claim to include a wider range of topics. That said, FActScore allows you to use whatever knowledge source and topics you want [2].
[1]: https://arxiv.org/abs/2305.14251
[2]: https://github.com/shmsw25/FActScore?tab=readme-ov-file#to-u...
Project mention: [D] To all the machine learning engineers: most difficult model task/type you’ve ever had to work with? | /r/MachineLearning | 2023-07-03
Python language-modeling related posts
- How To Setup a Model With Guardrails?
- OpenDILab Awesome Paper Collection: RL with Human Feedback (2)
- Best option for creating a custom GPT AI
- An Open-Source Version of ChatGPT is Coming [News]
- Allen Institute for Artificial Intelligence Introduces MemPrompt: A New Method to “fix” GPT-3 After Deployment with User Interaction
- [D] Paper Review Video - Memory-assisted prompt editing to improve GPT-3 after deployment
- Swiss AI Lab Team Going Beyond Linear Transformers with Recurrent Fast Weight Programmers
Index
What are some of the best open-source language-modeling projects in Python? This list will help you:
# | Project | Stars |
---|---|---|
1 | RL4LMs | 2,084 |
2 | tape | 620 |
3 | long-form-factuality | 435 |
4 | memprompt | 320 |
5 | FActScore | 210 |
6 | deepblast | 96 |
7 | recurrent-fwp | 46 |
8 | verified-smart-contracts | 18 |