SaaSHub helps you find the best software and product alternatives Learn more →
Top 23 Alignment Open-Source Projects
-
Pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
3DDFA_V2
The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
-
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
-
gangealing
Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
-
facexlib
FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.
-
HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
-
mesh_mesh_align_plus
Precisely align, move, and measure+match objects and mesh parts in your 3D scenes.
-
rmsd
Calculate Root-mean-square deviation (RMSD) of two molecules, using rotation, in xyz or pdb format
-
subaligner
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
-
ExpertLLaMA
An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
It's also possible for you to give a package an alias by using the as keyword. For instance, you could use the pandas package as pd like this:
Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | news.ycombinator.com | 2023-10-23> Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.
Not necessarily indicative of foul play. I have two projects like this (https://github.com/smacke/ffsubsync and https://github.com/ipyflow/ipyflow) and I attribute it to not having great developer documentation.
Project mention: Automatic Language-Agnostic Subtitle Synchronization | news.ycombinator.com | 2024-02-13
Project mention: stable diffusion downloads something from github when making a image | /r/StableDiffusion | 2023-07-22"https://github.com/xinntao/facexlib/releases/download/v0.1.0/detection_Resnet50_Final.pth"
Project mention: Looking for an Open Source project to participate in for Google Summer of Code | /r/golang | 2023-12-10
I came across this open source library a few times across Reddit + HN and something that piqued my interest was their concept around "test-driven alignment".
If you are using no-code solutions, increasing an "idea" in a dataset will make that idea more likely to appear.
If you are fine-tuning your own LLM, there are other ways to get your idea to appear. In the literature this is sometimes called RLHF or preference optimization, and here are a few approaches:
Direct Preference Optimization
This uses Elo-scores to learn pairwise preferences. Elo is used in chess and basketball to rank individuals who compete in pairs.
@argilla_io on X.com has been doing some work in evaluating DPO.
Here is a decent thread on this: https://x.com/argilla_io/status/1745057571696693689?s=20
Identity Preference Optimization
IPO is research from Google DeepMind. It removes the reliance of Elo scores to address overfitting issues in DPO.
Paper: https://x.com/kylemarieb/status/1728281581306233036?s=20
Kahneman-Tversky Optimization
KTO is an approach that uses mono preference data. For example, it asks if a response is "good or not." This is helpful for a lot of real word situations (e.g. "Is the restaurant well liked?").
Here is a brief discussion on it:
https://x.com/ralphbrooks/status/1744840033872330938?s=20
Here is more on KTO:
* Paper: https://github.com/ContextualAI/HALOs/blob/main/assets/repor...
* Code: https://github.com/ContextualAI/HALOs
Project mention: ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | /r/singularity | 2023-05-25The answering quality of an aligned large language model (LLM) can be drastically improved if treated with proper crafting of prompts. In this paper, we propose ExpertPrompting to elicit the potential of LLMs to answer as distinguished experts. We first utilize In-Context Learning to automatically synthesize detailed and customized descriptions of the expert identity for each specific instruction, and then ask LLMs to provide answer conditioned on such agent background. Based on this augmented prompting strategy, we produce a new set of instruction-following data using GPT-3.5, and train a competitive open-source chat assistant called ExpertLLaMA. We employ GPT4-based evaluation to show that 1) the expert data is of significantly higher quality than vanilla answers, and 2) ExpertLLaMA outperforms existing open-source opponents and achieves 96\% of the original ChatGPT's capability. All data and the ExpertLLaMA model will be made publicly available at this https URL.
Alignment related posts
-
PDEP-13: The Pandas Logical Type System
-
Pandas reset_index(): How To Reset Indexes in Pandas
-
Help Us Build Our Roadmap – Pydantic
-
Automatic Language-Agnostic Subtitle Synchronization
-
Mastering Pandas read_csv() with Examples - A Tutorial by Codes With Pankaj
-
How do people know when to use what programming language?
-
stable diffusion downloads something from github when making a image
-
A note from our sponsor - SaaSHub
www.saashub.com | 21 May 2024
Index
What are some of the best open-source Alignment projects? This list will help you:
Project | Stars | |
---|---|---|
1 | Pandas | 42,159 |
2 | ffsubsync | 6,547 |
3 | 3DDFA_V2 | 2,799 |
4 | aeneas | 2,379 |
5 | DECA | 2,026 |
6 | tabulate | 1,810 |
7 | SmartSystemMenu | 1,748 |
8 | MMseqs2 | 1,268 |
9 | gangealing | 1,008 |
10 | alass | 975 |
11 | facexlib | 759 |
12 | DataDreamer | 681 |
13 | poly | 653 |
14 | tanuki.py | 639 |
15 | HALOs | 573 |
16 | mesh_mesh_align_plus | 543 |
17 | hh-suite | 504 |
18 | betteralign | 486 |
19 | rmsd | 474 |
20 | SeqAn | 458 |
21 | RealSR | 425 |
22 | subaligner | 421 |
23 | ExpertLLaMA | 288 |
Sponsored