Top 23 Alignment Open-Source Projects

Pandas

399 42,159 10.0 Python

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Project mention: The ultimate guide to creating a secure Python package | dev.to | 2024-05-08

It's also possible for you to give a package an alias by using the as keyword. For instance, you could use the pandas package as pd like this:

ffsubsync

31 6,547 4.8 Python

Automagically synchronize subtitles with video.

Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | news.ycombinator.com | 2023-10-23

> Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.
Not necessarily indicative of foul play. I have two projects like this (https://github.com/smacke/ffsubsync and https://github.com/ipyflow/ipyflow) and I attribute it to not having great developer documentation.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
3DDFA_V2

2 2,799 0.0 Python

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
aeneas

4 2,379 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
DECA

5 2,026 0.0 Python

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
tabulate

1 1,810 6.0 C++

Table Maker for Modern C++ (by p-ranav)
SmartSystemMenu

2 1,748 5.3 C#

SmartSystemMenu extends system menu of all windows in the system
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
MMseqs2

4 1,268 7.7 C

MMseqs2: ultra fast and sensitive search and clustering suite
gangealing

3 1,008 0.0 Python

Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
alass

18 975 0.0 Rust

"Automatic Language-Agnostic Subtitle Synchronization"

Project mention: Automatic Language-Agnostic Subtitle Synchronization | news.ycombinator.com | 2024-02-13

facexlib

3 759 1.9 Python

FaceXlib aims at providing ready-to-use face-related functions based on current STOA open-source methods.

Project mention: stable diffusion downloads something from github when making a image | /r/StableDiffusion | 2023-07-22

"https://github.com/xinntao/facexlib/releases/download/v0.1.0/detection_Resnet50_Final.pth"

DataDreamer

5 681 8.5 Python

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01

poly

24 653 8.1 Go

A Go package for engineering organisms.

Project mention: Looking for an Open Source project to participate in for Google Summer of Code | /r/golang | 2023-12-10

tanuki.py

10 639 9.2 Python

Prompt engineering for developers

Project mention: [D] Alignment using tests? Will it work? | /r/MachineLearning | 2023-12-06

I came across this open source library a few times across Reddit + HN and something that piqued my interest was their concept around "test-driven alignment".

HALOs

1 573 8.8 Python

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Project mention: On Sleeper Agent LLMs | news.ycombinator.com | 2024-01-13

If you are using no-code solutions, increasing an "idea" in a dataset will make that idea more likely to appear.
If you are fine-tuning your own LLM, there are other ways to get your idea to appear. In the literature this is sometimes called RLHF or preference optimization, and here are a few approaches:
Direct Preference Optimization
This uses Elo-scores to learn pairwise preferences. Elo is used in chess and basketball to rank individuals who compete in pairs.
@argilla_io on X.com has been doing some work in evaluating DPO.
Here is a decent thread on this: https://x.com/argilla_io/status/1745057571696693689?s=20
Identity Preference Optimization
IPO is research from Google DeepMind. It removes the reliance of Elo scores to address overfitting issues in DPO.
Paper: https://x.com/kylemarieb/status/1728281581306233036?s=20
Kahneman-Tversky Optimization
KTO is an approach that uses mono preference data. For example, it asks if a response is "good or not." This is helpful for a lot of real word situations (e.g. "Is the restaurant well liked?").
Here is a brief discussion on it:
https://x.com/ralphbrooks/status/1744840033872330938?s=20
Here is more on KTO:
* Paper: https://github.com/ContextualAI/HALOs/blob/main/assets/repor...
* Code: https://github.com/ContextualAI/HALOs

mesh_mesh_align_plus

2 543 7.2 Python

Precisely align, move, and measure+match objects and mesh parts in your 3D scenes.

Project mention: Mesh Align Plus 1.0 | news.ycombinator.com | 2023-07-17

hh-suite

2 504 0.0 C

Remote protein homology detection suite.
betteralign

3 486 6.2 Go

Make your Go programs use less memory (maybe)
rmsd

1 474 4.9 Python

Calculate Root-mean-square deviation (RMSD) of two molecules, using rotation, in xyz or pdb format
SeqAn

0 458 7.4 C++

SeqAn's official repository.
RealSR

2 425 3.2 MATLAB

Toward Real-World Single Image Super-Resolution: A New Benchmark and A New Model (ICCV 2019)
subaligner

3 421 6.5 Python

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
ExpertLLaMA

1 288 6.1 Python

An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.

Project mention: ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | /r/singularity | 2023-05-25

The answering quality of an aligned large language model (LLM) can be drastically improved if treated with proper crafting of prompts. In this paper, we propose ExpertPrompting to elicit the potential of LLMs to answer as distinguished experts. We first utilize In-Context Learning to automatically synthesize detailed and customized descriptions of the expert identity for each specific instruction, and then ask LLMs to provide answer conditioned on such agent background. Based on this augmented prompting strategy, we produce a new set of instruction-following data using GPT-3.5, and train a competitive open-source chat assistant called ExpertLLaMA. We employ GPT4-based evaluation to show that 1) the expert data is of significantly higher quality than vanilla answers, and 2) ExpertLLaMA outperforms existing open-source opponents and achieves 96\% of the original ChatGPT's capability. All data and the ExpertLLaMA model will be made publicly available at this https URL.

SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

Alignment related posts

PDEP-13: The Pandas Logical Type System

1 project | news.ycombinator.com | 4 May 2024
Pandas reset_index(): How To Reset Indexes in Pandas

1 project | dev.to | 27 Apr 2024
Help Us Build Our Roadmap – Pydantic

2 projects | news.ycombinator.com | 19 Feb 2024
Automatic Language-Agnostic Subtitle Synchronization

1 project | news.ycombinator.com | 13 Feb 2024
Mastering Pandas read_csv() with Examples - A Tutorial by Codes With Pankaj

1 project | dev.to | 9 Dec 2023
How do people know when to use what programming language?

1 project | /r/AskProgramming | 6 Dec 2023
stable diffusion downloads something from github when making a image

1 project | /r/StableDiffusion | 22 Jul 2023
A note from our sponsor - SaaSHub
www.saashub.com | 21 May 2024

SaaSHub helps you find the best software and product alternatives Learn more →

Index

What are some of the best open-source Alignment projects? This list will help you:

	Project	Stars
1	Pandas	42,159
2	ffsubsync	6,547
3	3DDFA_V2	2,799
4	aeneas	2,379
5	DECA	2,026
6	tabulate	1,810
7	SmartSystemMenu	1,748
8	MMseqs2	1,268
9	gangealing	1,008
10	alass	975
11	facexlib	759
12	DataDreamer	681
13	poly	653
14	tanuki.py	639
15	HALOs	573
16	mesh_mesh_align_plus	543
17	hh-suite	504
18	betteralign	486
19	rmsd	474
20	SeqAn	458
21	RealSR	425
22	subaligner	421
23	ExpertLLaMA	288

Alignment

Top 23 Alignment Open-Source Projects

Alignment related posts

PDEP-13: The Pandas Logical Type System

Pandas reset_index(): How To Reset Indexes in Pandas

Help Us Build Our Roadmap – Pydantic

Automatic Language-Agnostic Subtitle Synchronization

Mastering Pandas read_csv() with Examples - A Tutorial by Codes With Pankaj

How do people know when to use what programming language?

stable diffusion downloads something from github when making a image

Index