Top 17 Python Alignment Projects

Pandas

393 41,923 10.0 Python

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Project mention: Deploying a Serverless Dash App with AWS SAM and Lambda | dev.to | 2024-03-04

Dash is a Python framework that enables you to build interactive frontend applications without writing a single line of JavaScript. Internally and in projects we like to use it to build a quick proof of concept for data-driven applications because of the nice integration with Plotly and pandas. For this post, I'm going to assume that you're already familiar with Dash and won't explain that part in detail. Instead, we'll focus on what's necessary to make it run serverless.

ffsubsync

31 6,495 4.8 Python

Automagically synchronize subtitles with video.

Project mention: The GitHub Black Market That Helps Coders Cheat the Popularity Contest | news.ycombinator.com | 2023-10-23

> Another giveaway is the ratio of stars to watchers / forks. I remember one project with thousands of stars but only 10 users "watching" it. They went on to raise a sizable seed round too.
Not necessarily indicative of foul play. I have two projects like this (https://github.com/smacke/ffsubsync and https://github.com/ipyflow/ipyflow) and I attribute it to not having great developer documentation.

3DDFA_V2

2 2,777 0.0 Python

The official PyTorch implementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment, ECCV 2020.
aeneas

4 2,379 0.0 Python

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
DECA

5 2,005 0.0 Python

DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)

Project mention: Looking for a NLP expert to help me in a project about body movements and facial expressions. | /r/MLQuestions | 2023-04-27

[2] https://github.com/yfeng95/DECA

gangealing

3 1,008 0.0 Python

Official PyTorch Implementation of "GAN-Supervised Dense Visual Alignment" (CVPR 2022 Oral, Best Paper Finalist)
facexlib

3 741 1.9 Python

FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.

Project mention: stable diffusion downloads something from github when making a image | /r/StableDiffusion | 2023-07-22

"https://github.com/xinntao/facexlib/releases/download/v0.1.0/detection_Resnet50_Final.pth"

DataDreamer

5 632 8.1 Python

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Project mention: FLaNK AI - 01 April 2024 | dev.to | 2024-04-01

tanuki.py

10 634 9.2 Python

Prompt engineering for developers

Project mention: [D] Alignment using tests? Will it work? | /r/MachineLearning | 2023-12-06

I came across this open source library a few times across Reddit + HN and something that piqued my interest was their concept around "test-driven alignment".

mesh_mesh_align_plus

2 540 7.2 Python

Precisely align, move, and measure+match objects and mesh parts in your 3D scenes.

Project mention: Mesh Align Plus 1.0 | news.ycombinator.com | 2023-07-17

HALOs

1 525 8.6 Python

A library with extensible implementations of DPO, KTO, PPO, and other human-aware loss functions (HALOs).

Project mention: On Sleeper Agent LLMs | news.ycombinator.com | 2024-01-13

If you are using no-code solutions, increasing an "idea" in a dataset will make that idea more likely to appear.
If you are fine-tuning your own LLM, there are other ways to get your idea to appear. In the literature this is sometimes called RLHF or preference optimization, and here are a few approaches:
Direct Preference Optimization
This uses Elo-scores to learn pairwise preferences. Elo is used in chess and basketball to rank individuals who compete in pairs.
@argilla_io on X.com has been doing some work in evaluating DPO.
Here is a decent thread on this: https://x.com/argilla_io/status/1745057571696693689?s=20
Identity Preference Optimization
IPO is research from Google DeepMind. It removes the reliance on Elo scores to address overfitting issues in DPO.
Paper: https://x.com/kylemarieb/status/1728281581306233036?s=20
Kahneman-Tversky Optimization
KTO is an approach that uses mono preference data. For example, it asks if a response is "good or not." This is helpful for a lot of real-world situations (e.g. "Is the restaurant well liked?").
Here is a brief discussion on it:
https://x.com/ralphbrooks/status/1744840033872330938?s=20
Here is more on KTO:
* Paper: https://github.com/ContextualAI/HALOs/blob/main/assets/repor...
* Code: https://github.com/ContextualAI/HALOs
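To make the DPO item above more concrete, here is a minimal sketch of the per-example DPO loss in plain Python. It is not the HALOs library's API: the function names, argument names, and the default `beta` are assumptions chosen for illustration. The inputs are assumed to be summed log-probabilities of the chosen and rejected responses under the policy being trained and under a frozen reference model. Note that the DPO paper formulates the pairwise preference with a Bradley-Terry model, which is closely related to Elo-style ratings.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x)), computed as -softplus(-x)."""
    return -(max(-x, 0.0) + math.log1p(math.exp(-abs(x))))


def dpo_loss(
    policy_chosen_logp: float,
    policy_rejected_logp: float,
    ref_chosen_logp: float,
    ref_rejected_logp: float,
    beta: float = 0.1,  # assumed value; controls how strongly the policy may drift
) -> float:
    """DPO loss for one (chosen, rejected) preference pair.

    loss = -log sigmoid(beta * [(log pi(y_w) - log ref(y_w))
                                - (log pi(y_l) - log ref(y_l))])
    """
    chosen_gain = policy_chosen_logp - ref_chosen_logp
    rejected_gain = policy_rejected_logp - ref_rejected_logp
    return -log_sigmoid(beta * (chosen_gain - rejected_gain))


if __name__ == "__main__":
    # Policy identical to the reference: the margin is zero, so the loss is log(2).
    print(dpo_loss(-1.0, -2.0, -1.0, -2.0))
    # Policy has shifted probability toward the chosen response: lower loss.
    print(dpo_loss(-0.5, -3.0, -1.0, -2.0))
```

The loss only depends on how much the policy has moved relative to the reference on each response, which is why no separate reward model appears in the sketch.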

rmsd

1 463 4.9 Python

Calculate Root-mean-square deviation (RMSD) of two molecules, using rotation, in xyz or pdb format
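For reference, the RMSD between two structures with N matched atoms is the square root of the mean squared distance between corresponding atoms. The sketch below, in plain Python, is not the rmsd package's API; the function names are hypothetical. It assumes both coordinate lists are in the same atom order, and it only removes translation by centering each structure on its centroid. The optimal-rotation step that the project description mentions is deliberately left out here.

```python
import math


def centroid(points):
    """Mean position of a list of (x, y, z) tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(3))


def center(points):
    """Translate the points so that their centroid sits at the origin."""
    c = centroid(points)
    return [tuple(p[i] - c[i] for i in range(3)) for p in points]


def rmsd(a, b):
    """Root-mean-square deviation between two equally sized, matched point lists."""
    if not a or len(a) != len(b):
        raise ValueError("both structures need the same non-zero number of atoms")
    squared = sum(
        sum((p[i] - q[i]) ** 2 for i in range(3)) for p, q in zip(a, b)
    )
    return math.sqrt(squared / len(a))


if __name__ == "__main__":
    mol_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    # Same geometry, shifted by (5, 5, 5).
    mol_b = [(5.0, 5.0, 5.0), (6.0, 5.0, 5.0)]
    print(rmsd(mol_a, mol_b))                  # large: dominated by the shift
    print(rmsd(center(mol_a), center(mol_b)))  # 0.0 once translation is removed
```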
subaligner

3 415 6.5 Python

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
ExpertLLaMA

1 289 6.1 Python

An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.

Project mention: ExpertPrompting: Instructing Large Language Models to be Distinguished Experts | /r/singularity | 2023-05-25

The answering quality of an aligned large language model (LLM) can be drastically improved if treated with proper crafting of prompts. In this paper, we propose ExpertPrompting to elicit the potential of LLMs to answer as distinguished experts. We first utilize In-Context Learning to automatically synthesize detailed and customized descriptions of the expert identity for each specific instruction, and then ask LLMs to provide answer conditioned on such agent background. Based on this augmented prompting strategy, we produce a new set of instruction-following data using GPT-3.5, and train a competitive open-source chat assistant called ExpertLLaMA. We employ GPT4-based evaluation to show that 1) the expert data is of significantly higher quality than vanilla answers, and 2) ExpertLLaMA outperforms existing open-source opponents and achieves 96% of the original ChatGPT's capability. All data and the ExpertLLaMA model will be made publicly available.

unsupervisedRR

1 134 3.6 Python

[CVPR 2021 - Oral] UnsupervisedR&R: Unsupervised Point Cloud Registration via Differentiable Rendering
trajectopy

1 20 8.3 Python

Trajectopy - Trajectory Evaluation in Python

Project mention: Trajectory Evaluation in Python - Update | /r/robotics | 2023-10-28

The first, called trajectopy, stands as a full-fledged application featuring a PyQt6-based graphical user interface (GUI). This GUI-driven platform simplifies trajectory-related tasks and offers an intuitive user experience. For those desiring a more in-depth approach, there is trajectopy-core. This backend implementation without any PyQt6 dependencies provides essential functionality e.g. for computing absolute trajectory error (ATE) and relative pose error (RPE).
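As a rough illustration of the absolute trajectory error (ATE) mentioned above, the sketch below computes per-pose position errors and summarizes them. It does not use trajectopy or trajectopy-core; the function names are hypothetical. It assumes the estimated and ground-truth trajectories are already time-synchronized, expressed in the same frame (any alignment has been applied beforehand), and given as equally long lists of (x, y, z) positions.

```python
import math


def position_errors(estimated, ground_truth):
    """Euclidean distance between each estimated position and its ground-truth match."""
    if not estimated or len(estimated) != len(ground_truth):
        raise ValueError("trajectories need the same non-zero number of poses")
    return [math.dist(e, g) for e, g in zip(estimated, ground_truth)]


def ate_summary(estimated, ground_truth):
    """Summarize ATE as RMSE, mean, and maximum of the per-pose position errors."""
    errors = position_errors(estimated, ground_truth)
    n = len(errors)
    return {
        "rmse": math.sqrt(sum(e * e for e in errors) / n),
        "mean": sum(errors) / n,
        "max": max(errors),
    }


if __name__ == "__main__":
    ground_truth = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 0.0, 0.0)]
    # The middle pose is off by one unit along y; the others match exactly.
    estimated = [(0.0, 0.0, 0.0), (1.0, 1.0, 0.0), (2.0, 0.0, 0.0)]
    print(ate_summary(estimated, ground_truth))
```

Relative pose error (RPE) compares relative motions over fixed intervals instead of absolute positions and is not covered by this sketch.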

trajectopy-core

1 1 9.2 Python

Trajectopy - Trajectory Evaluation in Python

Project mention: Trajectory Evaluation in Python - Update | /r/robotics | 2023-10-28

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Alignment related posts

Help Us Build Our Roadmap – Pydantic
2 projects | news.ycombinator.com | 19 Feb 2024
Mastering Pandas read_csv() with Examples - A Tutorial by Codes With Pankaj
1 project | dev.to | 9 Dec 2023
How do people know when to use what programming language?
1 project | /r/AskProgramming | 6 Dec 2023
stable diffusion downloads something from github when making a image
1 project | /r/StableDiffusion | 22 Jul 2023
Mesh Align Plus 1.0
1 project | news.ycombinator.com | 17 Jul 2023
Declutter your Gmail inbox with Python: A Step-by-Step Guide
1 project | dev.to | 15 Jul 2023
Which software is suitable for achieving my goal?
2 projects | /r/ffmpeg | 12 Jul 2023
Index

What are some of the best open-source Alignment projects in Python? This list will help you:

	Project	Stars
1	Pandas	41,923
2	ffsubsync	6,495
3	3DDFA_V2	2,777
4	aeneas	2,379
5	DECA	2,005
6	gangealing	1,008
7	facexlib	741
8	DataDreamer	632
9	tanuki.py	634
10	mesh_mesh_align_plus	540
11	HALOs	525
12	rmsd	463
13	subaligner	415
14	ExpertLLaMA	289
15	unsupervisedRR	134
16	trajectopy	20
17	trajectopy-core	1