Top 17 robustness Open-Source Projects

promptbench

4 2,103 9.2 Python

A unified evaluation framework for large language models

Project mention: Show HN: Times faster LLM evaluation with Bayesian optimization | news.ycombinator.com | 2024-02-13

Fair question.
Evaluate refers to the phase after training to check if the training is good.
Usually the flow goes training -> evaluation -> deployment (what you called inference). This project is aimed for evaluation. Evaluation can be slow (might even be slower than training if you're finetuning on a small domain specific subset)!
So there are [quite](https://github.com/microsoft/promptbench) [a](https://github.com/confident-ai/deepeval) [few](https://github.com/openai/evals) [frameworks](https://github.com/EleutherAI/lm-evaluation-harness) working on evaluation, however, all of them are quite slow, because LLM are slow if you don't have infinite money. [This](https://github.com/open-compass/opencompass) one tries to speed up by parallelizing on multiple computers, but none of them takes advantage of the fact that many evaluation queries might be similar and all try to evaluate on all given queries. And that's where this project might come in handy.

advertorch

1 1,273 0.0 Jupyter Notebook

A Toolbox for Adversarial Robustness Research
InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
OpenOOD

2 762 7.5 Python

Benchmarking Generalized Out-of-Distribution Detection

Project mention: [Online Leaderboard | Easy Evaluation] OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection | /r/DeepLearningPapers | 2023-06-28

Open-sourced implementations of 40+ advanced methods (see our repo);

natural-adv-examples

1 573 0.0 Python

A Harder ImageNet Test Set (CVPR 2021)
Awesome-Out-Of-Distribution-Detection

6 553 7.6

A professionally curated list of papers, tutorials, books, videos, articles and open-source libraries etc for Out-of-distribution detection, robustness, and generalization

Project mention: Awesome-Out-Of-Distribution-Detection | /r/aiengineer | 2023-08-15

photoguard

7 520 1.8 Jupyter Notebook

Raising the Cost of Malicious AI-Powered Image Editing

Project mention: PhotoGuard - сервіс для захисту зображень від нейромереж. Працює за допомогою моделей редагування фотографій на основі машинного навчання, таких як Stable Diffusion. | /r/LinuxUkraine | 2023-12-04

safe-control-gym

2 528 6.2 Python

PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
assembled-cnn

1 330 0.0 Python

Tensorflow implementation of "Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network"
auto_LiRPA

1 265 4.2 Python

auto_LiRPA: An Automatic Linear Relaxation based Perturbation Analysis Library for Neural Networks and General Computational Graphs
linqit

2 245 3.5 Python

Extend python lists operations using .NET's LINQ syntax for clean and fast coding.
ImageNetV2

1 225 2.1 Jupyter Notebook

A new test set for ImageNet
alpha-beta-CROWN

1 206 4.7 Python

alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, and 2023)
ModelNet40-C

2 202 0.0 Python

Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296
ViTs-vs-CNNs

1 171 0.0 Python

[NeurIPS 2021]: Are Transformers More Robust Than CNNs? (Pytorch implementation & checkpoints)
fiddler-auditor

2 142 8.1 Python

Fiddler Auditor is a tool to evaluate language models.

Project mention: I asked 60 LLMs a set of 20 questions | news.ycombinator.com | 2023-09-09

This is really cool!
I've been using this auditor tool that some friends at Fiddler created: https://github.com/fiddler-labs/fiddler-auditor
They went with a langchain interface for custom Evals which I really like. I am curious to hear if anyone has tried both of these. What's been your key take away for these?

adversarial-reinforcement-learning

15 76 6.1

Reading list for adversarial perspective and robustness in deep reinforcement learning.

Project mention: Safety in Deep Reinforcement Learning | /r/programming | 2023-12-06

LBGAT

2 33 2.8 Python

Learnable Boundary Guided Adversarial Training (ICCV2021)
SaaSHub

www.saashub.com featured

SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives

NOTE: The open source projects on this list are ordered by number of github stars. The number of mentions indicates repo mentiontions in the last 12 Months or since we started tracking (Dec 2020).

robustness related posts

Show HN: Times faster LLM evaluation with Bayesian optimization

6 projects | news.ycombinator.com | 13 Feb 2024
PhotoGuard - сервіс для захисту зображень від нейромереж. Працює за допомогою моделей редагування фотографій на основі машинного навчання, таких як Stable Diffusion.

1 project | /r/LinuxUkraine | 4 Dec 2023
Are there any tools for "Defend Against the Dark Arts" of diffusion?

1 project | /r/StableDiffusion | 4 Jun 2023
Raising the Cost of Malicious AI-Powered Image Editing

1 project | news.ycombinator.com | 27 Feb 2023
PhotoGuard: Defending Against Diffusion-Based Image Manipulation

1 project | news.ycombinator.com | 12 Dec 2022
PhotoGuard: Defending Against Diffusion-Based Image Manipulation

1 project | news.ycombinator.com | 10 Dec 2022
Welcome to a community to discuss what to do about the negative effects of AI art

1 project | /r/ArtistProtectionToAI | 4 Dec 2022
A note from our sponsor - InfluxDB
www.influxdata.com | 15 May 2024

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality. Learn more →

Index

What are some of the best open-source robustness projects? This list will help you:

	Project	Stars
1	promptbench	2,103
2	advertorch	1,273
3	OpenOOD	762
4	natural-adv-examples	573
5	Awesome-Out-Of-Distribution-Detection	553
6	photoguard	520
7	safe-control-gym	528
8	assembled-cnn	330
9	auto_LiRPA	265
10	linqit	245
11	ImageNetV2	225
12	alpha-beta-CROWN	206
13	ModelNet40-C	202
14	ViTs-vs-CNNs	171
15	fiddler-auditor	142
16	adversarial-reinforcement-learning	76
17	LBGAT	33