robustness

Top 17 robustness Open-Source Projects

  • promptbench

    A unified evaluation framework for large language models

  • Project mention: Show HN: Times faster LLM evaluation with Bayesian optimization | news.ycombinator.com | 2024-02-13

    Fair question.

    Evaluate refers to the phase after training to check if the training is good.

    Usually the flow goes training -> evaluation -> deployment (what you called inference). This project is aimed at evaluation. Evaluation can be slow (it might even be slower than training if you're fine-tuning on a small domain-specific subset)!

    So there are [quite](https://github.com/microsoft/promptbench) [a](https://github.com/confident-ai/deepeval) [few](https://github.com/openai/evals) [frameworks](https://github.com/EleutherAI/lm-evaluation-harness) working on evaluation. However, all of them are quite slow, because LLMs are slow if you don't have infinite money. [This](https://github.com/open-compass/opencompass) one tries to speed things up by parallelizing across multiple computers, but none of them takes advantage of the fact that many evaluation queries might be similar; they all evaluate on every given query. And that's where this project might come in handy.

  • advertorch

    A Toolbox for Adversarial Robustness Research

  • OpenOOD

    Benchmarking Generalized Out-of-Distribution Detection

  • Project mention: [Online Leaderboard | Easy Evaluation] OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection | /r/DeepLearningPapers | 2023-06-28

    Open-sourced implementations of 40+ advanced methods (see our repo);

  • natural-adv-examples

    A Harder ImageNet Test Set (CVPR 2021)

  • Awesome-Out-Of-Distribution-Detection

    A professionally curated list of papers, tutorials, books, videos, articles and open-source libraries etc for Out-of-distribution detection, robustness, and generalization

  • Project mention: Awesome-Out-Of-Distribution-Detection | /r/aiengineer | 2023-08-15
  • photoguard

    Raising the Cost of Malicious AI-Powered Image Editing

  • Project mention: PhotoGuard: a service for protecting images from neural networks. It works with machine-learning-based photo editing models such as Stable Diffusion. | /r/LinuxUkraine | 2023-12-04
  • safe-control-gym

    PyBullet CartPole and Quadrotor environments—with CasADi symbolic a priori dynamics—for learning-based control and RL

  • assembled-cnn

    TensorFlow implementation of "Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network"

  • auto_LiRPA

    auto_LiRPA: An Automatic Linear Relaxation based Perturbation Analysis Library for Neural Networks and General Computational Graphs

  • linqit

    Extend python lists operations using .NET's LINQ syntax for clean and fast coding.

  • ImageNetV2

    A new test set for ImageNet

  • alpha-beta-CROWN

    alpha-beta-CROWN: An Efficient, Scalable and GPU Accelerated Neural Network Verifier (winner of VNN-COMP 2021, 2022, and 2023)

  • ModelNet40-C

    Repo for "Benchmarking Robustness of 3D Point Cloud Recognition against Common Corruptions" https://arxiv.org/abs/2201.12296

  • ViTs-vs-CNNs

    [NeurIPS 2021]: Are Transformers More Robust Than CNNs? (PyTorch implementation & checkpoints)

  • fiddler-auditor

    Fiddler Auditor is a tool to evaluate language models.

  • Project mention: I asked 60 LLMs a set of 20 questions | news.ycombinator.com | 2023-09-09

    This is really cool!

    I've been using this auditor tool that some friends at Fiddler created: https://github.com/fiddler-labs/fiddler-auditor

    They went with a LangChain interface for custom evals, which I really like. I am curious to hear if anyone has tried both of these. What's been your key takeaway from them?

  • adversarial-reinforcement-learning

    Reading list for adversarial perspective and robustness in deep reinforcement learning.

  • Project mention: Safety in Deep Reinforcement Learning | /r/programming | 2023-12-06
  • LBGAT

    Learnable Boundary Guided Adversarial Training (ICCV2021)

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

robustness related posts

  • Show HN: Times faster LLM evaluation with Bayesian optimization

    6 projects | news.ycombinator.com | 13 Feb 2024
  • PhotoGuard: a service for protecting images from neural networks. It works with machine-learning-based photo editing models such as Stable Diffusion.

    1 project | /r/LinuxUkraine | 4 Dec 2023
  • Are there any tools for "Defend Against the Dark Arts" of diffusion?

    1 project | /r/StableDiffusion | 4 Jun 2023
  • Raising the Cost of Malicious AI-Powered Image Editing

    1 project | news.ycombinator.com | 27 Feb 2023
  • PhotoGuard: Defending Against Diffusion-Based Image Manipulation

    1 project | news.ycombinator.com | 12 Dec 2022
  • PhotoGuard: Defending Against Diffusion-Based Image Manipulation

    1 project | news.ycombinator.com | 10 Dec 2022
  • Welcome to a community to discuss what to do about the negative effects of AI art

    1 project | /r/ArtistProtectionToAI | 4 Dec 2022

Index

What are some of the best open-source robustness projects? This list will help you:

| # | Project | Stars |
|---|---------|-------|
| 1 | promptbench | 2,103 |
| 2 | advertorch | 1,273 |
| 3 | OpenOOD | 762 |
| 4 | natural-adv-examples | 573 |
| 5 | Awesome-Out-Of-Distribution-Detection | 553 |
| 6 | photoguard | 520 |
| 7 | safe-control-gym | 528 |
| 8 | assembled-cnn | 330 |
| 9 | auto_LiRPA | 265 |
| 10 | linqit | 245 |
| 11 | ImageNetV2 | 225 |
| 12 | alpha-beta-CROWN | 206 |
| 13 | ModelNet40-C | 202 |
| 14 | ViTs-vs-CNNs | 171 |
| 15 | fiddler-auditor | 142 |
| 16 | adversarial-reinforcement-learning | 76 |
| 17 | LBGAT | 33 |
