Ask HN: Advice on setting up adversarial programming challenge?

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

SPAG

2 54 7.2 Python

Self-playing Adversarial Language Game Enhances LLM Reasoning

I have been very intrigued lately by the results shown in the SPAG paper [1], and I would like to take the next step in research.
I would like to set up an adversarial training exercise for LLMs around writing secure / robust code.
For those unfamiliar with the SPAG paper, the idea is to train an LLM on reinforcement learning on an adversarial language game. As their results show, teaching an LLM to play a language game (such as Taboo, in their paper) through reinforcement learning increases the model's ability to solve other human reasoning tasks (when compared to traditional imitation learning).
Rough idea:
Setup: The defender is responsible for writing code according to a specification and a set of (hidden) unit tests that will evaluate the code for validity and completeness. This will be the standard sort of setup for Leetcode or other coding challenge websites.
The attacker is responsible for calling the code in such a way that it will crash or fail or be exploited in some fashion. For instance, if the attacker has the goal of getting the defender's code to crash, then so long as the stack trace fails from inside the defender's code (and not the attacker's code), then it's a victory for the attacker.
Is there a better way to set up an automated red-team vs. blue-team for a coding CTF style challenge around Leetcode-type coding problems? If you wanted to set up a challenge in an adversarial style to train the defender to write more robust code, how would you do it?
[1] - https://github.com/Linear95/SPAG

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Show HN: All you need is Prometheus and Jaeger for LLM Observability

1 project | news.ycombinator.com | 28 May 2024
Transformers Can Do Arithmetic with the Right Embeddings

1 project | news.ycombinator.com | 28 May 2024
ascii-based graphics: the only image file format for the terminal

1 project | dev.to | 28 May 2024
YouTube has now begun skipping videos altogether for users with ad blockers

1 project | news.ycombinator.com | 27 May 2024
SignWave: Program to transcribe text, audio files into a sign language animation

1 project | news.ycombinator.com | 27 May 2024

Ask HN: Advice on setting up adversarial programming challenge?

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Post date: 6 May 2024

SPAG

InfluxDB

Related posts

Show HN: All you need is Prometheus and Jaeger for LLM Observability

Transformers Can Do Arithmetic with the Right Embeddings

ascii-based graphics: the only image file format for the terminal

YouTube has now begun skipping videos altogether for users with ad blockers

SignWave: Program to transcribe text, audio files into a sign language animation