RL framework for 2v2 kart soccer

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

slimevolleygym

3 698 3.2 Python

A simple OpenAI Gym environment for single and multi-agent reinforcement learning

Hi great that you are interested in the area, but as a beginner project is quite complex, having a team is a multi-agent task so not a small feat and i guess you want the same policy to play against itself? what is know as selfplay. which is not so hard to understand but a little bit in the tech part. Look a this 1v1 environment has a tutorial where they show selfplay and other single agent approaches using a well known RL Pytorch implementations. and for the policy optimization algorithm as the tutorial before you should go with PPO (which is a on-policy method like reinforce). there is something called HER for sparse reward but it works with off-policy methods like ddpg or sac. read a little bit more about this and then you will get the idea. My suggestion if you dont have extend experience try a supervise learning approach, where you have a dataset where the action is the label to be predicted and the observation is the input, MSE for the loss. like predicting the stering wheel angle from the image of the road kind of setup.

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Ask HN: Running LLMs Locally

2 projects | news.ycombinator.com | 15 May 2024
GPUsGoBurr: Get up to 2x higher performance by Tuning LLM Inference Deployment

1 project | news.ycombinator.com | 15 May 2024
Show HN: Tarsier – vision for text-only LLM web agents that beats GPT-4o

8 projects | news.ycombinator.com | 15 May 2024
PaliGemma: Open-Source Multimodal Model by Google

5 projects | news.ycombinator.com | 15 May 2024
Project Gameface Launches on Android

1 project | news.ycombinator.com | 15 May 2024

RL framework for 2v2 kart soccer

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning Post date: 17 Nov 2021

slimevolleygym

InfluxDB

Related posts

Ask HN: Running LLMs Locally

GPUsGoBurr: Get up to 2x higher performance by Tuning LLM Inference Deployment

Show HN: Tarsier – vision for text-only LLM web agents that beats GPT-4o

PaliGemma: Open-Source Multimodal Model by Google

Project Gameface Launches on Android