RL framework for 2v2 kart soccer

This page summarizes the projects mentioned and recommended in the original post on /r/reinforcementlearning

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • slimevolleygym

    A simple OpenAI Gym environment for single and multi-agent reinforcement learning

  • Hi great that you are interested in the area, but as a beginner project is quite complex, having a team is a multi-agent task so not a small feat and i guess you want the same policy to play against itself? what is know as selfplay. which is not so hard to understand but a little bit in the tech part. Look a this 1v1 environment has a tutorial where they show selfplay and other single agent approaches using a well known RL Pytorch implementations. and for the policy optimization algorithm as the tutorial before you should go with PPO (which is a on-policy method like reinforce). there is something called HER for sparse reward but it works with off-policy methods like ddpg or sac. read a little bit more about this and then you will get the idea. My suggestion if you dont have extend experience try a supervise learning approach, where you have a dataset where the action is the label to be predicted and the observation is the input, MSE for the loss. like predicting the stering wheel angle from the image of the road kind of setup.

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Ask HN: Running LLMs Locally

    2 projects | news.ycombinator.com | 15 May 2024
  • GPUsGoBurr: Get up to 2x higher performance by Tuning LLM Inference Deployment

    1 project | news.ycombinator.com | 15 May 2024
  • Show HN: Tarsier – vision for text-only LLM web agents that beats GPT-4o

    8 projects | news.ycombinator.com | 15 May 2024
  • PaliGemma: Open-Source Multimodal Model by Google

    5 projects | news.ycombinator.com | 15 May 2024
  • Project Gameface Launches on Android

    1 project | news.ycombinator.com | 15 May 2024