Greedy AI agents learn to cooperate

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • pleuro

  • > even markov chains can lead to behavior that looks this way.

    What a fun project, thank you for sharing the link.

    re:

    > Once we created the creatures, we set up their odors, which enable our creatures to smell them. When we have odors and creatures all we then have to do is build a the control components

    This sounds similar to the "collaborative diffusion" approach to pathfinding [1].

    > However, it is possible to view the code on the github page.

    The link to https://github.com/lettergram/pleuro does not work. Do you still have a copy of the code somewhere? I'm curious to learn more about how you modelled the control loops with markov chains.

    Was the rough idea that there are states "forage", "eat", "protect", and then probability of transitions between states depends upon the simulated creature's current state & sensor information about the environment?

    [1] Repenning 2006 "Collaborative diffusion: programming antiobjects" https://home.cs.colorado.edu/~ralex/papers/PDF/OOPSLA06antio...

  • AI-Toolbox

    A C++ framework for MDPs and POMDPs with Python bindings

  • I maintain a repository of many implementations of classical (tabular) RL algorithms [1] which you might enjoy playing with when starting out. I use it for both research and for student projects. The advantage of avoiding NNs when starting out is that it is much simpler to inspect the inner workings of an algorithm to see whether it's working or not.

    I'm always happy to help if something is unclear or difficult so feel free to open issues there :)

    [1]: https://github.com/Svalorzen/AI-Toolbox

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
  • duckduckgo-locales

    Translation files for <a href="https://duckduckgo.com"> </a>

  • Alpha Zero would be a good start (a generalized version of Alpha Go, that started the current AI hype cycle) https://duckduckgo.com/?q=alpha+zero+github&t=ffab&ia=web

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Bountysource Stole at Least $17,000 from Open Source Developers

    2 projects | news.ycombinator.com | 3 May 2024
  • SB-1047 will stifle open-source AI and decrease safety

    2 projects | news.ycombinator.com | 29 Apr 2024
  • Ask HN: Recommendations for Local LLMs in 2024: Private and Offline?

    2 projects | news.ycombinator.com | 6 Apr 2024
  • Sequence-to-Sequence Toolkit Written in Python

    1 project | news.ycombinator.com | 30 Mar 2024
  • Show HN: LlamaGym – fine-tune LLM agents with online reinforcement learning

    2 projects | news.ycombinator.com | 10 Mar 2024