leela-zero
mctx
Our great sponsors
leela-zero | mctx | |
---|---|---|
11 | 10 | |
5,225 | 2,203 | |
0.0% | 2.1% | |
0.0 | 0.0 | |
about 1 year ago | 3 months ago | |
C++ | Python | |
GNU General Public License v3.0 only | Apache License 2.0 |
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.
leela-zero
-
I guess I have mastered the AI attack
https://github.com/leela-zero/leela-zero but it is not user friendly imo
- DailyMotion ont ils un boulevard devant eux ?
-
Human Go players beat top Go AIs using a "trick"
Yeah, see https://github.com/leela-zero/leela-zero/pull/883 for the discussions near the origin of this idea, which Leela Zero was the first to use many years ago. KataGo's implementation is a bit different in minor ways, but still based on the same mathematical idea. https://github.com/lightvector/KataGo/blob/master/cpp/search/searchhelpers.cpp#L482
-
DeepMind has open-sourced the heart of AlphaGo and AlphaZero
Totally agree. I don't even know what benefit they'd get at this point from keeping some parts locked up.
Anyway if you want something runnable Leela has a nice reimplementation: https://github.com/leela-zero/leela-zero
-
Please help me settle an argument with my friend about KataGo
See https://github.com/leela-zero/leela-zero/issues/2445 for an example with Leela Zero failing to see an atari, even *with* tons of search. This is a similar issue - neural nets have a hard time perceiving things that depend sensitively on large areas when unusual shapes are involved.
-
Go-playing trick defeats world-class Go AI—but loses to human amateurs
(https://github.com/leela-zero/leela-zero/issues/2273)
-
The blue recommended move was there like most the game screaming at me for not playing it. This doesn't look that big though? Why would this be significant?
I think it's Leela https://github.com/leela-zero/leela-zero
- 【推荐交流帖】键政累了,大家一人推荐一个翻墙后常上的网站吧
-
[D] How OpenAI Sold its Soul for $1 Billion: The company behind GPT-3 and Codex isn’t as open as it claims.
There is Leela Zero
-
Lizzie Suggests Moves Off Board in 9x9 Game
Yeah, it looks like I need to recompile Leela Zero with a 9x9 board size? https://github.com/leela-zero/leela-zero/pull/928 (and https://github.com/leela-zero/leela-zero/issues/2613)
mctx
- About Monte Carlo tree search in Jax
-
Programming language dilemma
Maybe you can have your cake and eat it too. :) You could use Python with one of the hardware accelerating languages like Jax. This project for example uses Jax to implement Monte Carlo Tree Search and includes a few games as examples. https://github.com/deepmind/mctx
-
Is there any proof that AlphaZero actually exist?
recently tree search part of alpha zero has gone open source https://github.com/deepmind/mctx
-
[D] Anyone interested in training an AI for Tigris and Euphrates?
You could try starting with https://github.com/deepmind/mctx. You’ll probably need to expose your game state and actions via IPC of some sort or FFI your rust code to Python.
-
DeepMind has open-sourced the heart of AlphaGo and AlphaZero
Interesting approach to private variables https://github.com/deepmind/mctx/blob/577fc77a3cda1b796e277e...
- AlphaZero's Monte Carlo tree search implementation in Jax
-
Anyone found any working replication repo for MuZero?
Just have a look at the DM repo: https://github.com/deepmind/mctx
- MuZero Implementation
- Official DeepMind MuZero Implementation
-
Finally an official MuZero implementation
deepmind/mctx: Monte Carlo tree search in JAX (github.com)
What are some alternatives?
KataGo - GTP engine and self-play learning in Go
EfficientZero - Fork of EfficientZero to use newer libraries and to fix a few runtime bugs. Also includes pretrained models!
alpha-zero-boosted - A "build to learn" Alpha Zero implementation using Gradient Boosted Decision Trees (LightGBM)
minihack - MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
opensea-js - TypeScript SDK for the OpenSea marketplace
koneko - 🐈🌐 nyaa.si terminal BitTorrent tracker
leela-zero - Go engine with no human-provided knowledge, modeled after the AlphaGo Zero paper.
craftingway - A ffxiv crafting tool
hivemind - Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
omega - A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.