SaaSHub helps you find the best software and product alternatives Learn more →
EfficientZero Alternatives
Similar projects and alternatives to EfficientZero
-
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
-
-
WorkOS
The modern identity platform for B2B SaaS. The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.
-
XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model
-
-
msn
Discontinued Masked Siamese Networks for Label-Efficient Learning (https://arxiv.org/abs/2204.07141)
-
CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
-
-
-
minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
-
omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments. (by hr0nix)
-
GPT-3T
Building language models to predict more than one token ahead to enable further ahead predictions.
-
EfficientZero
Fork of EfficientZero to use newer libraries and to fix a few runtime bugs. Also includes pretrained models! (by steventrouble)
-
SaaSHub
SaaSHub - Software Alternatives and Reviews. SaaSHub helps you find the best software and product alternatives
EfficientZero reviews and mentions
-
[D] GPT-3T: Can we train language models to think further ahead?
Here's an algorithm that is more sample efficient : https://github.com/YeWR/EfficientZero
-
MuZero learns to play Teamfight Tactics
Use multiprocessing to have more GPU workers could help. My code based on EfficientZero https://github.com/YeWR/EfficientZero is utilizing CPUs and GPUs to 90%. It uses Ray for multiprocessing and splits Reanalyze into CPU and GPU workers to maximize resource utilization. By the way, it's not converging to optimal policy well: it gets stuck at 50% optimal episode return at with a small amount of training. Have you had this issue before?
- Anyone found any working replication repo for MuZero?
-
[D] Most important AI Paper´s this year so far in my opinion + Proto AGI speculation at the end
Mastering Atari Games with Limited Data – EfficientZero ( Human sample -efficiency! ) Paper: https://arxiv.org/abs/2111.00210 Lesswrong article about the paper: https://www.lesswrong.com/posts/mRwJce3npmzbKfxws/efficientzero-how-it-works Github: https://github.com/YeWR/EfficientZero
-
A note from our sponsor - SaaSHub
www.saashub.com | 28 Mar 2024
Stats
YeWR/EfficientZero is an open source project licensed under GNU General Public License v3.0 only which is an OSI approved license.
The primary programming language of EfficientZero is Python.