JAX implementation of Adaptive Approximate Policy Iteration (Hao et al., 2021)
Why do you think that https://github.com/d2l-ai/d2l-en is a good alternative to adaptive-policy-iteration
JAX implementation of Adaptive Approximate Policy Iteration (Hao et al., 2021)
Why do you think that https://github.com/d2l-ai/d2l-en is a good alternative to adaptive-policy-iteration