Suggest an alternative to

adaptive-policy-iteration

JAX implementation of Adaptive Approximate Policy Iteration (Hao et al., 2021)

Why do you think that https://github.com/d2l-ai/d2l-en is a good alternative to adaptive-policy-iteration