Continuous-time, continuous-state, continuous-action fitted value iteration via the Hamilton-Jacobi-Bellman (HJB) equation
reinforcement-learning
flax
optimal-control
value-iteration
continuous-control
jax
hamilton-jacobi-bellman
hamilton-jacobi
continuous-value-iteration
Updated Feb 1, 2022 - Python