Abstract: As an online learning algorithm of approximate dynamic programming (ADP), direct heuristic dynamic programming (DHDP) has demonstrated its applicability to large state and control problems.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results