Posts
-
Q-Learning with Multiple Subactions
Autoregressive subaction sampling lets DQN handle composite actions without combinatorial explosion. -
Extending DQN to Continuous Action Spaces with Cubic Splines
Cubic splines let DQN handle continuous action space without discretizing
subscribe via RSS