Workshop Paper

Better Gradient Steps for Deep On-Policy Reinforcement Learning

This paper studies how the collected transitions can be prioritized to speed up the gradient ascent process toward a favorable policy. To do so, we weigh the transitions in the …

Ryan Pégoud