Workshop Paper

Better Gradient Steps for Deep On-Policy Reinforcement Learning

This paper studies how the collected transitions can be prioritized to speed up the gradient ascent process toward a favorable policy. To do so, we weigh the transitions in the …

Ryan Pégoud

• Aug 1, 2024 • 1 min read