@_akhaliq
Reparameterized Policy Learning for Multimodal Trajectory Optimization

paper page: https://t.co/U4hfleRZvM

We investigate the challenge of parametrizing policies for reinforcement learning (RL) in high-dimensional continuous action spaces. Our objective is to develop a… https://t.co/51VEttH8i5 https://t.co/0Z9EkJkd3c