Embodied Robotics Research
Tag: GRPO
1 item with this tag.
May 05, 2026
RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models
reward-alignment
video-world-model
GRPO
RL-post-training
long-horizon
benchmark