Embodied Robotics Research

Tag: GRPO

1 item with this tag.

  • May 05, 2026

    RoboAlign-R1: Distilled Multimodal Reward Alignment for Robot Video World Models

    • reward-alignment
    • video-world-model
    • GRPO
    • RL-post-training
    • long-horizon
    • benchmark
