Embodied Robotics Research

Tag: dense-reward

1 item with this tag.

  • Mar 28, 2026

    SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning

    • reward-model
    • video-language
    • chain-of-thought
    • online-RL
    • zero-shot
    • dense-reward
    • manipulation

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community