Embodied Robotics Research

Tag: reward-model

2 items with this tag.

  • Mar 28, 2026

    SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning

    • reward-model
    • video-language
    • chain-of-thought
    • online-RL
    • zero-shot
    • dense-reward
    • manipulation
  • Mar 17, 2026

    Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models

    • reward-model
    • VLM
    • online-RL
    • imitation-learning
    • robot-manipulation
    • sample-efficient

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community