Embodied Robotics Research
Search
Search
Dark mode
Light mode
Explorer
Tag: reward-free
2 items with this tag.
Jun 03, 2026
FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization
vla
reinforcement-fine-tuning
flow-matching
preference-optimization
reward-free
tencent-robotics
May 12, 2026
RAW-Dream: Reinforcing VLAs in Task-Agnostic World Models
world-model
rl
vla
task-agnostic
reward-free
vlm-reward
microsoft-research