Embodied Robotics Research
Tag: tencent-robotics
1 item with this tag.
Jun 03, 2026
FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization
vla
reinforcement-fine-tuning
flow-matching
preference-optimization
reward-free
tencent-robotics