Embodied Robotics Research

Tag: reward-free

2 items with this tag.

  • Jun 03, 2026

    FlowPRO: Reward-Free Reinforced Fine-Tuning of Flow-Matching VLAs via Proximalized Preference Optimization

    • vla
    • reinforcement-fine-tuning
    • flow-matching
    • preference-optimization
    • reward-free
    • tencent-robotics
  • May 12, 2026

    RAW-Dream: Reinforcing VLAs in Task-Agnostic World Models

    • world-model
    • rl
    • vla
    • task-agnostic
    • reward-free
    • vlm-reward
    • microsoft-research

Created with Quartz v4.5.2 © 2026

  • GitHub
  • Discord Community