AcceRL: A Distributed Asynchronous Reinforcement Learning and World Model Framework for Vision-Language-Action Models

Summary

AcceRL is a fully asynchronous, decoupled RL training framework designed for large-scale vision-language-action (VLA) models. It physically isolates training, inference, and rollout workers, which removes the synchronization barriers that degrade GPU utilization in standard synchronous pipelines. AcceRL is also the first framework to integrate a plug-and-play, trainable world model into a distributed asynchronous RL pipeline for generating virtual experiences.
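To make the decoupling concrete, here is a minimal sketch of three worker roles that communicate only through queues, so none of them waits at a global barrier. It is not AcceRL's actual code or API: the names rollout_worker, inference_worker, trainer, and run_pipeline, the toy environment, and the integer "policy version" are all assumptions made for illustration, and threads stand in for what would be separate processes or machines.

```python
import queue
import threading


def rollout_worker(worker_id, request_q, trajectory_q, num_steps):
    """Steps a toy environment and asks the inference worker for every action."""
    reply_q = queue.Queue(maxsize=1)
    obs = 0
    for _ in range(num_steps):
        request_q.put((obs, reply_q))            # send an inference request
        action, policy_version = reply_q.get()   # wait only for this worker's own reply
        trajectory_q.put((worker_id, obs, action, policy_version))
        obs = obs + action                       # toy dynamics (assumption)


def inference_worker(request_q, weights, stop):
    """Serves action requests using whatever policy version is current."""
    while not stop.is_set():
        try:
            _obs, reply_q = request_q.get(timeout=0.05)
        except queue.Empty:
            continue
        reply_q.put((1, weights["version"]))     # constant action stands in for a VLA policy


def trainer(trajectory_q, weights, total_transitions, batch_size):
    """Consumes transitions as they arrive; never pauses the rollout workers."""
    consumed = 0
    while consumed < total_transitions:
        size = min(batch_size, total_transitions - consumed)
        batch = [trajectory_q.get() for _ in range(size)]
        consumed += len(batch)
        weights["version"] += 1                  # stand-in for a gradient step plus weight publish
    return consumed


def run_pipeline(num_rollout_workers=3, steps_per_worker=4, batch_size=2):
    request_q, trajectory_q = queue.Queue(), queue.Queue()
    weights = {"version": 0}
    stop = threading.Event()

    server = threading.Thread(target=inference_worker, args=(request_q, weights, stop))
    server.start()
    workers = [
        threading.Thread(target=rollout_worker,
                         args=(i, request_q, trajectory_q, steps_per_worker))
        for i in range(num_rollout_workers)
    ]
    for w in workers:
        w.start()

    total = num_rollout_workers * steps_per_worker
    consumed = trainer(trajectory_q, weights, total, batch_size)

    for w in workers:
        w.join()
    stop.set()
    server.join()
    return consumed, weights["version"]


if __name__ == "__main__":
    print(run_pipeline())
```

Because each transition records the policy version that produced it, a real trainer could detect and correct for stale (off-policy) data, which is the usual cost of removing synchronization barriers.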

Key Contributions

Fully asynchronous decoupled architecture physically separating training, inference, and environment rollout to eliminate blocking synchronization
Plug-and-play world model integration enabling virtual experience generation within the same async pipeline, reducing costly real-world rollout time
Super-linear throughput scaling and highly efficient hardware utilization on LIBERO benchmarks
Open-source at github.com/distanceLu/AcceRL
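The world-model contribution can be pictured as a second source of transitions that feeds the same training buffer as real rollouts. The sketch below is a toy illustration under stated assumptions, not the AcceRL implementation: ToyWorldModel, real_env_step, collect_real, and build_training_batch are hypothetical names, and a one-parameter linear dynamics model stands in for a learned visual world model.

```python
from dataclasses import dataclass


@dataclass
class Transition:
    obs: float
    action: float
    next_obs: float
    imagined: bool  # True when produced by the world model, not the real environment


class ToyWorldModel:
    """Hypothetical trainable dynamics model: next_obs is approximated by obs + gain * action."""

    def __init__(self):
        self.gain = 0.0

    def fit(self, real_transitions):
        # Closed-form least squares for the single parameter `gain`.
        num = sum(t.action * (t.next_obs - t.obs) for t in real_transitions)
        den = sum(t.action ** 2 for t in real_transitions)
        if den > 0:
            self.gain = num / den

    def imagine(self, start_obs, actions):
        # Roll the learned dynamics forward without touching the real environment.
        obs, out = start_obs, []
        for a in actions:
            nxt = obs + self.gain * a
            out.append(Transition(obs, a, nxt, imagined=True))
            obs = nxt
        return out


def real_env_step(obs, action):
    # Stand-in for an expensive real or simulated rollout step (assumption).
    return obs + 2.0 * action


def collect_real(start_obs, actions):
    obs, out = start_obs, []
    for a in actions:
        nxt = real_env_step(obs, a)
        out.append(Transition(obs, a, nxt, imagined=False))
        obs = nxt
    return out


def build_training_batch(real, model, imagined_actions):
    # Train the world model on real data, then add virtual experience to the batch.
    model.fit(real)
    virtual = model.imagine(real[-1].next_obs, imagined_actions)
    return real + virtual


if __name__ == "__main__":
    batch = build_training_batch(collect_real(0.0, [1.0, -1.0, 0.5]),
                                 ToyWorldModel(), [1.0, 1.0])
    print(sum(t.imagined for t in batch), "imagined of", len(batch))
```

Tagging each transition as real or imagined lets the trainer weight or filter virtual experience separately, which is one plausible way to keep world-model errors from dominating the policy update.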

Significance

AcceRL provides critical infrastructure for scaling RL fine-tuning of VLAs. It demonstrates that a decoupled asynchronous design with integrated world-model imagination can substantially improve training efficiency, making it an enabling technology as the field moves toward larger-scale robot RL.
