Weekly Research Digest — 2026-05-06

8 new entries this week across 3 topic areas.


Vision-Language-Action (VLA) Models

Release | Venue | Significance
acot-vla-action-chain-of-thought-for-vla-models ACoT-VLA | CVPR 2026 | Reasons directly in action space via coarse action-intent chains, closing the modality gap between planning and execution
abot-m0-vla-foundation-model-action-manifold-learning ABot-M0 | arXiv | Alibaba’s 6M-trajectory cross-embodiment foundation model with Action Manifold Learning for stable diffusion-free action prediction
dualcot-vla-visual-linguistic-chain-of-thought-parallel-reasoning DualCoT-VLA | arXiv | Parallel dual-modal CoT (visual + linguistic) in a single forward pass; SOTA on LIBERO and RoboCasa with no latency penalty

World Models for Robotics

Release | Venue | Significance
h-wm-hierarchical-world-model-task-motion-planning H-WM | arXiv | Hierarchical logical+visual world model enabling robust long-horizon TAMP guidance for VLA policies
chain-of-world-world-model-thinking-latent-motion Chain of World (CoWVLA) | CVPR 2026 | Latent motion chains as world-model reasoning for VLA pretraining, outperforming pixel-space world models at lower compute
hierarchical-planning-with-latent-world-models Hierarchical Planning with Latent World Models | arXiv | Meta FAIR’s two-level latent planner achieves 70% zero-shot pick-&-place vs. 0% for flat world models

Reinforcement Learning for Robotics

Release | Venue | Significance
rlinf-co-sim-real-co-training-vla RLinf-Co | arXiv | Closed-loop sim RL with real-data anchor improves VLA success +24% (OpenVLA) and +20% (π₀.₅) over SFT co-training
what-matters-sim-to-online-rl-real-robots What Matters for Sim-to-Online RL | arXiv | 100-run ablation across 3 real robots identifies concrete design choices (data retention, delayed critic) for stable online RL

Generated automatically. All entries verified via web search.