Last modified: 6 May 2026 This note is organized as I study Embodied AI. The content is mainly extracted from excellent papers. I reorganized the material for my own understanding.

$\pi_0$: A Vision-Language-Action Flow Model for General Robot Control

FAST: Efficient Action Tokenization for Vision-Language-Action Models

$\pi_{0.5}$: A Vision-Language-Action Model with Open-World Generalization

$\pi_{0.6}^*$: A VLA that Learns from Experience

World Action Models are Zero-shot Policies

$\pi_{0.7}$: A Steerable Generalist Robotic Foundation Model with Emergent Capabilities