Last modified: 6 May 2026
I am writing these notes as I study Embodied AI. The content is drawn mainly from excellent papers, which I have reorganized for my own understanding.
$\pi_0$: A Vision-Language-Action Flow Model for General Robot Control
FAST: Efficient Action Tokenization for Vision-Language-Action Models
$\pi_{0.5}$: A Vision-Language-Action Model with Open-World Generalization
$\pi_{0.6}^*$: A VLA that Learns from Experience
World Action Models are Zero-shot Policies
$\pi_{0.7}$: A Steerable Generalist Robotic Foundation Model with Emergent Capabilities