1 | Oier Mees

From Human Videos to Robot Manipulation: A Survey on Scalable Vision-Language-Action Learning with Human-centric Data

IJCAI 2026

Zhiyuan Feng, Qixiu Li, Huizhi Liang, Rushuai Yang, Yichao Shen, Zhiying Du, Zhaowei Zhang, Yu Deng, Li Zhao, Hao Zhao, Zongqing Lu, Oier Mees, Marc Pollefeys, Jiaolong Yang, Baining Guo

From Human Videos to Robot Manipulation: A Survey on Scalable Vision-Language-Action Learning with Human-centric Data

mimic-video: Video-Action Models for Generalizable Robot Control Beyond VLAs

RSS 2026

Jonas Pai, Liam Achenbach, Victoriano Montesinos, Benedek Forrai, Oier Mees, Elvis Nava

Training Strategies for Efficient Embodied Reasoning

CoRL 2025

William Chen, Suneel Belkhale, Suvir Mirchandani, Danny Driess, Oier Mees, Karl Pertsch, Sergey Levine

FAST: Efficient Action Tokenization for Vision-Language-Action Models

RSS 2025

Karl Pertsch, Kyle Stachowicz, Brian Ichter, Danny Driess, Suraj Nair, Quan Vuong, Oier Mees, Chelsea Finn, Sergey Levine

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

ICRA 2025

Joshua Jones, Oier Mees, Carmelo Sferrazza, Kyle Stachowicz, Pieter Abbeel, Sergey Levine

GHIL-Glue: Hierarchical Control with Filtered Subgoal Images

ICRA 2025

Kyle B Hatch, Ashwin Balakrishna, Oier Mees, Suraj Nair, Seohong Park, Blake Wulfe, Masha Itkina, Benjamin Eysenbach, Sergey Levine, Thomas Kollar, Benjamin Burchfiel