| Title: |
GEM: A Generalizable Ego-Vision Multimodal World Model for Fine-Grained Ego-Motion, Object Dynamics, and Scene Composition Control |
| Authors: |
Hassan, Mariam; Stapf, Sebastian; Rahimi, Ahmad; Rezende, Pedro M B; Haghighi, Yasaman; Bruggemann, David; Katircioglu, Isinsu; Zhang, Lin; Chen, Xiaoran; Saha, Suman; Cannici, Marco; Aljalbout, Elie; Ye, Botao; Wang, Xi; Davtyan, Aram; Salzmann, Mathieu; Scaramuzza, Davide; Pollefeys, Marc; Favaro, Paolo; Alahi, Alexandre |
| Source: |
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) CVPR Computer Vision and Pattern Recognition (CVPR), 2025 IEEE/CVF Conference on. :22404-22415 Jun, 2025 |
| Relation: |
2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) |
| Database: |
IEEE Xplore Digital Library |