
Grounding World Simulation Models in a Real-World Metropolis
Key Points
- Seoul World Model (SWM) is a novel world simulation model that grounds autoregressive video generation in real-world cityscapes by using retrieval-augmented conditioning on a vast street-view image database.
- To enable faithful and long-horizon generation, SWM introduces innovations such as cross-temporal pairing, a diverse synthetic dataset for varied trajectories, view interpolation, and a Virtual Lookahead Sink that continuously re-grounds the model.
- This allows SWM to generate spatially faithful and temporally consistent videos spanning kilometers with free-form navigation and text-prompted scenario control, outperforming existing methods for real-world urban environments.
The Seoul World Model (SWM) is a pioneering city-scale generative world model that renders real-world urban environments by grounding autoregressive video generation in a metropolis, specifically Seoul. Unlike prior world models that synthesize imagined environments, SWM focuses on producing visually faithful and temporally consistent videos of actual cityscapes over multi-kilometer trajectories, supporting diverse camera movements and text-prompted scenario variations.
The core methodology of SWM is built upon Retrieval-Augmented Generation (RAG) conditioned on a geo-indexed street-view database. For each video chunk to be generated, SWM retrieves nearby street-view images based on geographic coordinates, camera actions, and text prompts. These retrieved images serve as complementary references, guiding the generation process through two pathways:
- Geometric Referencing: The nearest reference image is warped into the target viewpoint using depth-based splatting to provide explicit spatial layout cues. This anchors the generated content to the real-world geometry.
- Semantic Referencing: The original reference images are injected into the transformer's latent sequence, allowing the model to attend to appearance details across all retrieved references.
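The retrieval step that feeds both pathways can be illustrated with a minimal nearest-neighbor lookup over a geo-indexed database. The entry format, the 50 m radius, and the `retrieve_references` helper below are hypothetical assumptions for illustration, standing in for however the actual system indexes its street-view collection:

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two WGS84 points."""
    r = 6_371_000.0  # mean Earth radius in meters
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def retrieve_references(db, query_lat, query_lon, k=3, radius_m=50.0):
    """Return up to k street-view entries within radius_m of the query pose,
    nearest first. A production system would use a spatial index, not a scan."""
    scored = [(haversine_m(query_lat, query_lon, e["lat"], e["lon"]), e) for e in db]
    scored = [s for s in scored if s[0] <= radius_m]
    scored.sort(key=lambda s: s[0])
    return [e for _, e in scored[:k]]

# Toy geo-indexed database: three street-view entries near central Seoul.
db = [
    {"id": "sv_001", "lat": 37.4979, "lon": 127.0276},
    {"id": "sv_002", "lat": 37.4981, "lon": 127.0279},
    {"id": "sv_003", "lat": 37.5100, "lon": 127.0400},  # far outside the radius
]
refs = retrieve_references(db, 37.4980, 127.0277)
print([e["id"] for e in refs])  # nearest-first; the distant entry is filtered out
```

In this sketch the nearest entry would feed the geometric pathway (depth-based warping into the target view), while the full top-k list would feed the semantic pathway as appearance references.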
SWM addresses several key challenges inherent in grounding a world model in real-world data:
- Temporal Misalignment between References and Dynamic Scene: Real-world street-view images capture specific moments, including transient elements like vehicles. If used directly as references, these dynamic objects could "leak" into the generated video, even if the target scene should not contain them. SWM tackles this with Cross-Temporal Pairing. During training, the reference street-view images are deliberately chosen from a different capture time than the target video sequence. This forces the model to learn to rely on persistent spatial structures (e.g., buildings, roads) and ignore transient objects (e.g., cars, pedestrians) in the references, ensuring that generation focuses on the scene's static geometry.
- Limited Trajectory Diversity and Data Sparsity: Real-world data, especially from vehicle-mounted captures, often provides only forward-driving trajectories and is sparse. To overcome this, SWM leverages a substantial Synthetic Dataset rendered from an Unreal Engine-based CARLA urban simulator. This dataset, covering 431,500 m², includes 10,000 synthetic videos with diverse camera trajectories: pedestrian (sidewalks, crossings), vehicle (highways, urban roads), and free-camera (arbitrary collision-free paths). This synthetic data, combined with 1.2 million real panoramic street-view images from Seoul, enables SWM to generalize beyond simple forward driving and support free-form navigation.
- Synthesis of Coherent Training Videos from Sparse Images: Real street-view keyframes are typically sparse (5-20m apart), making it challenging to create smooth training videos. SWM employs a View Interpolation Pipeline that synthesizes smooth sequences from these sparse keyframes, using an intermittent freeze-frame strategy matched to the 3D VAE's temporal stride to keep the training data temporally consistent.
- Error Accumulation in Long-Horizon Generation: Autoregressive generation inherently accumulates errors over long sequences, leading to quality degradation. Prior methods often use a static attention sink anchored to the initial frame, whose guidance weakens as the camera moves further away. SWM introduces a novel mechanism called the Virtual Lookahead Sink (VL Sink) to combat this. The VL Sink dynamically retrieves the nearest street-view image at a future location, which serves as a "virtual future destination." This retrieved image acts as a clean, error-free anchor ahead of the current generation chunk, continuously re-grounding the generation process. This dynamic re-grounding significantly stabilizes video quality over trajectories spanning hundreds of meters, preventing the typical degradation seen in long-horizon generative models.
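The VL Sink's dynamic re-grounding can be sketched as a lookahead query against the geo-indexed database: walk a fixed distance ahead along the planned trajectory, then retrieve the street-view capture nearest to that future point. The 2D trajectory representation, the 25 m lookahead, and the entry format below are illustrative assumptions, not the paper's implementation:

```python
import math

def lookahead_point(trajectory, current_idx, lookahead_m):
    """Walk forward along a polyline trajectory (2D points in meters)
    by lookahead_m meters and return the resulting point."""
    remaining = lookahead_m
    x, y = trajectory[current_idx]
    for nx, ny in trajectory[current_idx + 1:]:
        seg = math.hypot(nx - x, ny - y)
        if seg >= remaining:
            t = remaining / seg
            return (x + t * (nx - x), y + t * (ny - y))
        remaining -= seg
        x, y = nx, ny
    return (x, y)  # trajectory ends before the lookahead distance

def virtual_lookahead_sink(db, trajectory, current_idx, lookahead_m=25.0):
    """Retrieve the street-view entry nearest to the point lookahead_m ahead,
    to serve as a clean anchor for the next generation chunk."""
    px, py = lookahead_point(trajectory, current_idx, lookahead_m)
    return min(db, key=lambda e: math.hypot(e["x"] - px, e["y"] - py))

# Toy setup: a straight 100 m trajectory with street-view captures every 20 m.
traj = [(float(i), 0.0) for i in range(0, 101, 10)]
db = [{"id": f"sv_{x:03d}", "x": float(x), "y": 0.0} for x in range(0, 101, 20)]
sink = virtual_lookahead_sink(db, traj, current_idx=2, lookahead_m=25.0)
print(sink["id"])  # the capture nearest to the point 25 m ahead of (20, 0)
```

Because the anchor is always retrieved from the database rather than generated, it never carries accumulated autoregressive error, which is what lets it stabilize quality over long trajectories.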
The SWM architecture autoregressively generates video chunks conditioned on a text prompt (for scenario control), a camera trajectory, and the retrieved street-view images. The ability for text-prompted scenario control allows users to reshape familiar city scenes by injecting elements like a "massive wave" or "Godzilla" into the generated videos.
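The chunk-by-chunk generation loop described above might be organized as follows. `retrieve_fn` and `generate_chunk_fn` are hypothetical stand-ins for the retrieval module and the autoregressive video transformer; the chunk length and context handling are assumptions for illustration:

```python
def generate_video(trajectory, text_prompt, retrieve_fn, generate_chunk_fn,
                   chunk_len=4, context=None):
    """Autoregressively generate a video chunk by chunk along a trajectory.

    retrieve_fn(pose) -> reference images near that pose (hypothetical);
    generate_chunk_fn(context, refs, poses, prompt) -> (frames, new_context).
    Each chunk is conditioned on the text prompt, its trajectory segment,
    freshly retrieved references, and context carried over from prior chunks.
    """
    frames = []
    for start in range(0, len(trajectory), chunk_len):
        poses = trajectory[start:start + chunk_len]
        refs = retrieve_fn(poses[0])  # re-ground each chunk via retrieval
        chunk, context = generate_chunk_fn(context, refs, poses, text_prompt)
        frames.extend(chunk)
    return frames

# Toy stubs: poses are integers, each "frame" is a labeled string.
traj = list(range(10))
retrieve = lambda pose: [f"ref@{pose}"]
def gen(ctx, refs, poses, prompt):
    return [f"frame@{p}" for p in poses], poses[-1]
out = generate_video(traj, "sunny afternoon", retrieve, gen)
print(len(out))  # one frame per trajectory pose
```

The key design point the sketch captures is that retrieval happens per chunk rather than once up front, so the conditioning stays anchored to the local geography as the camera moves.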
SWM is trained on a combined dataset of 1.2 million real panoramic street-view images captured across Seoul and 10,000 synthetic videos from CARLA. Evaluation against recent video world models across various cities (Seoul, Busan, Ann Arbor) demonstrates that SWM outperforms existing methods in generating spatially faithful, temporally consistent, and long-horizon videos grounded in actual urban environments, while supporting diverse camera movements and text-prompted variations. This work represents an industry-academic collaboration between NAVER and KAIST, utilizing NAVER Map data.