D4RT: Unified, Fast 4D Scene Reconstruction & Tracking
Key Points
- D4RT is a novel AI model that unifies 4D dynamic scene reconstruction and tracking, enabling machines to understand the complex interplay of space and time in moving environments.
- Utilizing a query-based encoder-decoder Transformer, D4RT efficiently determines the 3D location of pixels at arbitrary times, achieving up to 300x faster performance than previous methods.
- This highly efficient and accurate model excels at diverse tasks such as point tracking, point cloud reconstruction, and camera pose estimation, holding significant promise for applications in robotics, augmented reality, and the development of AI world models.
D4RT (Dynamic 4D Reconstruction and Tracking) is a novel unified AI model designed for 4D scene reconstruction and tracking across space and time from 2D video input. It addresses the complex inverse problem of recovering a rich, volumetric 3D world in motion from a sequence of flat 2D projections. Traditional approaches typically involve computationally intensive processes or a fragmented collection of specialized AI models for tasks like depth estimation, motion tracking, or camera pose estimation, leading to slow, piecemeal reconstructions, especially for dynamic objects.
D4RT unifies these tasks within a single, efficient framework. Its core challenge is to track every pixel of every object through three dimensions of space and the fourth dimension of time, disentangle object motion from camera motion, and maintain a coherent scene representation even when objects are occluded or leave the frame.
The model employs a unified encoder-decoder Transformer architecture. The encoder first processes the input video to create a compressed representation that encapsulates the scene's geometry and motion. This representation provides a rich, global understanding of the dynamic scene. The decoder, building on this compressed representation, utilizes a novel and flexible query mechanism.
The fundamental operation of D4RT revolves around answering a single, specific question: "Where is a given pixel from the video located in 3D space at an arbitrary time, as viewed from a chosen camera?" This can be conceptualized as an implicit function f(p, t, c) → (x, y, z), where p = (u, v) represents the 2D coordinates of a source pixel in a source frame, t is the queried time step, and c is the queried camera viewpoint. The output (x, y, z) provides the 3D spatial coordinates of the point corresponding to p at time t from viewpoint c. Queries are independent, allowing for parallel processing on modern AI hardware, which contributes significantly to D4RT's speed and scalability.
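The shape of this query mechanism can be sketched in a few lines of NumPy. This is a minimal, hypothetical illustration, not the paper's architecture: `embed_query`, the weight names, and the single cross-attention step are all assumptions made for clarity. It shows the two properties the text describes: a query token reads the encoder's compressed video representation via attention, and a batch of independent queries decodes in one parallel matrix pass.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def embed_query(u, v, t, c, d=64):
    # Toy sinusoidal embedding of a query (u, v, t, c) into R^d.
    freqs = np.linspace(1.0, 8.0, d // 4)
    return np.concatenate([np.sin(u * freqs), np.sin(v * freqs),
                           np.sin(t * freqs), np.sin(c * freqs)])

def decode_queries(queries, memory, W_q, W_k, W_v, W_o):
    # One cross-attention step: each query token attends over the
    # encoder's compressed video tokens, then projects to an XYZ point.
    Q, K, V = queries @ W_q, memory @ W_k, memory @ W_v
    attn = softmax(Q @ K.T / np.sqrt(K.shape[1]))   # (N, M) attention weights
    return (attn @ V) @ W_o                         # (N, 3): one point per query

rng = np.random.default_rng(0)
d = 64
memory = rng.normal(size=(128, d))                  # compressed scene tokens
W_q, W_k, W_v = (rng.normal(scale=d**-0.5, size=(d, d)) for _ in range(3))
W_o = rng.normal(scale=d**-0.5, size=(d, 3))

# Queries are independent, so a whole batch decodes in one matmul pass:
# here, one pixel queried at 16 different time steps.
queries = np.stack([embed_query(0.25, 0.5, t, c=0) for t in range(16)])
points = decode_queries(queries, memory, W_q, W_k, W_v, W_o)
assert points.shape == (16, 3)
```

Because each query is a separate row, adding more queries only widens the batch dimension; nothing in the decode step couples one query to another, which is what makes the formulation parallel-friendly.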
This flexible query-based formulation enables D4RT to perform a wide variety of 4D tasks efficiently:
- Point Tracking: By querying the 3D location of a source pixel across different time steps, D4RT can predict its full 3D trajectory, even if the corresponding object is not visible in all queried frames.
- Point Cloud Reconstruction: By "freezing" the time (t) and camera viewpoint (c), D4RT can directly generate a complete 3D point cloud of the scene from a dense set of pixel queries. This eliminates the need for separate camera estimation or per-video iterative optimization steps common in other methods.
- Camera Pose Estimation: By generating and aligning 3D snapshots of a single moment from different camera viewpoints, D4RT can robustly recover the camera's trajectory and pose.
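The three reductions above can be sketched as query patterns over a single function. In this sketch, `query_3d` is a hypothetical stand-in for the model's query head (a dummy closed-form scene so the script runs), and step 3 uses the standard Kabsch/Umeyama rigid alignment to illustrate recovering a relative pose from two 3D snapshots of the same instant:

```python
import numpy as np

rng = np.random.default_rng(0)

def query_3d(pixel, t, cam):
    # Hypothetical stand-in for the model's query head: maps
    # (pixel, time, camera) to a 3D point. A real model would run the
    # decoder; this dummy scene (a slowly drifting point) just executes.
    u, v = pixel
    return np.array([u + 0.05 * t, v, 1.0 + 0.02 * t])

# 1) Point tracking: hold the pixel fixed, sweep the time step t.
trajectory = np.stack([query_3d((0.3, 0.7), t, cam=0) for t in range(10)])

# 2) Point cloud: dense pixel grid with frozen t and cam -> a 3D snapshot.
pixels = [(u, v) for u in np.linspace(0, 1, 5) for v in np.linspace(0, 1, 5)]
cloud = np.stack([query_3d(p, t=0, cam=0) for p in pixels])

# 3) Camera pose: rigidly align two snapshots of the same instant taken
#    from two viewpoints; the recovered (R, t) is the relative pose.
def kabsch(A, B):
    # Least-squares rigid transform (R, t) with B ~ A @ R.T + t.
    cA, cB = A.mean(axis=0), B.mean(axis=0)
    U, _, Vt = np.linalg.svd((A - cA).T @ (B - cB))
    D = np.diag([1.0, 1.0, np.sign(np.linalg.det(Vt.T @ U.T))])
    R = Vt.T @ D @ U.T
    return R, cB - R @ cA

snap_a = rng.normal(size=(25, 3))                 # snapshot from camera 0
theta = 0.4
R_true = np.array([[np.cos(theta), -np.sin(theta), 0.0],
                   [np.sin(theta),  np.cos(theta), 0.0],
                   [0.0, 0.0, 1.0]])
t_true = np.array([1.0, -2.0, 0.5])
snap_b = snap_a @ R_true.T + t_true               # same instant, camera 1
R_est, t_est = kabsch(snap_a, snap_b)
assert np.allclose(R_est, R_true) and np.allclose(t_est, t_true)
```

The point is structural: all three tasks are answered by the same function, differing only in which of its arguments are swept and which are frozen.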
D4RT demonstrates superior performance and efficiency compared to previous state-of-the-art methods. It is up to 300x faster than prior approaches, processing a one-minute video in roughly five seconds on a single TPU chip, where earlier methods could take up to ten minutes for the same task, a 120x improvement. Qualitatively, D4RT maintains a continuous and accurate understanding of dynamic objects, which often cause failures (such as duplicated or missing geometry) in other models. In evaluations, D4RT shows superior fidelity on the MPI Sintel benchmark (complex synthetic scenes with fast motion, motion blur, and non-rigid deformation), achieves top-tier 3D point tracking performance on the Aria Digital Twin dataset (handling ego-motion and occlusions in realistic environments), and secures the highest AUC score for camera pose estimation on the RE10k dataset (diverse indoor and outdoor scenes), indicating robust pose estimation without costly test-time optimization.
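The quoted timing figures can be reconciled with simple arithmetic, using the numbers stated above:

```python
# Sanity-check the quoted speedup: ~5 s for a one-minute clip versus a
# baseline that can take up to ten minutes (600 s) for the same clip.
d4rt_seconds = 5
baseline_seconds = 10 * 60
speedup = baseline_seconds / d4rt_seconds
print(speedup)  # → 120.0
```

The "up to 300x" figure is the best case across workloads; the 120x ratio corresponds to this specific one-minute-video comparison.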
The efficiency and accuracy of D4RT's real-time dynamic world capture capabilities pave the way for next-generation spatial computing applications. These include:
- Robotics: Providing the necessary spatial awareness for safe navigation and dexterous manipulation in dynamic environments populated by moving people and objects.
- Augmented Reality (AR): Enabling instant, low-latency understanding of a scene's geometry for on-device AR applications, crucial for seamlessly overlaying digital objects onto the real world.
- World Models: By effectively disentangling camera motion, object motion, and static geometry, D4RT brings AI closer to developing true "world models" of physical reality, a fundamental step towards Artificial General Intelligence (AGI).