I2-SLAM: Inverting Imaging Process for
Robust Photorealistic Dense SLAM

ECCV 2024
Seoul National University
(* Equal Contribution)

By inverting the imaging process, I2-SLAM reconstructs sharp, photorealistic HDR maps
from casually captured inputs that contain severe motion blur and varying appearances.

Abstract

We present an inverse image-formation module that can enhance the robustness of existing visual SLAM pipelines for casually captured scenarios. Casual video captures often suffer from motion blur and varying appearances, which degrade the final quality of a coherent 3D visual representation. We propose integrating the physical image formation process into the SLAM system, which employs linear HDR radiance maps to collect measurements. Specifically, individual frames aggregate images rendered from multiple poses along the camera trajectory to explain the motion blur prevalent in hand-held videos. Additionally, we accommodate per-frame appearance variation by dedicating explicit variables to the image formation steps, namely white balance, exposure time, and the camera response function. Through joint optimization of these additional variables, the SLAM pipeline produces high-quality images with more accurate trajectories. Extensive experiments demonstrate that our approach can be incorporated into recent visual SLAM pipelines using various scene representations, such as neural radiance fields or Gaussian splatting.
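
The abstract describes a forward image formation model: a linear HDR radiance value is scaled by a per-frame exposure time and per-channel white balance, then mapped to an LDR observation through a camera response function (CRF). Below is a minimal PyTorch-style sketch of that idea; the function names and the gamma-style CRF parameterization are illustrative assumptions, not the paper's actual code.

import torch

def apply_crf(x, gamma_param):
    # Toy monotonic camera response curve driven by one learnable
    # per-frame parameter. The paper optimizes an explicit CRF; this
    # gamma curve is only an illustrative stand-in.
    gamma = torch.nn.functional.softplus(gamma_param)
    return x.clamp(min=1e-6) ** gamma

def hdr_to_ldr(hdr, exposure_time, white_balance, gamma_param):
    # hdr: linear HDR radiance image, shape (H, W, 3)
    # exposure_time: scalar per-frame exposure
    # white_balance: per-channel gains, shape (3,)
    irradiance = hdr * exposure_time * white_balance
    ldr = apply_crf(irradiance, gamma_param)
    return ldr.clamp(0.0, 1.0)  # model sensor clipping to the LDR range

Because all per-frame variables (exposure, white balance, CRF parameters) enter differentiably, they can be jointly optimized with the scene representation and camera poses.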

Method

We reconstruct a sharp HDR radiance map. Motion blur is simulated by integrating sharp images rendered from virtual camera poses along the trajectory during the exposure time. We then obtain the blurry LDR image by applying a differentiable tone-mapping module. The SLAM pipeline simultaneously performs tracking and mapping from the degraded input images to reconstruct a sharp HDR map.
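
A rough sketch of this blur model follows. Here render_fn stands in for the underlying renderer (NeRF or Gaussian splatting), virtual_poses for camera poses interpolated within the exposure interval, and tone_map for a differentiable tone mapping such as the hdr_to_ldr sketch above; these names are assumptions for illustration, not the paper's API.

import torch

def render_blurry_ldr(render_fn, virtual_poses, tone_map):
    # render_fn(pose) -> sharp linear HDR render of shape (H, W, 3).
    # Averaging sharp renders over poses sampled along the exposure
    # interval discretely approximates the time integral of irradiance.
    hdr_frames = torch.stack([render_fn(pose) for pose in virtual_poses])
    hdr_blur = hdr_frames.mean(dim=0)
    # Tone-map the blurred HDR image to the observed blurry LDR frame.
    return tone_map(hdr_blur)

Note that the averaging must happen in linear HDR space, before tone mapping, because physical blur integrates irradiance rather than nonlinearly encoded pixel values.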

Results

I2-SLAM is a generic module that improves the quality of existing visual SLAM approaches by inverting the image formation process for casually captured videos.

RGB-D SLAM results on the ScanNet dataset

Image comparisons (two scenes): Input frame · SplaTAM [1] · I2-SLAM (Ours)

RGB SLAM results on the TUM dataset

Image comparisons (two scenes): Input frame · NeRF-SLAM [2] · I2-SLAM (Ours)

References

[1] Nikhil Keetha et al., SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM, CVPR 2024
[2] Antoni Rosinol et al., NeRF-SLAM: Real-Time Dense Monocular SLAM with Neural Radiance Fields, IROS 2023

BibTeX

@InProceedings{I2-SLAM_2024,
  author    = {Bae, Gwangtak and Choi, Changwoon and Heo, Hyeongjun and Kim, Sang Min and Kim, Young Min},
  title     = {I2-SLAM: Inverting Imaging Process for Robust Photorealistic Dense SLAM},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
}