Login / Signup
MMScan: A Multi-Modal 3D Scene Dataset with Hierarchical Grounded Language Annotations.
Ruiyuan Lyu
Tai Wang
Jingli Lin
Shuai Yang
Xiaohan Mao
Yilun Chen
Runsen Xu
Haifeng Huang
Chenming Zhu
Dahua Lin
Jiangmiao Pang
Published in:
CoRR (2024)
Keyphrases
</>
multi modal
d scene
image annotation
single image
depth map
camera parameters
multi modality
optical flow
coordinate frame
scene geometry
flow estimation
cross modal
planar patches
camera viewpoint
least squares
viewpoint
high dimensional
three dimensional
data sets