UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation.
Haiyang Wang
Hao Tang
Shaoshuai Shi
Aoxue Li
Zhenguo Li
Bernt Schiele
Liwei Wang
Published in:
ICCV (2023)
Keyphrases
multi modal
multi modality
audio visual
semantic concepts
video search
cross modal
high dimensional
eye movements
fusing multiple
feature selection
mutual information
image classification
visual recognition