Login / Signup
UniTR: A Unified and Efficient Multi-Modal Transformer for Bird's-Eye-View Representation.
Haiyang Wang
Hao Tang
Shaoshuai Shi
Aoxue Li
Zhenguo Li
Bernt Schiele
Liwei Wang
Published in:
CoRR (2023)
Keyphrases
</>
multi modal
audio visual
multi modality
high dimensional
image annotation
cross modal
video retrieval
humanoid robot
machine learning
computer vision
image processing
fuzzy logic
multimedia retrieval
single modality
uni modal