Login / Signup
TVT: Three-Way Vision Transformer through Multi-Modal Hypersphere Learning for Zero-Shot Sketch-Based Image Retrieval.
Jialin Tian
Xing Xu
Fumin Shen
Yang Yang
Heng Tao Shen
Published in:
AAAI (2022)
Keyphrases
</>
multi modal
multi modality
image processing
audio visual
high dimensional
video search
information retrieval
multiscale
image annotation
fusing multiple
sketch based image retrieval