Login / Signup
You Only Look & Listen Once: Towards Fast and Accurate Visual Grounding.
Qing Du
Yucheng Luo
Published in:
ICDCS Workshops (2022)
Keyphrases
</>
visual features
visual information
high quality
computationally efficient
high level
visual representation
visual exploration
real time
neural network
machine learning
social networks
computer vision
high accuracy
multi modal
visual perception