A cross-modal crowd counting method combining CNN and cross-modal transformer.
Shihui Zhang
Wei Wang
Weibo Zhao
Lei Wang
Qunpeng Li
Published in:
Image Vis. Comput. (2023)
Keyphrases
cross modal
multi modal
multimedia retrieval
similarity measure
visual recognition