Login / Signup

A cross-modal crowd counting method combining CNN and cross-modal transformer.

Shihui ZhangWei WangWeibo ZhaoLei WangQunpeng Li
Published in: Image Vis. Comput. (2023)
Keyphrases
  • cross modal
  • multi modal
  • multimedia retrieval
  • similarity measure
  • visual recognition