Login / Signup
Joint Learning of Visual-Audio Saliency Prediction and Sound Source Localization on Multi-face Videos.
Minglang Qiao
Yufan Liu
Mai Xu
Xin Deng
Bing Li
Weiming Hu
Ali Borji
Published in:
CoRR (2021)
Keyphrases
</>
source localization
visual features
reinforcement learning
pattern recognition
visual information
mobile learning
high level
non stationary