Login / Signup
AudViSum: Self-Supervised Deep Reinforcement Learning for Diverse Audio-Visual Summary Generation.
Sanjoy Chowdhury
Aditya Patra
Subhrajyoti Dasgupta
Ujjwal Bhattacharya
Published in:
BMVC (2021)
Keyphrases
</>
audio visual
summary generation
reinforcement learning
multi modal
visual information
video summarization
multi stream
emotion recognition
person authentication
temporal context
visual data
audio visual speech recognition
machine learning
multiscale
image data
low level
object recognition