Audio-Visual Scene-Aware Dialog.
Huda AlAmriVincent CartillierAbhishek DasJue WangStefan LeePeter AndersonIrfan EssaDevi ParikhDhruv BatraAnoop CherianTim K. MarksChiori HoriPublished in: CoRR (2019)
Keyphrases
- audio visual
- visual data
- video scene
- multi modal
- visual information
- video sequences
- emotion recognition
- person authentication
- multi stream
- multimedia
- video summarization
- audio visual speech recognition
- temporal context
- natural language
- high dimensional data
- input image
- user interface
- moving objects
- multimodal fusion
- video data
- image data
- high dimensional
- three dimensional
- low level
- image sequences
- computer vision