Coordinated Joint Multimodal Embeddings for Generalized Audio-Visual Zero-shot Classification and Retrieval of Videos.
Kranti Kumar ParidaNeeraj MatiyaliTanaya GuhaGaurav SharmaPublished in: WACV (2020)
Keyphrases
- audio visual
- multi modal
- video summarization
- audio visual content
- visual data
- visual information
- multimodal fusion
- sports video
- multi stream
- multimedia
- image classification
- pattern recognition
- audio visual speech recognition
- audio features
- information retrieval systems
- feature vectors
- image database
- text classification
- information retrieval
- temporal context
- feature selection
- person authentication
- video sequences
- feature extraction
- video content
- video frames
- training set