Multi-level Attention Network using Text, Audio and Video for Depression Prediction.
Anupama Ray, Siddharth Kumar, Rutvik Reddy, Prerana Mukherjee, Ritu Garg
Published in: AVEC@MM (2019)
Keyphrases
- multimedia
- audio video
- audio content
- digital video
- video data
- natural language descriptions
- video sequences
- information retrieval
- video content analysis
- text graphics
- prediction accuracy
- radial basis function network
- news video
- closed captions
- audio files
- video content
- peer to peer
- scene change detection
- multimedia processing
- multimedia information
- content based video retrieval
- text detection
- visual data
- video files
- video database
- prediction model
- video frames
- keywords
- video material
- video search
- video collections
- audio signals
- video recordings
- lecture videos
- video scene
- soccer video
- real time
- multimedia documents
- audio visual
- video analysis
- text data
- visual attention
- network structure
- video streams
- text mining
- neural network