Automatic Language Identification in music videos with low level audio and visual features.
Vijay ChandrasekharMehmet Emre SarginDavid A. RossPublished in: ICASSP (2011)
Keyphrases
- audio features
- visual features
- speaker identification
- language identification
- low level
- visual information
- image classification
- acoustic features
- visual content
- low level features
- semantic concepts
- image search
- visual data
- image retrieval
- high level
- key frames
- audio visual
- keywords
- human actions
- feature set
- image collections
- image annotation
- semantic gap
- higher level
- speaker verification
- semantic features
- music information retrieval
- document images
- video shots
- language model