City-Identification of Flickr Videos Using Semantic Acoustic Features.
Benjamin ElizaldeGuan-Lin ChaoMing ZengIan R. LanePublished in: BigMM (2016)
Keyphrases
- acoustic features
- audio features
- automatic speech recognition
- speaker verification
- speech signal
- visual features
- music information retrieval
- semantic information
- image collections
- high level
- information extraction
- image retrieval
- video sequences
- video content
- hidden markov models
- pattern recognition
- feature extraction
- face recognition