No-Audio Multimodal Speech Detection in Crowded Social Settings Task at MediaEval 2018.
Laura Cabrera QuirosEkin GedikHayley HungPublished in: MediaEval (2018)
Keyphrases
- audio visual
- multi modal
- visual information
- emotion recognition
- audio features
- multimodal fusion
- visual data
- multi stream
- detection method
- automatic detection
- multimedia
- false positives
- social media
- audio stream
- social interaction
- detection algorithm
- speaker identification
- speaker verification
- social networks
- object detection
- crowded scenes
- music retrieval
- audio signals
- social networking
- event detection
- speech processing
- sound source
- visual speech
- digital audio
- cross modal