Lost in segmentation: Three approaches for speech/non-speech detection in consumer-produced videos.
Benjamin ElizaldeGerald FriedlandPublished in: ICME (2013)
Keyphrases
- speech recognition
- speech synthesis
- audio visual
- spoken language
- noisy environments
- speech signal
- edge detection
- temporal segmentation
- video scene
- automatic speech recognition
- text to speech
- segmentation algorithm
- image segmentation
- motion segmentation
- medical images
- segmentation accuracy
- endoscopic video
- voice activity detection
- image segmentation algorithms
- word segmentation
- video content
- detection rate
- segmentation method
- moving objects
- multiscale
- image sequences