Using enhanced F0-trajectories for multiple speaker detection in audio monitoring scenarios.
Alessia Cornaggia-Urrigshardt, Frank Kurth
Published in: EUSIPCO (2015)
Keyphrases
- audio visual
- object detection
- real time
- automatic detection
- monitoring system
- detection method
- false positives
- prosodic features
- multimedia
- real world
- detection algorithm
- visual information
- false alarms
- detection accuracy
- action recognition
- signal processing
- detection rate
- metadata
- automatic speech recognition
- speaker identification
- neural network