Online multi-speaker tracking using multiple microphone arrays informed by auditory scene analysis.
Axel PlingeGernot A. FinkPublished in: EUSIPCO (2013)
Keyphrases
- scene analysis
- real time
- video scene
- automatic speech recognition
- online learning
- computer vision
- state space
- visual tracking
- speech recognition
- kalman filter
- visual information
- speaker diarization
- particle filter
- signal processing
- object tracking
- information processing
- partial occlusion
- multiple targets
- sound source
- low level
- feature space