Multi-Speaker Tracking From an Audio-Visual Sensing Device.
Xinyuan QianAlessio BruttiOswald LanzMaurizio OmologoAndrea CavallaroPublished in: IEEE Trans. Multim. (2019)
Keyphrases
- audio visual
- particle filter
- multimedia
- appearance model
- speaker identification
- audio visual speech recognition
- prosodic features
- automatic transcription
- signal processing
- audio stream
- object tracking
- motion tracking
- particle filtering
- audio video
- acoustic features
- audio signals
- visual tracking
- kalman filter
- visual information
- speech recognition
- speaker recognition
- motion analysis
- multi stream
- motion estimation
- image sequences