Banquet speaker: "The MP3 story and more: Perceptual audio coding from its beginnings to the present".
Jürgen HerrePublished in: SoCC (2013)
Keyphrases
- audio visual
- speaker identification
- prosodic features
- cross modal
- automatic transcription
- audio stream
- coding scheme
- speaker verification
- multi modal
- multimedia
- signal processing
- visual information
- speaker diarization
- audio features
- visual data
- human perception
- speech recognition
- story telling
- acoustic features
- coding method
- human visual system
- broadcast news
- low level
- linear predictive coding
- audio video
- interactive storytelling
- visual speech
- video signals
- music information retrieval
- linear prediction
- multimedia information
- perceptual grouping