Parametric AM/FM decomposition for speech and audio coding.
Tom BäckströmSascha DischPublished in: WASPAA (2009)
Keyphrases
- audio stream
- audio visual
- linear predictive coding
- broadcast news
- audio signals
- speaker identification
- text to speech
- emotion recognition
- audio features
- cepstral features
- linear predictive
- linear prediction
- digital audio
- acoustic signals
- speech signal
- speech recognition
- speech music discrimination
- coding scheme
- speech processing
- automatic speech recognition
- audio recordings
- audio video
- acoustic signal
- spoken language
- speech synthesis
- non stationary
- automatic transcription
- signal processing
- multimedia
- multi stream
- spoken documents
- prosodic features
- speaker recognition
- multi modal
- human language
- coding method
- digital video
- video streams
- voice activity detection
- acoustic features
- mel frequency cepstral coefficients
- wavelet packet
- video signals
- parametric models
- cepstral coefficients
- visual data
- gaussian mixture model
- filter bank