A wideband speech and audio coding candidate for ITU-T G.711WBE standardization.
Yusuke Hiwasaki, Takeshi Mori, Shigeaki Sasaki, Hitoshi Ohmuro, Akitoshi Kataoka
Published in: ICASSP (2008)
Keyphrases
- speech signal
- audio stream
- speaker identification
- audio visual
- linear prediction
- linear predictive
- broadcast news
- linear predictive coding
- audio signals
- speech quality
- speech recognition
- cepstral features
- mel frequency cepstral coefficients
- automatic speech recognition
- acoustic features
- speech processing
- text to speech
- emotion recognition
- coding scheme
- audio recordings
- digital audio
- acoustic signals
- audio features
- cepstral coefficients
- spoken documents
- prosodic features
- audio video
- signal processing
- speaker recognition
- audio signal
- video signals
- speech synthesis
- automatic transcription
- visual information
- multi modal
- multimedia
- speech music discrimination
- compression standards
- multi stream
- signal to noise ratio
- non stationary
- speaker diarization
- noisy environments
- coding method
- visual speech
- speaker verification
- packet loss
- hidden markov models