A High-Quality Speech and Audio Codec With Less Than 10 ms Delay.
Jean-Marc ValinTimothy B. TerriberryChristopher MontgomeryGregory MaxwellPublished in: CoRR (2016)
Keyphrases
- high quality
- audio stream
- audio visual
- broadcast news
- audio signals
- speaker identification
- text to speech
- cepstral features
- emotion recognition
- speech segments
- audio features
- speech recognition
- digital audio
- audio recordings
- acoustic signals
- automatic transcription
- speech processing
- prosodic features
- multimedia
- speech music discrimination
- audio video
- visual information
- automatic speech recognition
- linear predictive coding
- low quality
- speech synthesis
- multi stream
- video coding
- multi modal
- speech signal
- hidden markov models
- acoustic features
- visual data
- human language
- pac man
- signal processing
- voice activity detection
- image quality
- high resolution
- gaussian mixture model
- spontaneous speech
- visual speech
- video streams
- audio signal
- spoken documents
- speaker diarization
- motion estimation
- speaker verification
- digital video
- coding method