Keyphrases
- audio stream
- audio visual
- broadcast news
- audio signals
- speaker identification
- emotion recognition
- text to speech
- audio features
- digital audio
- cepstral features
- speech processing
- audio recordings
- linear predictive coding
- automatic transcription
- prosodic features
- speech recognition
- speech music discrimination
- multimedia
- facial animation
- acoustic features
- speech synthesis
- multi stream
- signal processing
- multi modal
- text input
- audio video
- speech signal
- acoustic signals
- spoken documents
- computer graphics
- text recognition
- automatic speech recognition
- user interface
- content based video retrieval
- visual information
- motion capture
- low level
- gaussian mixture model
- human language
- speaker diarization
- voice activity detection
- visual speech
- handwritten characters
- audio signal
- speech recognition technology
- speech corpus
- speaker recognition
- computer animation
- recognition engine
- mel frequency cepstral coefficients