The future of speech and audio in the interface.
Barry AronsElizabeth D. MynattPublished in: CHI Conference Companion (1994)
Keyphrases
- audio stream
- audio visual
- broadcast news
- audio signals
- emotion recognition
- text to speech
- speaker identification
- speech processing
- digital audio
- visual information
- cepstral features
- audio recordings
- audio video
- audio features
- speech recognition
- text input
- long term
- multi stream
- multimedia
- user friendly
- prosodic features
- linear predictive coding
- voice activity detection
- acoustic features
- speaker verification
- speech synthesis
- noisy environments
- visual data
- signal processing
- hands free
- video sequences
- metadata
- speech music discrimination