RTP Payload Format for the Opus Speech and Audio Codec.
Julian Spittka, Koen Vos, Jean-Marc Valin
Published in: RFC (2015)
Keyphrases
- audio stream
- audio visual
- multimedia
- broadcast news
- audio signals
- emotion recognition
- text to speech
- speaker identification
- audio features
- digital audio
- audio recordings
- cepstral features
- end to end
- speech recognition
- speech processing
- prosodic features
- linear predictive coding
- speech signal
- acoustic features
- high speed
- multi modal
- multi stream
- acoustic signals
- automatic transcription
- video coding
- speech music discrimination
- video codec
- spoken documents
- automatic speech recognition
- visual speech
- metadata
- audio video
- motion estimation
- speaker verification
- visual information
- network traffic
- voice activity detection
- digital video
- hidden markov models
- human language
- spoken language
- audio signal
- image compression
- noisy environments
- association rule discovery
- speaker diarization
- speech synthesis
- distributed video coding
- visual data
- feature set