Treemaps to visualise and navigate speech audio.
Fahmi AbdulhamidStuart MarshallPublished in: OZCHI (2013)
Keyphrases
- audio stream
- audio visual
- broadcast news
- audio signals
- speaker identification
- cepstral features
- emotion recognition
- text to speech
- digital audio
- audio features
- linear predictive coding
- speech music discrimination
- audio recordings
- speech processing
- speech recognition
- audio video
- multi stream
- prosodic features
- multimedia
- audio visual speech recognition
- acoustic signals
- automatic transcription
- spoken documents
- speech signal
- signal processing
- speech synthesis
- multi modal
- visual information
- visual data
- speech recognition technology
- speaker diarization
- gaussian mixture model
- human language
- spontaneous speech
- speaker recognition
- feature extraction
- music information retrieval
- endpoint detection
- language acquisition
- neural network
- automatic speech recognition
- video streams
- speaker verification
- spoken document retrieval