Computational auditory scene analysis by using statistics of high-dimensional speech dynamics and sound source direction.
Johannes Nix, Michael Kleinschmidt, Volker Hohmann
Published in: INTERSPEECH (2003)
Keyphrases
- sound source
- computational auditory scene analysis
- speech signal
- high dimensional
- audio visual
- source localization
- speech recognition
- multi modal
- sound signals
- visual data
- low dimensional
- high dimensional data
- visual information
- non stationary
- hidden Markov models
- focus of attention
- machine learning
- dimensionality reduction
- pattern recognition
- acoustic features
- higher level
- nearest neighbor
- data points
- image sequences
- computer vision
- information retrieval