A bilingual multi-modal voice corpus for language and speaker recognition (LASR) services.
Steven D. Beck, Reva Schwartz, Hirotaka Nakasone
Published in: Odyssey (2004)
Keyphrases
- multi-modal
- speaker recognition
- parallel corpus
- Gaussian mixture model
- audio visual
- speaker verification
- machine translation system
- cross lingual
- vector quantization
- machine translation
- target language
- probabilistic neural network
- natural language
- emotion recognition
- speaker identification
- semantic concepts
- cross language information retrieval
- statistical machine translation
- image annotation
- video search
- speech signal
- noisy environments
- neural network
- image compression
- probabilistic model
- multiresolution
- high dimensional
- machine learning