Multimodal and Multilingual Embeddings for Large-Scale Speech Mining.
Paul-Ambroise Duquenne, Hongyu Gong, Holger Schwenk
Published in: NeurIPS (2021)
Keyphrases
- audio visual
- multimodal interfaces
- multi lingual
- speech recognition
- multi modal
- multi stream
- spoken language
- text mining
- small scale
- web news
- real world
- mining algorithm
- low dimensional
- speech signal
- euclidean space
- cross lingual
- manifold learning
- vector space
- data mining
- human computer interaction
- data mining techniques
- dimensionality reduction
- frequent patterns
- knowledge discovery
- information retrieval
- multimedia
- digital libraries
- text to speech
- feature space
- automatic speech recognition
- language independent
- information extraction
- cross language
- privacy preserving
- association rule mining
- pattern mining