Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities.
Andros TjandraNayan SinghalDavid ZhangOzlem KalinliAbdelrahman MohamedDuc LeMichael L. SeltzerPublished in: ICASSP (2023)
Keyphrases
- generalization capabilities
- language independent
- cross lingual
- multi lingual
- multilingual information retrieval
- radial basis function
- description languages
- multilingual documents
- language resources
- comparable corpora
- machine translation
- digital libraries
- cross lingual information retrieval
- n gram
- automatic speech recognition
- genetic algorithm
- similarity measure
- language specific
- statistical machine translation
- character n grams
- named entities
- decision trees