MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages.
Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gökhan Tür, Prem Natarajan
Published in: ACL (1) (2023)
Keyphrases
- natural language understanding
- language independent
- cross lingual
- natural language
- multi lingual
- language specific
- multilingual information retrieval
- text understanding
- semantic analysis
- lexical knowledge
- knowledge representation
- semantic representations
- natural language processing
- language understanding
- language resources
- multilingual documents
- machine translation
- spoken dialog systems
- dialogue system
- database
- cross language
- machine learning
- databases
- language identification
- n gram
- domain knowledge