MASSIVE: A 1M-Example Multilingual Natural Language Understanding Dataset with 51 Typologically-Diverse Languages.
Jack FitzGerald, Christopher Hench, Charith Peris, Scott Mackie, Kay Rottmann, Ana Sanchez, Aaron Nash, Liam Urbach, Vishesh Kakarala, Richa Singh, Swetha Ranganath, Laurie Crist, Misha Britan, Wouter Leeuwis, Gökhan Tür, Prem Natarajan
Published in: CoRR (2022)
Keyphrases
- natural language understanding
- natural language
- language independent
- language specific
- cross lingual
- text understanding
- multi lingual
- multilingual information retrieval
- semantic analysis
- language understanding
- knowledge representation
- natural language processing
- lexical knowledge
- multilingual documents
- language resources
- semantic representations
- spoken dialog systems
- dialogue system
- machine translation
- database
- cross language
- information retrieval
- real world
- databases
- language identification
- machine translation system
- contextual information
- search engine