One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in Indonesia.
Alham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, Sebastian Ruder
Published in: CoRR (2022)
Keyphrases
- text summarization
- expressive power
- language independent
- machine translation
- syntactic and semantic dependencies
- databases
- question answering
- natural language
- case study
- co-occurrence
- information extraction
- information technology
- cross-lingual
- semantic analysis
- knowledge base
- target language
- grammatical inference
- South African
- grammar induction
- machine learning