Constructing and Expanding Low-Resource and Underrepresented Parallel Datasets for Indonesian Local Languages.
Joanito Agili LopoRadius TanonePublished in: CoRR (2024)
Keyphrases
- language independent
- parallel processing
- expressive power
- resource management
- multi lingual
- benchmark datasets
- data sets
- resource allocation
- uci machine learning repository
- machine translation
- shared memory
- language identification
- computer architecture
- massively parallel
- cross lingual
- database
- web resources
- feature selection
- parallel implementation
- databases
- target language
- neural network
- sampling methods
- learning algorithm
- parallel execution
- search engine
- query language