xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages.
Mingda ChenKevin HeffernanOnur ÇelebiAlexandre MourachkoHolger SchwenkPublished in: ACL (2) (2023)
Keyphrases
- expressive power
- data mining
- mining algorithm
- web mining
- frequent itemsets
- knowledge discovery
- data mining techniques
- data mining algorithms
- sequential patterns
- databases
- text mining
- resource constraints
- language independent
- resource management
- data mining methods
- natural language
- resource allocation
- association rule mining
- grammatical inference
- multi lingual