Login / Signup
xSIM++: An Improved Proxy to Bitext Mining Performance for Low-Resource Languages.
Mingda Chen
Kevin Heffernan
Onur Çelebi
Alexandre Mourachko
Holger Schwenk
Published in:
CoRR (2023)
Keyphrases
</>
knowledge discovery
text mining
expressive power
data mining
data mining techniques
resource allocation
sequential patterns
language independent
target language
database
high levels
sequential pattern mining
cross lingual
resource management
information resources
mining algorithm
web mining
itemsets
search engine