Login / Signup
Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus.
Isaac Caswell
Theresa Breiner
Daan van Esch
Ankur Bapna
Published in:
CoRR (2020)
Keyphrases
</>
natural language
web mining
artificial intelligence
web documents
text corpus