Lemmatizing and POS-tagging Akkadian with BabyLemmatizer and Dictionary-Based Post-Correction.
Aleksi SahalaTero AlstolaJonathan ValkKrister LindénPublished in: CLARIN Annual Conference (2022)
Keyphrases
- pos tagging
- word segmentation
- part of speech
- named entity recognition
- chinese word segmentation
- n gram
- dependency parsing
- language independent
- natural language processing
- machine translation
- domain adaptation
- penn treebank
- pos taggers
- language modeling
- document analysis
- query translation
- machine learning
- text classification
- tf idf
- word sense disambiguation
- wordnet
- semi supervised
- information extraction
- knowledge discovery
- similarity measure