Highly Language-Independent Word Lemmatization Using a Machine-Learning Classifier.
Iskander AkhmetovAlexandr PakIrina UaliyevaAlexander F. GelbukhPublished in: Computación y Sistemas (2020)
Keyphrases
- language independent
- n gram
- machine learning
- word level
- text classification
- word segmentation
- language specific
- feature selection
- chinese text retrieval
- word meanings
- learning algorithm
- decision trees
- support vector machine
- parallel corpus
- machine translation
- training data
- text retrieval
- natural language processing
- language model
- machine learning algorithms
- information extraction
- data mining
- active learning
- word sense disambiguation
- cross lingual
- co occurrence
- word recognition
- out of vocabulary
- word forms
- automatic summarization
- sentiment analysis
- digital libraries
- cross language
- language modeling
- test collection
- data analysis
- semi supervised learning
- training set
- text mining
- supervised learning
- knowledge discovery
- knowledge representation