An unsupervised Hindi stemmer with heuristic improvements.
Amaresh Kumar Pandey, Tanveer J. Siddiqui
Published in: AND (2008)
Keyphrases
- supervised classification
- optimal solution
- data-driven
- machine learning
- text classification
- language model
- search algorithm
- semi-supervised
- simulated annealing
- information extraction
- dynamic programming
- text categorization
- n-gram
- tabu search
- machine translation
- named entity recognition
- data sets
- language-independent
- greedy heuristic