Morphological random forests for language modeling of inflectional languages.
Ilya Oparin, Ondrej Glembek, Lukás Burget, Jan Cernocký
Published in: SLT (2008)
Keyphrases
- random forests
- language modeling
- word forms
- cross lingual
- language independent
- language model
- n gram
- random forest
- information retrieval
- decision trees
- multiword
- retrieval model
- text classification
- comparable corpora
- logistic regression
- query expansion
- machine learning algorithms
- cross language
- probabilistic model
- ensemble methods
- decision tree ensembles
- parallel corpora
- image processing
- web documents
- machine translation
- document retrieval
- prediction accuracy
- feature set
- feature vectors
- support vector
- neural network
- data sets