Low cost portability for statistical machine translation based on n-gram frequency and TF-IDF.
Matthias Eck, Stephan Vogel, Alex Waibel
Published in: IWSLT (2005)
Keyphrases
- n-gram
- tf-idf
- language modelling
- language model
- retrieval model
- vector space model
- text classification
- information retrieval
- text documents
- term frequency
- weighting scheme
- bag of words
- text categorization
- language modeling
- part of speech
- document clustering
- ranking algorithm
- document retrieval
- machine learning
- information extraction
- text mining
- information retrieval systems
- WordNet
- query expansion
- feature selection