Data Pre-Processing to Train a Better Lithuanian-English MT System.
Daiga DeksneRaivis SkadinsPublished in: Baltic HLT (2012)
Keyphrases
- data pre processing
- machine translation
- query translation
- data analysis
- preprocessing
- data mining
- target language
- machine translation system
- cross lingual
- parallel corpora
- cross language information retrieval
- statistical machine translation
- dimension reduction
- natural language processing
- natural language
- feature selection
- pattern extraction
- machine learning
- information extraction
- data preparation
- decision tree algorithm
- missing values
- data mining process
- cross language
- real world
- post processing
- data warehouse
- high dimensionality
- cluster analysis
- association rules
- databases