Statistical Approaches to Patent Translation for PatentMT - Experiments with Various Settings of Training Data.
Yuen-Hsien Tseng, Chao-Lin Liu, Chia-Chi Tsai, Jui-Ping Wang, Yi-Hsuan Chuang, James Jeng
Published in: NTCIR (2011)
Keyphrases
- statistical approaches
- training data
- key phrase extraction
- learning algorithm
- training set
- decision trees
- test data
- test set
- machine translation
- information retrieval
- training process
- data sets
- supervised learning
- patent search
- statistical methods
- prior knowledge
- training examples
- classification accuracy
- classification models
- training samples
- semi-supervised learning
- labeled data
- text classification
- generalization error
- cross-language information retrieval
- learned from training data
- databases
- patent documents
- statistical machine translation
- machine learning
- training instances
- target language
- query translation
- noisy data
- document collections
- higher order