A Classifier to Determine Whether a Document is Professionally or Machine Translated.
Michael Luckert, Moritz Schaefer-Kehnert, Welf Löwe, Morgan Ericsson, Anna Wingkvist
Published in: BIR (2016)
Keyphrases
- text classifiers
- document collections
- information retrieval systems
- training data
- classifier systems
- decision trees
- keywords
- training documents
- document images
- linear classifiers
- training set
- feature selection
- learning algorithm
- tf-idf
- document classification
- semantic information
- flowshop
- svm classifier
- web documents
- text classification
- neural network
- text documents
- classification method
- database
- classifier ensemble
- clustering algorithm
- batch processing