Dramatically Reducing Training Data Size Through Vocabulary Saturation.
William LewisSauleh EetemadiPublished in: WMT@ACL (2013)
Keyphrases
- training data
- data sets
- scales linearly
- learning algorithm
- artificial intelligence
- decision trees
- prior knowledge
- learned from training data
- computational complexity
- keywords
- machine learning
- databases
- training set
- bayesian networks
- classification accuracy
- input data
- test set
- test data
- training process
- training dataset
- small size
- fixed size