One Document, Many Revisions: A Dataset for Classification and Description of Edit Intents.
Dheeraj RajagopalXuchao ZhangMichael GamonSujay Kumar JauharDiyi YangEduard H. HovyPublished in: LREC (2022)
Keyphrases
- document classification
- classification accuracy
- benchmark datasets
- feature set
- classification method
- decision trees
- pattern classification
- database
- support vector machine
- pattern recognition
- machine learning
- classification scheme
- high level
- training dataset
- information retrieval
- information retrieval systems
- support vector machine svm
- automatic classification
- classification rules
- data sets
- web documents
- document collections
- text classification
- training set
- feature extraction
- model selection
- image classification
- supervised learning
- class labels
- multi class
- feature vectors
- classification models
- feature space
- support vector
- feature selection
- uci datasets