ChemTables: a dataset for semantic classification on tables in chemical patents.
Zenan Zhai, Christian Druckenbrodt, Camilo Thorne, Saber A. Akhondi, Dat Quoc Nguyen, Trevor Cohn, Karin Verspoor
Published in: J. Cheminformatics (2021)
Keyphrases
- classification accuracy
- database
- classification scheme
- automatic classification
- pattern recognition
- feature space
- benchmark datasets
- feature set
- image classification
- uci datasets
- databases
- decision trees
- feature selection
- semantic information
- classification systems
- neural network
- information retrieval
- classification rules
- classification method
- support vector machine (svm)
- feature extraction
- support vector
- class labels
- preprocessing
- classification algorithm
- training samples
- machine learning methods
- pattern classification
- classification models
- domain specific
- support vector machine
- intellectual property
- training set