Building the Dresden Web Table Corpus: A Classification Approach.
Julian Eberius, Katrin Braunschweig, Markus Hentsch, Maik Thiele, Ahmad Ahmadov, Wolfgang Lehner
Published in: BDC (2015)
Keyphrases
- supervised machine learning
- decision trees
- classification method
- image classification
- web applications
- website
- machine learning methods
- web pages
- pattern recognition
- classification scheme
- feature space
- benchmark datasets
- automatic classification
- feature selection
- information retrieval
- web content
- support vector machine (SVM)
- classification algorithm
- web mining
- training samples
- classification rules
- linked data
- link analysis
- user generated content
- textual features
- decision rules
- test set
- web documents
- database
- information sources
- model selection
- text classification
- support vector machine
- classification accuracy
- training set
- feature extraction
- machine learning
- data sets