FastText and XGBoost Content-Based Classification for Employment Web Scraping.
Arkadiusz TalunPawel DrozdaLeszek BukowskiRafal SchererPublished in: ICAISC (2) (2020)
Keyphrases
- classification accuracy
- classification systems
- web applications
- classification algorithm
- pattern classification
- information technology
- text classification
- image classification
- support vector machine svm
- feature vectors
- image retrieval
- support vector
- classification method
- decision trees
- automatic classification
- class labels
- machine learning methods
- semantic web
- web technologies
- classification process
- support vector machine
- feature selection
- feature space
- web pages
- multimedia
- website
- web mining
- data sets
- data mining
- classification scheme
- linked data
- unsupervised learning
- training data
- pattern recognition
- decision rules
- benchmark datasets
- knn
- information sources