Data Enrichment Pipeline Model for Web Classification Based on Web Scraping and Machine Learning.
Evegeniya SamsonovaZlatan MoricGoran GvozdenTomislav HlupiPublished in: MIPRO (2024)
Keyphrases
- data points
- input data
- machine learning
- feature space
- high dimensional data
- dimensionality reduction
- experimental data
- information sources
- website
- probabilistic model
- data sets
- classification algorithm
- web data
- classification models
- end users
- support vector machine
- feature extraction
- decision trees
- web pages
- database
- semantic web
- web documents
- linked data
- machine learning methods
- web mining
- test data
- data extraction
- roc analysis
- web applications
- classification accuracy
- data analysis
- pattern recognition
- learning algorithm
- training samples
- text classification
- classification method
- supervised learning
- active learning
- network structure
- data model
- learning models
- web resources
- neural network