Machine-Learning directed Article Detection on the Web using DOM and text-based features.
Shobhit MathurPritam NikamHarshita PatidarRohan Bapusaheb GaikwadPreeti Narayan NayakPublished in: CCNC (2021)
Keyphrases
- machine learning
- website
- false positives
- textual features
- web pages
- feature extraction
- pattern recognition
- web applications
- feature set
- detection algorithm
- image features
- data mining
- support vector machine classifier
- detection method
- feature space
- feature selection
- database
- low level
- false alarms
- linked data
- web content
- feature vectors
- keypoints
- web documents
- computer vision
- machine learning approaches
- detection rate
- machine learning methods
- decision trees
- end users
- anomaly detection
- xml documents
- co occurrence
- text mining
- object detection
- information extraction
- support vector machine
- classification accuracy