Using Domain Top-page Similarity Feature in Machine Learning-Based Web Phishing Detection.
Nuttapong Sanglerdsinlapachai, Arnon Rungsawang
Published in: WKDD (2010)
Keyphrases
- machine learning
- website
- web pages
- web mining
- detection algorithm
- web data
- web applications
- detection method
- similarity measure
- object detection
- web content
- domain specific
- page content
- learning algorithm
- natural language processing
- web browsing
- end users
- feature vectors
- feature selection
- anchor text
- spam filtering
- content features
- specific domains
- data mining
- detection rate
- machine learning methods
- page importance
- search engine
- web documents
- semantic web
- distance measure
- social networks
- decision trees
- web search
- hyperlink structure
- recommender systems
- web crawler
- information extraction
- text mining
- transfer learning
- web graph
- link analysis
- web users
- machine learning algorithms