Identification of malicious web pages for crawling based on network-related attributes of web server.
Gen HattoriKazunori MatsumotoChihiro OnoYasuhiro TakishimaPublished in: IUCS (2010)
Keyphrases
- web server
- web pages
- web crawlers
- website
- web browser
- search engine
- database server
- related web pages
- web search engines
- web search
- web content
- dynamic content
- web users
- web data
- keywords
- web documents
- link analysis
- web server logs
- web usage mining
- end users
- network traffic
- log files
- web logs
- web graph
- admission control
- focused crawling
- user sessions
- link prediction
- machine learning
- link structure
- data extraction
- user interface
- data objects