Soft-404 Pages, A Crawling Problem.
Víctor M. PrietoManuel ÁlvarezFidel CachedaPublished in: J. Digit. Inf. Manag. (2014)
Keyphrases
- search engine
- web pages
- web crawling
- focused crawling
- web graph
- web crawlers
- focused crawler
- web crawler
- website
- topic specific
- web applications
- link structure
- link analysis
- result merging
- keywords
- web search
- page importance
- web users
- text content
- textual content
- graph mining
- data objects
- ranking algorithm
- web mining
- web documents
- social networks
- information retrieval
- databases
- search queries
- query logs
- hard constraints
- web search engines
- page content
- neural network