Evaluating Global Link Structure of the Web for Focused Crawling in the Genomics and Genetics Domains.
Ari PirkolaTuomas TalvensaariPublished in: HEALTHINF (2009)
Keyphrases
- link structure
- focused crawling
- focused crawler
- web documents
- web pages
- link analysis
- web data
- topic specific
- anchor text
- semi structured
- ranking algorithm
- web content
- information extraction
- web mining
- website
- web graph
- search engine
- web search engines
- semantic information
- web search
- web sources
- keywords
- text content
- web users
- link prediction
- text mining
- wikipedia articles
- data mining
- data fusion