Got 404s? Crawling and Analyzing an Institution's Web Domain.
Martin KleinLyudmila BalakirevaPublished in: TPDL (2022)
Keyphrases
- web pages
- web mining
- web crawling
- website
- specific domains
- domain specific
- web applications
- web crawlers
- web data
- web content
- web users
- web graph
- information sources
- semantic web
- search engine
- domain independent
- database
- linked data
- web technologies
- cross domain
- user experience
- domain ontology
- web queries
- web information
- topic specific
- web information retrieval
- focused crawling
- meta search
- web crawler
- link analysis