Focused Crawl of Web Archives to Build Event Collections.
Martin Klein, Lyudmila Balakireva, Herbert Van de Sompel
Published in: WebSci (2018)
Keyphrases
- web pages
- internet archive
- digital libraries
- metadata
- web crawlers
- web applications
- website
- web documents
- focused crawling
- web search
- semantic web
- web crawler
- real world events
- information retrieval
- digital collections
- news reports
- cultural heritage
- web users
- search engine
- multimedia
- event detection
- user generated content
- web data
- focused crawler
- music collections
- end users
- information sources
- document collections
- web crawling
- web mining
- web scale
- data extraction
- deep web
- topic specific
- link analysis
- web content
- linked data
- document archives
- databases