Focused Crawl of Web Archives to Build Event Collections.
Martin Klein, Lyudmila Balakireva, Herbert Van de Sompel
Published in: CoRR (2018)
Keyphrases
- web pages
- internet archive
- digital libraries
- website
- focused crawling
- focused crawler
- web applications
- web crawlers
- metadata
- news reports
- web search
- web documents
- deep web
- search engine
- web users
- web data
- information retrieval
- web content
- semantic web
- web crawler
- web crawling
- digital archives
- web mining
- data repositories
- web sources
- document collections
- real world events
- music collections
- multimedia
- event driven
- event recognition
- user generated content
- web technologies
- digital collections
- link analysis
- web resources
- end users
- cultural heritage