Analyzing Web Archives Through Topic and Event Focused Sub-collections.
Gerhard Gossen, Elena Demidova, Thomas Risse
Published in: CoRR (2016)
Keyphrases
- focused crawler
- internet archive
- metadata
- digital libraries
- news articles
- focused crawling
- website
- web pages
- topic specific
- web applications
- web communities
- news stories
- information retrieval
- real world events
- news reports
- web documents
- online news
- data sets
- digital archives
- web data
- web mining
- cultural heritage
- event driven
- event detection
- multimedia
- emerging topics
- web users
- data collections
- digital objects
- web technologies
- semantic web
- digital collections
- information sources
- document archives