Large-Scale Collections Under The Magnifying Glass: Format Identification For Web Archives.
Clément OuryPublished in: iPRES (2010)
Keyphrases
- metadata
- web scale
- internet archive
- digital libraries
- website
- multimedia
- web applications
- chinese web
- music collections
- semantic web
- web documents
- web resources
- information sources
- document collections
- web mining
- digital collections
- information retrieval
- real world
- web data
- small scale
- cultural heritage
- web communities
- plain text
- web pages
- databases
- data repositories
- real life