Evaluating text reuse discovery on the web.
Stanford Chiu, Ibrahim Uysal, W. Bruce Croft
Published in: IIiX (2010)
Keyphrases
- web documents
- information retrieval and extraction
- text information
- website
- web applications
- textual data
- digital documents
- textual features
- plain text
- text content
- web resources
- information retrieval
- web mining
- link analysis
- content and structure
- newspaper articles
- multilingual
- data mining
- web images
- text retrieval
- web pages
- social networks
- database
- information discovery
- information sources
- knowledge discovery
- end users
- content features
- html pages
- semantic markup