Detecting Content Drift on the Web Using Web Archives and Textual Similarity.
Brenda Reyes AyalaQiufeng DuJuyi HanPublished in: TPDL Workshops (2022)
Keyphrases
- web content
- web resources
- user generated content
- web documents
- website
- metadata
- web pages
- web applications
- multimedia
- semantic web
- link analysis
- dynamic content
- text information
- textual data
- similarity measure
- digital libraries
- information sources
- content management
- user interests
- content similarity
- web technologies
- web users
- web mining
- end users
- linked data
- online resources
- plain text
- textual features