Copy detection in Chinese documents using Ferret.
Jun Peng BaoCaroline LyonPeter C. R. LanePublished in: Lang. Resour. Evaluation (2006)
Keyphrases
- copy detection
- information retrieval
- document collections
- keyword extraction
- free text
- relevant documents
- text documents
- document classification
- web documents
- information retrieval systems
- document clustering
- metadata
- text summarization
- xml documents
- legal documents
- structured documents
- chinese text
- latent semantic analysis
- vector space model
- document retrieval
- test collection
- question answering
- digital libraries