Near-Duplicates Detection for Vietnamese Documents in Large Database.
Cong Thanh TruongThe Duy BuiBao Son PhamPublished in: ALPIT (2008)
Keyphrases
- database
- databases
- metadata
- document collections
- database systems
- xml documents
- information retrieval systems
- document representation
- vector space
- detection algorithm
- database management systems
- data management
- document classification
- relevant documents
- object detection
- data model
- text mining
- database applications
- semantic information
- query language
- automatic detection
- oracle database
- keywords
- information retrieval
- retrieval engine