Rule-based word clustering for document metadata extraction.
Hui HanEren ManavogluHongyuan ZhaKostas TsioutsiouliklisC. Lee GilesXiangmin ZhangPublished in: SAC (2005)
Keyphrases
- metadata extraction
- document clustering
- clustering algorithm
- clustering method
- digital libraries
- keywords
- information retrieval systems
- retrieval systems
- web documents
- databases
- co occurrence
- document images
- metadata
- information retrieval
- text mining
- document retrieval
- text documents
- structured documents
- machine learning