Automatic thesaurus generation for Chinese documents.
Yuen-Hsien TsengPublished in: J. Assoc. Inf. Sci. Technol. (2002)
Keyphrases
- information retrieval systems
- document collections
- information retrieval
- keywords
- xml documents
- document indexing
- metadata
- keyword extraction
- digital libraries
- document retrieval
- latent semantic analysis
- vector space model
- document clustering
- semi automatic
- web documents
- query expansion
- domain specific
- chinese text
- controlled vocabulary
- concept space
- multi document summarization
- retrieval systems
- word segmentation
- document classification
- text retrieval
- electronic documents
- natural language processing
- lexical chains
- latent semantic
- web pages