Exploiting corpus-related ontologies for conceptualizing document corpora.
Hai-Tao ZhengCharles BorchertHong-Gee KimPublished in: J. Assoc. Inf. Sci. Technol. (2009)
Keyphrases
- document corpus
- text corpus
- text corpora
- text collections
- knowledge base
- specific domains
- hand crafted
- training corpus
- document level
- document collections
- retrieval systems
- news corpus
- natural language processing
- semantic web
- document retrieval
- document images
- web documents
- manually annotated
- semantic relationships
- statistical machine translation
- text classification
- information retrieval systems
- text mining
- lexical resources
- word frequency
- chinese english
- annotated corpus
- information retrieval
- wide coverage