Enriching Documents with Examples: A Corpus Mining Approach.
Jinhan KimSanghoon LeeSeung-won HwangSunghun KimPublished in: ACM Trans. Inf. Syst. (2013)
Keyphrases
- newspaper articles
- person names
- text mining
- document collections
- text corpora
- text documents
- word frequencies
- document level
- multiword
- information retrieval
- key concepts
- knowledge discovery
- web documents
- text corpus
- expert finding
- xml documents
- web mining
- similar documents
- text data
- training corpus
- information retrieval systems
- document classification
- pattern mining
- document retrieval
- data mining techniques
- document clustering
- co occurrence
- topic segmentation
- sentence level
- training documents
- document corpus
- metadata
- data mining
- statistical machine translation
- word pairs
- document representation
- web data
- itemsets
- relevant documents
- retrieval systems
- user queries