Collecting legacy corpora from social science research for text mining evaluation.
Bei YuMin-Chun KuPublished in: ASIST (2010)
Keyphrases
- text mining
- natural language processing
- text corpora
- textual documents
- text data
- reverse engineering
- text classification
- text categorisation
- computational linguistics
- evaluation model
- textual data
- data collection
- social sciences
- evaluation criteria
- gold standard
- evaluation method
- document clustering
- text documents
- information extraction
- data sets