A Benchmark for Text Expansion: Datasets, Metrics, and Baselines.
Yi Chen, Haiyun Jiang, Wei Bi, Rui Wang, Longyue Wang, Shuming Shi, Ruifeng Xu
Published in: CoRR (2023)
Keyphrases
- database
- text data
- information retrieval
- text retrieval
- free text
- data sets
- text documents
- text collections
- text mining
- real world
- textual data
- document level
- text classification
- pattern matching
- keywords
- similarity measure
- automatically extracted
- quality metrics
- string matching
- document analysis
- similarity metrics
- uci machine learning repository