Mining Document Collections to Facilitate Accurate Approximate Entity Matching.
Surajit ChaudhuriVenkatesh GantiDong XinPublished in: Proc. VLDB Endow. (2009)
Keyphrases
- document collections
- information retrieval systems
- document retrieval
- information retrieval
- test collection
- digital libraries
- text retrieval
- document clustering
- relevant documents
- text mining
- document representation
- data mining
- text collections
- text corpora
- scatter gather
- knowledge discovery
- text data
- ad hoc retrieval
- learning algorithm