Login / Signup
Constructing a text corpus for inexact duplicate detection.
Jack G. Conrad
Cindy P. Schriber
Published in:
SIGIR (2004)
Keyphrases
</>
duplicate detection
text corpus
text corpora
text documents
record linkage
named entities
data cleaning
text mining
wikipedia articles
training corpus
text classification
wordnet
data sets
website
metadata
information retrieval
databases