Semi-supervised clustering for de-duplication.
Shrinu Kushagra, Shai Ben-David, Ihab F. Ilyas
Published in: CoRR (2018)
Keyphrases
- semi-supervised clustering
- semi-supervised
- metric learning
- background knowledge
- unsupervised clustering
- semi-supervised learning
- pairwise constraints
- semi-supervised classification
- nonnegative matrix factorization
- document clustering
- labeled data
- k-means
- unlabeled data
- hidden Markov random fields
- clustering algorithm
- pairwise
- machine learning
- supervised learning
- multi-class
- domain knowledge
- information retrieval