Semi-supervised clustering for de-duplication.
Shrinu KushagraShai Ben-DavidIhab F. IlyasPublished in: AISTATS (2019)
Keyphrases
- semi supervised clustering
- semi supervised
- metric learning
- unsupervised clustering
- pairwise constraints
- nonnegative matrix factorization
- background knowledge
- semi supervised learning
- semi supervised classification
- k means
- clustering algorithm
- unlabeled data
- document clustering
- labeled data
- machine learning
- distance metric
- euclidean distance
- text categorization
- support vector
- data sets
- hidden markov random fields