Semi-Supervised Clustering of XML Documents: Getting the Most from Structural Information.
Eduardo Bezerra da Silva, Marta Mattoso, Geraldo Xexéo
Published in: ICDE Workshops (2006)
Keyphrases
- structural information
- semi supervised clustering
- xml documents
- semantic information
- semi supervised
- background knowledge
- pairwise constraints
- unsupervised clustering
- metric learning
- relational databases
- semi supervised learning
- xml data
- semi supervised classification
- data representation
- xml schema
- clustering algorithm
- nonnegative matrix factorization
- labeled data
- machine learning
- pairwise
- document clustering
- unlabeled data
- k means
- clustering method
- distance metric
- active learning
- supervised learning
- supervised classification
- database
- object recognition
- text mining
- logic programs