Data Integration via Constrained Clustering: An Application to Enzyme Clustering.
Elisa Boari de Lima, Raquel Cardoso de Melo Minardi, Wagner Meira Jr., Mohammed Javeed Zaki
Published in: SDM (2011)
Keyphrases
- data integration
- constrained clustering
- duplicate detection
- data cleaning
- instance-level constraints
- clustering method
- k-means
- clustering algorithm
- data exchange
- data model
- hierarchical clustering
- data management
- databases
- spectral clustering
- data sources
- semi-supervised
- pairwise constraints
- data warehouse
- data warehousing
- data clustering
- data extraction
- business intelligence
- constraint satisfaction
- cluster ensemble
- schema mappings
- biological data
- document clustering
- similarity measure
- cluster analysis
- linked data
- record linkage
- data processing
- query language
- semi-supervised clustering
- web pages
- end users
- data sets