OntoDataClean: Ontology-Based Integration and Preprocessing of Distributed Data.
David Pérez-ReyAlberto AnguitaJosé CrespoPublished in: ISBMDA (2006)
Keyphrases
- distributed data
- preprocessing
- integrating heterogeneous
- data sharing
- data mining algorithms
- semantically heterogeneous
- distributed data mining
- data integration
- data distribution
- communication cost
- databases
- privacy concerns
- rare events
- feature extraction
- file system
- heterogeneous information systems
- database systems
- index structure
- data structure