Unsupervised identification of redundant domain entries in InterPro database using clustering techniques.
Ahmet Süreyya RifaiogluTunca DoganTolga CanPublished in: BCB (2015)
Keyphrases
- database
- unsupervised learning
- databases
- information bottleneck
- domain specific
- clustering algorithm
- database systems
- relational databases
- k means
- unsupervised manner
- information theoretic
- training data
- agglomerative clustering
- unsupervised classification
- database management systems
- clustering method
- database applications
- supervised classification
- data clustering
- domain independent
- distance metric
- data management
- query language
- data model
- categorical data
- cross domain
- anomaly detection
- metadata