DPCfam: Unsupervised protein family classification by Density Peak Clustering of large sequence datasets.
Elena Tea RussoFederico BaroneAlex BatemanStefano CozziniMarco PuntaAlessandro LaioPublished in: PLoS Comput. Biol. (2022)
Keyphrases
- unsupervised learning
- protein families
- unsupervised classification
- unsupervised clustering
- supervised classification
- protein classification
- supervised learning
- unsupervised feature selection
- machine learning
- gene expression profiles
- protein sequences
- feature selection
- dna sequences
- high precision
- self organizing maps
- text classification
- semi supervised
- k means
- document clustering
- high dimensionality
- machine learning methods
- clustering analysis
- model selection
- restricted boltzmann machine
- clustering algorithm