MP4: a machine learning based classification tool for prediction and functional annotation of pathogenic proteins from metagenomic and genomic datasets.
Ankit GuptaAditya S. MalweGopal N. SrivastavaParikshit ThoudamKeshav HibareVineet K. SharmaPublished in: BMC Bioinform. (2022)
Keyphrases
- machine learning
- protein classification
- protein function prediction
- sequence data
- decision trees
- text classification
- machine learning methods
- benchmark datasets
- active learning
- uci machine learning repository
- feature selection
- protein families
- sequence analysis
- machine learning algorithms
- supervised learning
- support vector machine
- protein sequences
- predicting protein
- gene products
- gene prediction
- genome sequencing
- predictive modeling
- learning algorithm
- image classification
- dna binding
- computational biology
- genomic sequences
- protein structure
- high throughput
- feature space
- drug design
- gene function
- computational approaches
- data mining
- amino acids
- subcellular localization