An Effective Data Mining Technique for Classifying Unaligned Protein Sequences into Functional Families.
Patrick C. H. MaKeith C. C. ChanPublished in: CIT (2006)
Keyphrases
- protein sequences
- data mining techniques
- data warehouse
- data mining
- protein families
- secondary structure
- cluster analysis
- amino acids
- computational biology
- biological sequences
- association rules
- protein structure
- amino acid sequences
- data mining algorithms
- association rule mining
- protein secondary structure
- knowledge discovery
- pairwise
- metadata
- multiple sequence alignment
- protein structural
- protein structure and function
- data mining technology
- multi dimensional
- feature selection
- machine learning