Mining the entire Protein DataBank for frequent spatially cohesive amino acid patterns.
Pieter Meysman, Cheng Zhou, Boris Cule, Bart Goethals, Kris Laukens
Published in: BioData Min. (2015)
Keyphrases
- amino acids
- frequent patterns
- mining frequent
- protein sequences
- closed patterns
- pattern mining
- protein function
- protein structure prediction
- interesting patterns
- mining algorithm
- protein structure
- protein folding
- sequential patterns
- amino acid sequences
- sequence databases
- frequent subgraphs
- secondary structure
- data mining techniques
- sequence alignment
- physico chemical
- frequently occurring
- tertiary structure
- data mining
- sequential pattern mining
- pattern discovery
- association rules
- physicochemical properties
- association rule mining
- itemsets
- graph data
- protein families
- contact map
- graph mining
- social network analysis
- sequence similarity