Automated protein sequence database classification. I. Integration of compositional similarity search, local similarity search, and multiple sequence alignment.
Jérôme GracyP. ArgosPublished in: Bioinform. (1998)
Keyphrases
- similarity search
- protein sequences
- multiple sequence alignment
- indexing techniques
- database
- remote homology detection
- secondary structure
- sequence databases
- biological sequences
- computational biology
- multiple alignment
- metric space
- indexing structure
- high dimensional
- amino acids
- query processing
- similarity measure
- sequence alignment
- feature vectors
- knn
- high dimensional data
- database systems
- protein structure
- machine learning
- memory efficient
- databases
- database management systems
- data management
- r tree
- data integration
- data model
- feature extraction
- multiple sequence alignments
- access methods
- pairwise
- protein structure prediction
- data structure
- mass spectra
- feature selection