The M5nr: a novel non-redundant database containing protein sequences and annotations from multiple sources and associated tools.
Andreas Wilke, Travis Harrison, Jared Wilkening, Dawn Field, Elizabeth M. Glass, Nikos Kyrpides, Konstantinos Mavrommatis, Folker Meyer
Published in: BMC Bioinform. (2012)
Keyphrases
- multiple sources
- protein sequences
- database
- computational biology
- data from multiple sources
- databases
- sequence databases
- biological sequences
- secondary structure
- amino acids
- protein classification
- sequence analysis
- protein structure
- database systems
- metadata
- protein secondary structure
- remote homology detection
- protein structure and function
- multiple sequence alignment
- relational databases
- data sets
- data sources
- protein function
- domain specific
- amino acid sequences
- website
- experimentally determined
- machine learning