CDSbank: taxonomy-aware extraction, selection, renaming and formatting of protein-coding DNA or amino acid sequences.
Bart HazesPublished in: BMC Bioinform. (2014)
Keyphrases
- amino acid sequences
- sequence analysis
- protein sequences
- protein structure
- amino acids
- secondary structure
- molecular biology
- protein function
- protein structure prediction
- tertiary structure
- computational biology
- predicting protein
- biological sequences
- sequence data
- protein functional
- rna sequences
- protein tertiary structure
- sequence databases
- spatial structure
- protein folding
- protein protein interactions
- automatic extraction
- gene prediction
- information extraction
- computational approaches
- motif discovery
- feature selection
- coarse grained
- computational methods
- natural language processing