ngPhylo: N-Gram Modeled Proteins with Substitution Matrices for Phylogenetic Analysis.
Brigitte Hofmeister, Brian R. King
Published in: BCB (2013)
Keyphrases
- n gram
- phylogenetic analysis
- molecular biology
- language model
- protein structure
- text classification
- protein sequences
- language independent
- sequence alignment
- language modelling
- language modeling
- part of speech
- amino acids
- variable length
- inside outside algorithm
- viterbi algorithm
- language specific
- neural network
- data sources