n-Gram characterization of genomic islands in bacterial genomes.
Gordana Pavlovic-LazeticNenad S. MiticMilos V. BeljanskiPublished in: Comput. Methods Programs Biomed. (2009)
Keyphrases
- n gram
- escherichia coli
- human genome
- genomic data
- sequence data
- genomic sequences
- language model
- comparative genomics
- genome sequences
- high throughput
- dna sequences
- language independent
- language modeling
- protein coding regions
- variable length
- text classification
- viterbi algorithm
- part of speech
- language modelling
- metabolic pathways
- character n grams
- statistical language modeling
- biological data
- web documents
- artificial intelligence
- machine learning
- databases