Identifying genomic signatures of N-gram nucleotide sequences to classify the chromatin states of broad histone track.
Kyung-Eun Lee, Hyun-Seok Park
Published in: IMCOM (2015)
Keyphrases
- n gram
- nucleotide sequences
- sequence data
- molecular biology
- dna sequences
- protein sequences
- language model
- genome scale
- text classification
- microarray
- language modeling
- transcription factors
- high throughput
- cell biology
- variable length
- computational biology
- sequence similarity
- web documents
- inside outside algorithm
- biological data
- binding sites
- protein structure
- probabilistic model