Towards a molecules production from DNA sequences based on clustering by 3D cellular automata approach and n-grams technique.
Fatima KabliReda Mohamed HamouAbdelmalek AminePublished in: AICCSA (2015)
Keyphrases
- cellular automata
- dna sequences
- n gram
- cellular automaton
- language model
- text classification
- cellular automata model
- human genome
- variable length
- bag of words
- tandem repeats
- dna computing
- sequence patterns
- binding sites
- coding regions
- lattice gas
- discrete dynamical systems
- biological sequences
- document clustering
- cellular automaton model
- chaotic dynamics
- transcription factors
- character n grams
- motif discovery
- part of speech
- text mining
- genomic sequences
- data analysis
- data mining tasks
- web documents
- knowledge discovery