A Novel Sequence-Based Feature for the Identification of DNA-Binding Sites in Proteins Using Jensen-Shannon Divergence.
Truong Khanh Linh DangCornelia MeckbachRebecca TackeStephan WaackMehmet GültasPublished in: Entropy (2016)
Keyphrases
- binding sites
- jensen shannon divergence
- dna binding
- protein families
- biological sequences
- protein protein
- sequence data
- sequence analysis
- sequence alignment
- dna sequences
- transcription factor binding sites
- selection criterion
- transcription factors
- gene expression
- motif discovery
- sequenced genomes
- cis regulatory
- information theory
- protein interaction
- influenza virus
- information theoretic
- genome sequences
- protein sequences
- statistical significance
- sequence databases
- feature selection
- coding regions
- feature set
- sequence similarity
- amino acid sequences
- mass spectrometry
- regulatory elements
- model selection
- genomic sequences
- rna sequences
- microarray
- biological processes
- protein structure
- mutual information
- protein protein interactions
- learning algorithm
- genome wide
- amino acids
- computational methods
- data mining