A machine learning strategy to identify candidate binding sites in human protein-coding sequence.
Thomas A. Down, Bernard Leong, Tim J. P. Hubbard
Published in: BMC Bioinform. (2006)
Keyphrases
- learning strategies
- binding sites
- sequence alignment
- transcription factor binding sites
- dna binding
- protein interaction
- influenza virus
- protein families
- coding regions
- biological sequences
- protein protein
- dna sequences
- gene expression levels
- genome sequences
- gene expression
- transcription factors
- sequence data
- regulatory elements
- sequence analysis
- online learning
- drosophila melanogaster
- active learning
- motif discovery
- protein sequences
- statistical significance
- genome wide
- pairwise
- predicting protein
- high throughput
- sequence databases
- sequence similarity
- protein protein interactions
- amino acids
- protein structure
- protein function
- genomic sequences
- genomic data
- regulatory networks
- learning process
- biological processes
- human genome
- text mining
- training data