CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes.
Genis ParraKeith BradnamIan KorfPublished in: Bioinform. (2007)
Keyphrases
- dna sequences
- genomic sequences
- protein coding regions
- regulatory elements
- coding regions
- sequenced genomes
- biological processes
- comparative genomics
- transcription factor binding sites
- binding sites
- genome annotation
- genomic data
- escherichia coli
- genome sequences
- sequence data
- essential genes
- gene clusters
- gene expression
- human genome
- transcription factors
- microarray data
- evolutionary history
- genome rearrangements
- metadata
- semantic annotation
- gene expression data
- gene expression profiles
- genome scale
- comparative analysis
- horizontal gene transfer
- protein families
- metabolic pathways
- gene sets
- motif discovery
- molecular biology
- protein protein interactions
- phylogenetic analysis
- statistically significant
- microarray
- data sets