Cloud-based MOTIFSIM: Detecting Similarity in Large DNA Motif Data Sets.
Ngoc Tam L. TranChun-Hsi HuangPublished in: J. Comput. Biol. (2017)
Keyphrases
- data sets
- dna sequences
- binding sites
- motif discovery
- similarity measure
- biological sequences
- cloud computing
- dna computing
- sequence analysis
- benchmark data sets
- distance measure
- real world
- euclidean distance
- similarity function
- semantic similarity
- real world data sets
- training set
- web browser
- data streams
- human genome
- database
- decision trees
- training data
- similarity measurement
- variable length
- synthetic data
- high dimensional data
- data management