Alignment-Free Sequence Comparison over Hadoop for Computational Biology.
Giuseppe Cattaneo, Umberto Ferraro Petrillo, Raffaele Giancarlo, Gianluca Roscigno
Published in: ICPP Workshops (2015)
Keyphrases
- computational biology
- multiple sequence alignment
- sequence analysis
- protein sequences
- biological sequences
- multiple alignment
- machine learning
- sequence alignment
- molecular biology
- natural language processing
- open source
- multiple sequence alignments
- string kernels
- biological processes
- cloud computing
- phylogenetic trees
- amino acids
- knowledge discovery
- RNA sequences
- global alignment
- data sets
- sequence databases
- transcription factors
- big data
- protein structure prediction
- high dimensional