Automated family-based naming of small RNAs for next generation sequencing data using a modified MD5-digest algorithm
Guodong Liu, Zhihua Li, Yuefeng Lin, Bino John
Published in: CoRR (2012)
Keyphrases
- data sets
- input data
- noisy data
- training data
- data reduction
- synthetic datasets
- data structure
- worst case
- preprocessing
- detection algorithm
- k means
- small number
- dynamic programming
- np hard
- data analysis
- clustering analysis
- data points
- segmentation algorithm
- expectation maximization
- special case
- objective function
- learning algorithm
- simulated annealing
- statistical methods
- computational complexity
- similarity measure