A simple algorithm for identifying integrons and gene cassettes in bacteria on next generation sequencing data.
Guan-Jie HuaChe-Lun HungChuan Yi TangHuiru ZhengPublished in: Int. J. Data Min. Bioinform. (2016)
Keyphrases
- learning algorithm
- data sets
- computational complexity
- data reduction
- preprocessing
- detection algorithm
- noisy data
- k means
- input data
- optimal solution
- missing data
- np hard
- probabilistic model
- clustering method
- knowledge discovery
- data points
- data sources
- data mining techniques
- search space
- data acquisition
- data structure
- objective function
- synthetic datasets