NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly.
Jamison M. McCorrisonPratap VenepallyIndresh SinghDerrick E. FoutsRoger S. LaskenBarbara A. MethéPublished in: BMC Bioinform. (2014)
Keyphrases
- data reduction
- preprocessing
- data compression
- genomic sequences
- feature selection
- classification rules
- representative subset
- data analysis
- knowledge discovery
- singular value decomposition
- classification accuracy
- instance selection
- model selection
- data mining
- database
- machine learning
- rough set theory
- feature vectors
- assembly process
- rna sequences
- clustering algorithm
- feature extraction
- input data
- bayesian networks
- data quality
- learning algorithm
- fuzzy logic
- databases
- pattern recognition
- association rules
- data sets