Comparison of methods for the detection of outliers and associated biomarkers in mislabeled omics data.
Hongwei SunYuehua CuiHui WangHaixia LiuTong WangPublished in: BMC Bioinform. (2020)
Keyphrases
- data sets
- raw data
- database
- noisy data
- human experts
- image data
- statistical analysis
- original data
- statistical methods
- missing values
- data points
- probability distribution
- data mining techniques
- data processing
- data collection
- synthetic data
- significant improvement
- data analysis
- high quality
- training data
- data mining methods
- data quality
- machine learning
- spectral clustering
- decision trees
- high dimensional data
- end users
- statistical tests
- multiple sources
- detect outliers
- false positives
- detection algorithm
- knowledge discovery