A systematic method for solving data imbalance in CRISPR off-target prediction tasks.
Zengrui Guan, Zhenran Jiang
Published in: Comput. Biol. Medicine (2024)
Keyphrases
- synthetic data
- input data
- statistical methods
- missing data
- noisy data
- objective function
- missing values
- test data
- cost function
- data sets
- data processing
- statistical analysis
- high quality
- computational cost
- data structure
- high accuracy
- prior information
- raw data
- combinatorial optimization
- data distribution
- pairwise
- high dimensional data
- data collection
- classification accuracy
- feature set
- prior knowledge
- segmentation method
- clustering method
- preprocessing
- dynamic programming
- original data
- support vector machine
- training data