A novel statistical method for decontaminating T-cell receptor sequencing data.
Ruoxing LiMehmet AltanAlexandre ReubenRuitao LinJohn V. HeymachHai TranRunzhe ChenLatasha LittleShawna HubertJianjun ZhangZiyi LiPublished in: Briefings Bioinform. (2023)
Keyphrases
- synthetic data
- statistical methods
- data sets
- statistical information
- input data
- prior knowledge
- original data
- data sources
- preprocessing
- data collection
- data processing
- significant improvement
- image data
- high accuracy
- detection method
- missing data
- high precision
- cost function
- similarity measure
- data analysis
- high quality
- statistical model
- contingency tables
- correlation analysis
- data quality
- noisy data
- prior information
- model selection
- pairwise
- database
- support vector machine
- data structure
- data points
- statistical analysis
- feature extraction
- raw data
- clustering algorithm
- xml documents
- neural network