Data quality challenges with missing values and mixed types in joint sequence analysis.
Alina Lazar, Ling Jin, C. Anna Spurlock, Kesheng Wu, Alex Sim
Published in: IEEE BigData (2017)
Keyphrases
- missing values
- data quality
- sequence analysis
- data cleaning
- missing data
- data imputation
- sequence data
- data warehouse
- computational biology
- incomplete data
- real world
- high dimensional data
- protein sequences
- database
- data structure
- databases
- preprocessing
- data integration
- feature extraction
- data warehousing
- database systems