Beyond Volume: The Impact of Complex Healthcare Data on the Machine Learning Pipeline.
Keith FeldmanLouis FaustXian WuChao HuangNitesh V. ChawlaPublished in: CoRR (2017)
Keyphrases
- machine learning
- data sets
- synthetic data
- data collection
- complex data
- training data
- data structure
- data analysis
- knowledge discovery
- information systems
- raw data
- data processing
- databases
- high dimensional data
- data sources
- high quality
- data mining
- data management
- knowledge acquisition
- information extraction
- computer systems
- statistical analysis
- information technology
- domain experts
- sensor data
- experimental data