Designing Data: Proactive Data Collection and Iteration for Machine Learning.
Aspen Hopkins, Fred Hohman, Luca Zappella, Xavier Suau Cuadros, Dominik Moritz
Published in: CoRR (2023)
Keyphrases
- data collection
- data sets
- machine learning
- data analysis
- collected data
- knowledge discovery
- data processing
- raw data
- original data
- database
- pattern recognition
- high quality
- information extraction
- small number
- natural language processing
- data acquisition
- big data
- sensor data
- synthetic data
- data quality
- collecting data
- statistical methods
- high dimensional data
- model selection
- input data
- data mining techniques
- image data
- xml documents
- data structure
- website
- metadata
- neural network