Biased Embeddings from Wild Data: Measuring, Understanding and Removing.
Adam Sutton, Thomas Lansdall-Welfare, Nello Cristianini
Published in: IDA (2018)
Keyphrases
- data sets
- training data
- high quality
- synthetic data
- database
- data structure
- data sources
- raw data
- data analysis
- data processing
- computer systems
- historical data
- complex data
- vector space
- data acquisition
- application domains
- statistical analysis
- data mining techniques
- data mining algorithms
- image data
- data distribution
- test data
- probability distribution
- clustering algorithm
- feature selection
- noisy data
- machine learning
- data mining
- highly correlated
- survey data