Caveat Emptor, Computational Social Science: Large-Scale Missing Data in a Widely-Published Reddit Corpus.
Devin GaffneyJ. Nathan MatiasPublished in: CoRR (2018)
Keyphrases
- missing data
- social sciences
- missing values
- structure from motion
- motion segmentation
- low rank
- matrix factorization
- imprecise data
- computer science
- digital government
- social scientists
- real world
- incomplete data
- multiple imputation
- digital archiving
- data imputation
- diverse fields
- dirichlet process mixture models
- data mining
- databases
- perfect phylogeny