Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose.
Fred MorstatterJürgen PfefferHuan LiuKathleen M. CarleyPublished in: ICWSM (2013)
Keyphrases
- data sets
- data collection
- social networks
- training data
- high quality
- synthetic data
- real time
- data analysis
- data structure
- database
- knowledge discovery
- data processing
- social media data
- online social networks
- data mining techniques
- social media
- statistical analysis
- image data
- sample size
- raw data
- data quality
- data samples
- prior knowledge