Is the Sample Good Enough? Comparing Data from Twitter's Streaming API with Twitter's Firehose.
Fred MorstatterJürgen PfefferHuan LiuKathleen M. CarleyPublished in: CoRR (2013)
Keyphrases
- social media
- data sets
- data sources
- data samples
- synthetic data
- data analysis
- computer systems
- data processing
- raw data
- test data
- social media data
- data collection
- social networking
- statistical analysis
- open source
- image data
- prior knowledge
- training data
- social networks
- streaming data
- social media platforms
- original data
- probability distribution
- data streams
- data mining