Distinct Sampling on Streaming Data with Near-Duplicates.
Jiecao ChenQin ZhangPublished in: CoRR (2018)
Keyphrases
- data mining
- streaming data
- data streams
- data stream mining
- concept drift
- skewed data
- sliding window
- data distribution
- stream mining
- evolving data streams
- anomaly detection
- massive data streams
- machine learning
- data streaming
- data analysis
- stream data
- stream processing
- continuous queries
- non stationary
- pattern recognition
- incoming data
- database
- text mining
- image analysis
- image processing
- data sets
- concept drifting data streams