An EM-based technique for approximating long-tailed data sets with PH distributions.
Alma RiskaVesselin DievEvgenia SmirniPublished in: Perform. Evaluation (2004)
Keyphrases
- data sets
- heavy tailed
- gaussian distribution
- heavy tails
- summary statistics
- probability distribution
- real world data sets
- benchmark data sets
- information retrieval
- real world
- real time
- data mining
- data sources
- input data
- high dimensional data
- databases
- statistical distributions
- synthetic and real world data sets
- exponential distributions