A lower bound on the sample size needed to perform a significant frequent pattern mining task.
Stéphanie Jacquemont, François Jacquenet, Marc Sebban
Published in: Pattern Recognit. Lett. (2009)
Keyphrases
- sample size
- frequent pattern mining
- upper bound
- lower bound
- frequent patterns
- worst case
- pattern mining
- huge number
- model selection
- statistical power
- mining frequent patterns
- random sampling
- progressive sampling
- temporal pattern mining
- sample complexity
- pattern growth
- confidence intervals
- objective function
- frequent pattern discovery
- small sample
- itemsets
- frequent itemsets
- frequent itemset mining
- pac learning
- statistical hypothesis testing
- candidate generation
- data structure
- concept class
- np hard
- frequent pattern mining algorithms
- random sample
- mining algorithm
- computational complexity
- small number
- data mining
- variance reduction
- minimum support
- statistical significance
- concept classes
- vc dimension
- sequential pattern mining