Maximally Consistent Sampling and the Jaccard Index of Probability Distributions.
Ryan MoultonYunjiang JiangPublished in: ICDM Workshops (2018)
Keyphrases
- probability distribution
- similarity measure
- random variables
- similarity metric
- sampling strategies
- sampling algorithm
- database
- index structure
- sample size
- bayesian networks
- probabilistic model
- decision trees
- parameter space
- edit distance
- random sampling
- information retrieval
- globally optimal
- machine learning
- sampling methods
- cosine similarity
- sampled data
- data mining