On Extending NLP Techniques from the Categorical to the Latent Space: KL Divergence, Zipf's Law, and Similarity Search.
Adam HareYu ChenYinan LiuZhenming LiuChristopher G. BrintonPublished in: CoRR (2020)
Keyphrases
- similarity search
- high dimensional
- gaussian mixture
- probabilistic latent semantic analysis
- low dimensional
- distance function
- metric space
- high dimensional data
- similarity measure
- natural language processing
- latent variables
- query processing
- knn
- dimensionality reduction
- vector space
- expectation maximization
- gaussian process
- co occurrence
- em algorithm
- pattern recognition
- data sets
- data points
- euclidean distance
- latent dirichlet allocation