Detecting Compositionality of Multi-Word Expressions using Nearest Neighbours in Vector Space Models.
Douwe KielaStephen ClarkPublished in: EMNLP (2013)
Keyphrases
- vector space model
- nearest neighbour
- multiword
- language model
- document representation
- text clustering
- information retrieval
- vector space
- context sensitive
- document clustering
- latent semantic indexing
- tf idf
- semantic similarity
- decision trees
- knn
- web documents
- euclidean distance
- retrieval model
- training set
- neural network
- pattern recognition
- semantic information
- document retrieval
- retrieval systems
- part of speech
- k nearest neighbor
- bayesian networks
- probabilistic model
- support vector machine