Similarity Word-Sequence Kernels for Sentence Clustering.
Jesús Andrés-FerrerGermán Sanchis-TrillesFrancisco CasacubertaPublished in: SSPR/SPR (2010)
Keyphrases
- sentence similarity
- similarity function
- clustering algorithm
- sentence level
- similarity measure
- similarity calculation
- word level
- similarity scores
- distance metric
- syntactic analysis
- similarity matrices
- similar objects
- cosine similarity
- word similarity
- clustering method
- semantic similarity
- dissimilarity measure
- proximity measures
- kernel methods
- noun phrases
- k means
- text representation
- syntactic information
- word pairs
- text corpus
- natural language
- support vector
- similarity matrix
- natural language text
- distance function
- word frequency
- intra cluster
- n gram
- distance measure
- lexico syntactic
- syntactic categories
- word order
- stop words
- data points