The Clustering of Author's Texts of English Fiction in the Vector Space of Semantic Fields
Bohdan M. PavlyshenkoPublished in: CoRR (2012)
Keyphrases
- vector space
- natural language
- linguistic analysis
- cosine similarity
- clustering algorithm
- brazilian portuguese
- vector space model
- retrieval model
- feature vectors
- k means
- distance measure
- covariance matrices
- clustering method
- latent semantic indexing
- document space
- euclidean space
- similarity search
- multidimensional scaling
- computer vision
- concept space
- rhetorical structure theory
- document clustering
- low dimensional
- natural language text
- word sense
- data points
- latent semantic
- machine translation
- information extraction
- high dimensional
- data analysis
- keywords
- information retrieval
- database