On Clustering Algorithms: Applications in Word-Embedding Documents.
Israel Mendonça, Antoine Trouvé, Akira Fukuda, Kazuaki J. Murakami
Published in: J. Comput. (2019)
Keyphrases
- document clustering
- clustering algorithm
- word frequencies
- word spotting
- vector space
- document collections
- keywords
- word frequency
- text documents
- related words
- latent topics
- text clustering
- information retrieval
- natural language text
- information retrieval systems
- text corpus
- printed documents
- term frequency
- multiword
- cluster analysis
- web documents
- spoken documents
- xml documents
- related documents
- concept space
- document space
- co-occurrence
- training corpus
- relevant documents
- word pairs
- sentence level
- stop words
- page layout
- word co-occurrence
- word similarity
- cluster labels
- word recognition
- vector space model
- term weighting
- document representation
- data clustering
- clustering method
- linguistic information
- text corpora
- k-means
- sentence similarity
- n-gram
- information extraction
- wordnet