Text Document Clustering: Wordnet vs. TF-IDF vs. Word Embeddings.
Michal MarcinczukMateusz GniewkowskiTomasz WalkowiakMarcin BedkowskiPublished in: GWC (2021)
Keyphrases
- tf idf
- document clustering
- text documents
- wordnet
- word sense
- word sense disambiguation
- co occurrence
- term frequency
- text clustering
- semantic information
- natural language processing
- word pairs
- vector space model
- cosine similarity
- text mining
- keywords
- semantic relations
- document representation
- clustering algorithm
- knowledge base
- vector space
- information extraction
- semantic similarity
- clustering method
- topic models
- document collections
- text categorization
- information retrieval
- text classification
- low dimensional
- dimensionality reduction
- knowledge discovery
- k means