Does the Geometry of Word Embeddings Help Document Classification? A Case Study on Persistent Homology-Based Representations.
Paul MichelAbhilasha RavichanderShruti RijhwaniPublished in: Rep4NLP@ACL (2017)
Keyphrases
- document classification
- persistent homology
- topological features
- morse theory
- text classification
- text categorization
- text documents
- text mining
- classification algorithm
- web documents
- co occurrence
- n gram
- keywords
- neural network
- machine learning
- wordnet
- k nearest neighbor
- support vector machine
- probabilistic model
- information retrieval
- geometric structure
- computational geometry
- data mining
- data sets