Does the Geometry of Word Embeddings Help Document Classification? A Case Study on Persistent Homology Based Representations.
Paul MichelAbhilasha RavichanderShruti RijhwaniPublished in: CoRR (2017)
Keyphrases
- document classification
- persistent homology
- topological features
- morse theory
- text classification
- text categorization
- text mining
- web documents
- classification algorithm
- n gram
- text documents
- computational geometry
- vector space
- co occurrence
- keywords
- link prediction
- feature selection
- dimensionality reduction
- knowledge discovery
- naive bayes
- data sets
- classification accuracy
- learning algorithm