Graph-based Topic Extraction from Vector Embeddings of Text Documents: Application to a Corpus of News Articles.
M. Tarik AltuncuSophia N. YalirakiMauricio BarahonaPublished in: CoRR (2020)
Keyphrases
- text documents
- news articles
- text mining
- document classification
- text data
- document clustering
- text classification
- blog entries
- text collections
- text categorization
- information extraction
- text analysis
- keywords
- topic models
- text corpus
- text corpora
- wordnet
- textual information
- topic modeling
- bag of words
- news stories
- vector space
- named entities
- neural network
- learning algorithm
- latent dirichlet allocation
- feature vectors
- knn
- semi supervised