Unveiling the semantic structure of text documents using paragraph-aware Topic Models.
Simón Roca-SoteloJerónimo Arenas-GarcíaPublished in: CoRR (2018)
Keyphrases
- text documents
- semantic structure
- topic models
- semantic relations
- latent semantic indexing
- topic modeling
- latent dirichlet allocation
- document representation
- text mining
- co occurrence
- news articles
- probabilistic model
- generative model
- text data
- latent topics
- document clustering
- text corpora
- text categorization
- wordnet
- knowledge base
- computer vision
- natural language processing
- search engine