Improving Semantic Coherence of Gujarati Text Topic Model Using Inflectional Forms Reduction and Single-letter Words Removal.
Uttam ChauhanApurva ShahPublished in: ACM Trans. Asian Low Resour. Lang. Inf. Process. (2021)
Keyphrases
- topic models
- word pairs
- text documents
- text corpora
- text mining
- topic modeling
- latent topics
- baseline models
- latent dirichlet allocation
- semantic similarity
- topic tracking
- co occurrence
- probabilistic topic models
- probabilistic model
- text analysis
- latent variables
- keywords
- semantic relations
- information retrieval
- semantic information
- gibbs sampling
- word forms
- semantic relationships
- tf idf
- text data
- generative model
- semantic network
- lda model
- part of speech
- word sense
- natural language
- latent semantic analysis
- relevance model
- text classification
- natural language processing
- information extraction
- syntactic categories