Discovering Coherent Topics from Urdu Text: A Comparative Study of Statistical Models, Clustering Techniques and Word Embedding.
Mubashar Mustafa, Feng Zeng, Usama Manzoor, Lin Meng
Published in: ICICT (2023)
Keyphrases
- statistical models
- statistical model
- keywords
- word counts
- sentence level
- sentiment analysis
- statistical modeling
- topic detection
- automatically discovering
- parameter estimation
- clustering algorithm
- stop words
- information retrieval
- translation model
- k means
- text documents
- text data
- keyword extraction
- latent topics
- key concepts
- document clustering
- noun phrases
- text mining
- n gram
- natural language text
- text retrieval
- bayesian networks
- word pairs
- language identification
- newspaper articles
- text corpora
- text collections
- writing style
- probabilistic model
- syntactic analysis
- topic models
- high dimensional data
- bayesian models
- document collections
- multi lingual
- graphical models
- exponential family
- information extraction