Topic Mining based on Word Posterior Probability in Spoken Document.
Lei Zhang, Guo-xing Chen, Xuezhi Xiang, Jing-xin Chang
Published in: J. Softw. (2011)
Keyphrases
- posterior probability
- latent topics
- spoken documents
- topic models
- Bayesian networks
- related documents
- word co-occurrence
- generative model
- statistical topic models
- Bayesian framework
- word frequency
- co-occurrence
- concept space
- spoken document retrieval
- probability density function
- probabilistic model
- probability distribution
- keywords
- latent Dirichlet allocation
- document collections
- text mining
- information retrieval
- automatic summarization
- conditional probabilities
- prior probabilities
- document images
- data mining techniques
- text documents
- speech recognition
- broadcast news
- document clustering
- multi-document summarization
- principal component analysis
- knowledge discovery
- feature selection