Indexing Based on Topic Modeling and MATHML for Building Vietnamese Technical Document Retrieval Effectively.
Tuan Cao Xuan, Linh Bui Khanh, Hung Vo Trung, Ha Nguyen Thi Thu, Tinh Dao Thanh
Published in: ICCASA (2015)
Keyphrases
- document retrieval
- topic modeling
- information retrieval
- document indexing
- topic models
- text retrieval
- inverted index
- document collections
- language model
- latent dirichlet allocation
- text mining
- retrieval strategies
- document image retrieval
- cross language
- retrieval model
- text classification
- relevant documents
- relevance feedback
- xml retrieval
- prior knowledge
- language modeling framework
- database
- information retrieval systems
- collaborative filtering
- pseudo relevance feedback
- probabilistic model
- machine learning
- generative model
- query terms
- latent variables
- knn