Improving Document Clustering for Short Texts by Long Documents via a Dirichlet Multinomial Allocation Model.
Yingying Yan
Ruizhang Huang
Can Ma
Liyang Xu
Zhiyuan Ding
Rui Wang
Ting Huang
Bowei Liu
Published in:
APWeb/WAIM (1) (2017)
Keyphrases
document clustering
EM algorithm
text documents
probabilistic model
document collections
document representation
vector space model
text classification
TF-IDF
search engine
clustering algorithm
similarity measure
keywords
information extraction
topic detection
document clusters