Constructing Language Models from Online Forms to Aid Better Document Representation for More Effective Clustering.
Stephen Bradshaw, Colm O'Riordan, Daragh Bradshaw
Published in: IC3K (2017)
Keyphrases
- language model
- document representation
- language modeling
- document clustering
- vector space model
- n-gram
- document retrieval
- probabilistic model
- test collection
- information retrieval
- retrieval model
- bag of words
- query terms
- query expansion
- clustering algorithm
- relevance model
- document structure
- unsupervised learning
- data fusion
- clustering method
- text data
- cluster analysis
- semantic information
- text documents
- web documents
- document collections
- generative model
- web search
- k-means
- feature extraction
- computer vision