Language Modeling by Clustering with Word Embeddings for Text Readability Assessment.
Miriam Cha, Youngjune Gwon, H. T. Kung
Published in: CIKM (2017)
Keyphrases
- language modeling
- language model
- n-gram
- information retrieval
- term weighting
- retrieval model
- statistical language modeling
- translation model
- word segmentation
- keywords
- text clustering
- probabilistic model
- clustering algorithm
- sentence level
- cross-lingual
- multiword
- k-means
- text retrieval
- query expansion
- relevance model
- vector space
- text classification
- document level
- document retrieval
- high dimensional data
- word level
- vector space model
- statistical machine translation
- data points
- dimensionality reduction
- text documents
- multimedia
- text mining
- low dimensional
- web documents
- test collection
- document clustering
- co-occurrence
- word pairs
- high dimensional
- machine translation system