Language Modeling by Clustering with Word Embeddings for Text Readability Assessment.
Miriam ChaYoungjune GwonH. T. KungPublished in: CoRR (2017)
Keyphrases
- language modeling
- language model
- n gram
- information retrieval
- term weighting
- translation model
- retrieval model
- multiword
- word segmentation
- cross lingual
- query expansion
- statistical language modeling
- probabilistic model
- text retrieval
- text classification
- clustering algorithm
- word level
- text mining
- document level
- text clustering
- word pairs
- sentence level
- high dimensional data
- machine translation system
- search engine
- k means
- text documents
- vector space
- keywords
- co occurrence
- dimensionality reduction
- language independent
- term frequency
- document clustering
- relevance model
- document retrieval
- text corpora
- mixture model
- information retrieval systems
- cross language
- data points
- statistical machine translation
- digital libraries
- semantic relations