How to Leverage a Multi-layered Transformer Language Model for Text Clustering: an Ensemble Approach.
Mira Ait SaadaFrançois RoleMohamed NadifPublished in: CIKM (2021)
Keyphrases
- multi layered
- language model
- text clustering
- document representation
- vector space model
- language modeling
- text mining
- n gram
- probabilistic model
- information retrieval
- document retrieval
- hierarchical clustering
- query expansion
- document clustering
- text categorization
- retrieval model
- clustering algorithm
- background knowledge
- text data
- text documents
- test collection
- k means
- wordnet
- query terms
- latent semantic indexing
- semantic relations
- user feedback
- text collections
- clustering quality
- metric learning
- natural language processing
- pairwise
- clustering method