Cross-lingual Text Clustering in a Large System.
Nicole R. SchneiderJagan SankaranarayananHanan SametPublished in: NLPIR (2023)
Keyphrases
- cross lingual
- text clustering
- document clustering
- text classification
- text mining
- clustering algorithm
- machine translation
- document representation
- text data
- text categorization
- text documents
- language modeling
- document collections
- background knowledge
- clustering method
- k means
- latent semantic analysis
- natural language processing
- text collections
- data mining
- bag of words
- language model
- news articles
- vector space
- supervised learning
- knowledge representation
- vector space model
- pairwise
- information retrieval