German Text Embedding Clustering Benchmark.
Silvan Wehrli, Bert Arnrich, Christopher Irrgang
Published in: KONVENS (2023)
Keyphrases
- k-means
- text clustering
- clustering algorithm
- text mining
- clustering method
- categorical data
- multidimensional scaling
- information retrieval
- self-organizing maps
- hierarchical clustering
- text documents
- vector space
- document clustering
- data mining
- short text
- free text
- keywords
- data sets
- spectral clustering
- text retrieval
- data clustering
- text data
- semantic information
- data points
- nonlinear dimensionality reduction
- syntactic categories