Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation.
Xuan-Phi Nguyen, Hongyu Gong, Yun Tang, Changhan Wang, Philipp Koehn, Shafiq R. Joty
Published in: ICLR (2022)
Keyphrases
- data sets
- clustering algorithm
- data analysis
- data collection
- data points
- data processing
- high quality
- high dimensional data
- raw data
- synthetic data
- unsupervised learning
- computer systems
- data mining tasks
- multidimensional data
- pairwise
- training data
- clustering analysis
- information bottleneck
- image data
- knowledge discovery
- data sources
- machine learning