Dynamic Data Sampler for Cross-Language Transfer Learning in Large Language Models.
Yudong Li, Yuhao Feng, Wen Zhou, Zhe Zhao, Linlin Shen, Cheng Hou, Xianxu Hou
Published in: ICASSP (2024)
Keyphrases
- language model
- transfer learning
- labeled data
- data sets
- data analysis
- knowledge discovery
- data sources
- data points
- cross-language
- n-gram
- text retrieval
- context-sensitive
- probabilistic model
- unlabeled data
- document collections
- test collection
- document retrieval
- language modeling
- cross-lingual
- information retrieval
- active learning
- semi-supervised
- training data
- search engine