TCM-GPT: Efficient Pre-training of Large Language Models for Domain Adaptation in Traditional Chinese Medicine.
Guoxing Yang, Jianyu Shi, Zan Wang, Xiaohong Liu, Guangyu Wang
Published in: CoRR (2023)
Keyphrases
- language model
- traditional Chinese medicine
- domain adaptation
- language modeling
- probabilistic model
- document retrieval
- n-gram
- retrieval model
- information retrieval
- cross-domain
- test collection
- data mining
- query expansion
- labeled data
- text mining
- active learning
- semi-supervised learning
- information retrieval systems
- semi-supervised
- cross-lingual
- target domain
- data analysis