Token-Mol 1.0: Tokenized drug design with large language model.
Jike Wang, Rui Qin, Mingyang Wang, Meijing Fang, Yangyang Zhang, Yuchen Zhu, Qun Su, Qiaolin Gou, Chao Shen, Odin Zhang, Zhenxing Wu, Dejun Jiang, Xujun Zhang, Huifeng Zhao, Xiaozhe Wan, Zhourui Wu, Liwei Liu, Yu Kang, Chang-Yu Hsieh, Tingjun Hou
Published in: CoRR (2024)
Keyphrases
- language model
- drug design
- language modeling
- n-gram
- probabilistic model
- information retrieval
- drug discovery
- protein-protein interactions
- protein structure prediction
- query expansion
- retrieval model
- document retrieval
- test collection
- quantitative structure activity
- smoothing methods
- ad hoc information retrieval
- context-sensitive
- mixture model
- query terms
- machine learning
- natural language processing
- data analysis
- translation model
- decision trees
- social networks