T4SEfinder: a bioinformatics tool for genome-scale prediction of bacterial type IV secreted effectors using pre-trained protein language model.
Yumeng Zhang, Yangming Zhang, Yi Xiong, Hui Wang, Zixin Deng, Jiangning Song, Hong-Yu Ou
Published in: Briefings Bioinform. (2022)
Keyphrases
- language model
- protein structure prediction
- sequence similarity
- genome scale
- pre-trained
- computational biology
- systems biology
- protein sequences
- protein function
- probabilistic model
- speech recognition
- information retrieval
- computational tools
- molecular biology
- protein structure
- amino acids
- protein interaction
- machine learning
- metabolic pathways
- DNA binding
- protein-protein interactions
- graph theory
- protein folding
- training examples
- training data
- secondary structure
- high throughput
- semi-supervised
- knowledge discovery
- data analysis
- data mining
- biological data