Unbiased organism-agnostic and highly sensitive signal peptide predictor with deep protein language model.
Junbo ShenQinze YuShenyang ChenQingxiong TanJingcheng LiYu LiPublished in: CoRR (2023)
Keyphrases
- language model
- amino acids
- tandem mass spectra
- mass spectrometry
- mass spectra
- language modeling
- n gram
- information retrieval
- document retrieval
- tandem mass spectrometry
- query expansion
- speech recognition
- protein sequences
- probabilistic model
- ad hoc information retrieval
- ms ms
- statistical language models
- retrieval model
- test collection
- high throughput
- language modelling
- mixture model
- context sensitive
- smoothing methods
- language models for information retrieval
- protein structure
- query terms
- document ranking
- pseudo relevance feedback
- word clouds
- mhc class ii
- language model for information retrieval
- word error rate
- document length
- statistical machine translation
- translation model
- drug discovery
- relevance model
- biological systems
- similarity search
- high dimensional