Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations.
Wenjie MoJiashu XuQin LiuJiongxiao WangJun YanChaowei XiaoMuhao ChenPublished in: CoRR (2023)
Keyphrases
- black box
- language model
- integration testing
- test cases
- language modeling
- n gram
- black boxes
- white box
- probabilistic model
- document retrieval
- speech recognition
- white box testing
- information retrieval
- retrieval model
- language modelling
- statistical language models
- vector space model
- test collection
- ad hoc information retrieval
- query expansion
- smoothing methods
- software testing
- term dependencies
- context sensitive
- query terms
- document ranking
- test data
- pseudo relevance feedback
- translation model
- tf idf
- vector space
- language models for information retrieval
- language model for information retrieval
- machine learning