Login / Signup
Unveiling the Spectrum of Data Contamination in Language Model: A Survey from Detection to Remediation.
Chunyuan Deng
Yilun Zhao
Yuzhao Heng
Yitong Li
Jiannan Cao
Xiangru Tang
Arman Cohan
Published in:
ACL (Findings) (2024)
Keyphrases
</>
language model
probabilistic model
uncertain data
n gram
information retrieval
language model for information retrieval
prior knowledge
maximum likelihood
document retrieval
language modeling
query specific
language modelling