Login / Signup
Concerned with Data Contamination? Assessing Countermeasures in Code Language Model.
Jialun Cao
Wuqi Zhang
Shing-Chi Cheung
Published in:
CoRR (2024)
Keyphrases
</>
language model
countermeasures
information retrieval
speech recognition
keywords
data sources
probability distribution
knowledge discovery
n gram
data analysis
semi supervised
query expansion
document retrieval
language modeling