Login / Signup
Improving Language Models Trained with Translated Data via Continual Pre-Training and Dictionary Learning Analysis.
Sabri Boughorbel
Md. Rizwan Parvez
Majd Hawasly
Published in:
CoRR (2024)
Keyphrases
</>
language model
data sets
data analysis
language modeling
dictionary learning
n gram
training data
training set
probabilistic model
data points
image data
information retrieval
image segmentation
high dimensional
speech recognition
document retrieval