Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese.
Kurt Micallef
Albert Gatt
Marc Tanti
Lonneke van der Plas
Claudia Borg
Published in: CoRR (2022)
Keyphrases
data quality
data warehouse
test set
data cleaning
data transformation
quality management
database
natural language
email
quality assessment
data privacy
poor quality
data confidentiality