On the importance of pre-training data volume for compact language models.
Vincent Micheli, Martin d'Hoffschmidt, François Fleuret
Published in: EMNLP (1) (2020)
Keyphrases
- language model
- training data
- language modeling
- n-gram
- probabilistic model
- query expansion
- speech recognition
- retrieval model
- document retrieval
- learning algorithm
- statistical language models
- context sensitive
- training set
- information retrieval
- decision trees
- test collection
- supervised learning
- ad hoc information retrieval
- vector space model
- smoothing methods
- document ranking
- language models for information retrieval
- document length
- pseudo relevance feedback
- term dependencies
- labeled data
- translation model
- relevance model
- word error rate
- classification accuracy
- information extraction