Training and Evaluation of a Multilingual Tokenizer for GPT-SW3.
Felix StollenwerkPublished in: CoRR (2023)
Keyphrases
- digital libraries
- online learning
- evaluation process
- training set
- supervised learning
- information retrieval
- gold standard
- evaluation methods
- evaluation model
- training data
- face recognition
- feature space
- semi supervised
- training examples
- test set
- evaluation metrics
- training process
- training algorithm
- information systems