Impact of Tokenization on Language Models: An Analysis for Turkish.
Cagri Toraman
Eyup Halit Yilmaz
Furkan Sahinuç
Oguzhan Ozcelik
Published in:
ACM Trans. Asian Low Resour. Lang. Inf. Process. (2023)
Keyphrases
language model
language modeling
probabilistic model
n gram
query expansion
document retrieval
retrieval model
information retrieval
speech recognition
context sensitive
statistical language models
clustering algorithm
bayesian networks
text mining
language modelling