New Textual Corpora for Serbian Language Modeling.
Mihailo Skoric, Nikola Jankovic
Published in: CoRR (2024)
Keyphrases
- language modeling
- language model
- comparable corpora
- query expansion
- information retrieval
- cross lingual
- retrieval model
- probabilistic model
- n gram
- statistical machine translation
- parallel corpus
- text classification
- natural language
- natural language processing
- information retrieval systems
- keywords
- metadata
- high dimensional
- finite state transducers
- translation model
- learning algorithm
- text mining
- principal component analysis
- machine learning
- feature vectors
- statistical language modeling