Annotated Dataset Creation through General Purpose Language Models for non-English Medical NLP.
Johann FreiFrank KramerPublished in: CoRR (2022)
Keyphrases
- language model
- general purpose
- cross language retrieval
- language modeling
- natural language
- machine translation
- n gram
- document retrieval
- statistical machine translation
- probabilistic model
- retrieval model
- natural language processing
- speech recognition
- information retrieval
- cross lingual
- multiword
- query expansion
- language modelling
- cross language
- smoothing methods
- test collection
- statistical language models
- information extraction
- question answering
- context sensitive
- ad hoc information retrieval
- vector space model
- hand crafted
- text mining
- language models for information retrieval
- pseudo relevance feedback
- bayesian networks
- translation model
- relevance model
- cross language information retrieval
- free text
- wordnet
- out of vocabulary
- language model for information retrieval
- hidden markov models
- statistical language modeling
- web search
- text categorization
- bag of words
- query terms
- query translation
- retrieval effectiveness
- document ranking
- word sense disambiguation
- linguistic features
- part of speech
- language processing