Efficient Domain Adaptation of Language Models via Adaptive Tokenization.
Vin SachidanandaJason S. KesslerYi'an LaiPublished in: SustaiNLP@EMNLP (2021)
Keyphrases
- language model
- domain adaptation
- language modeling
- n gram
- probabilistic model
- query expansion
- document retrieval
- retrieval model
- cross domain
- test collection
- relevance model
- information retrieval
- multiple sources
- semi supervised
- labeled data
- pseudo relevance feedback
- test data
- text categorization
- document classification
- text classification
- domain specific