Improving Pre-trained Language Model Sensitivity via Mask Specific losses: A case study on Biomedical NER.
Micheal AbahoDanushka BollegalaGary LeemingDan JoyceIain E. BuchanPublished in: NAACL-HLT (2024)
Keyphrases
- language model
- pre trained
- language modeling
- probabilistic model
- document retrieval
- n gram
- information extraction
- retrieval model
- query expansion
- information retrieval
- speech recognition
- test collection
- ad hoc information retrieval
- named entity recognition
- conditional random fields
- context sensitive
- mixture model
- query terms
- text mining
- smoothing methods
- translation model
- training data
- natural language processing
- training examples
- pairwise
- high dimensional
- named entities
- learning process
- data mining
- generative model