MEDVOC: Vocabulary Adaptation for Fine-tuning Pre-trained Language Models on Medical Text Summarization.
Gunjan BaldeSoumyadeep RoyMainack MondalNiloy GangulyPublished in: CoRR (2024)
Keyphrases
- fine tuning
- language model
- text summarization
- pre trained
- query expansion
- language modeling
- out of vocabulary
- named entity recognition
- spoken term detection
- natural language processing
- information extraction
- training data
- document retrieval
- speech recognition
- n gram
- probabilistic model
- retrieval model
- information retrieval
- question answering
- multi document summarization
- test collection
- training examples
- context sensitive
- control signals
- document structure
- relevant documents
- relevance model
- learning algorithm
- named entities
- conditional random fields
- labeled data
- keywords
- data sets