De-Identification of Clinical Notes Using Contextualized Language Models and a Token Classifier.
Joaquim SantosHenrique D. P. dos SantosFábio TabalipaRenata VieiraPublished in: BRACIS (2) (2021)
Keyphrases
- language model
- language modeling
- probabilistic model
- speech recognition
- n gram
- document retrieval
- information retrieval
- query expansion
- language modelling
- training data
- retrieval model
- ad hoc information retrieval
- language model for information retrieval
- statistical language models
- context sensitive
- support vector machine
- relevance model
- training set
- feature selection
- vector space model
- pseudo relevance feedback
- document ranking
- smoothing methods
- document length
- feature space
- language models for information retrieval
- error rate
- image classification
- decision trees