Exploit Multilingual Language Model at Scale for ICD-10 Clinical Text Classification.
Stefano Silvestri, Francesco Gargiulo, Mario Ciampi, Giuseppe De Pietro
Published in: ISCC (2020)
Keyphrases
- context sensitive
- language model
- text classification
- language modeling
- n gram
- cross lingual
- language independent
- clinical diagnosis
- document retrieval
- medical records
- probabilistic model
- retrieval model
- information retrieval
- speech recognition
- bag of words
- text categorization
- test collection
- query expansion
- ad hoc information retrieval
- labeled data
- mixture model
- free text
- language modelling
- text mining
- cross language
- statistical language models
- query terms
- relevance model
- multi label
- machine learning
- feature selection
- dirichlet prior
- language model for information retrieval
- text documents
- knn
- digital libraries
- tf idf
- naive bayes
- statistical language modeling