OCHADAI at SMM4H-2021 Task 5: Classifying self-reporting tweets on potential cases of COVID-19 by ensembling pre-trained language models.
Ying Luo, Lis Pereira, Ichiro Kobayashi
Published in: SMM4H@NAACL-HLT (2021)
Keyphrases
- language model
- pre-trained
- language modeling
- document retrieval
- information retrieval
- probabilistic model
- n-gram
- speech recognition
- query expansion
- statistical language models
- language modelling
- smoothing methods
- language models for information retrieval
- test collection
- training data
- training examples
- document ranking
- decision trees
- named entities
- small number
- kNN
- pairwise
- relevance model
- feature extraction
- machine learning
- data sets