Benchmarking for Public Health Surveillance tasks on Social Media with a Domain-Specific Pretrained Language Model.
Usman NaseemByoung-Chan LeeMatloob KhushiJinman KimAdam G. DunnPublished in: CoRR (2022)
Keyphrases
- language model
- public health
- domain specific
- social media
- language modeling
- n gram
- outbreak detection
- probabilistic model
- health related
- speech recognition
- document retrieval
- language modelling
- information retrieval
- disease outbreaks
- retrieval model
- query expansion
- mixture model
- statistical language models
- surveillance system
- infectious disease
- test collection
- query terms
- ad hoc information retrieval
- information systems
- smoothing methods
- pseudo relevance feedback
- document length
- social media data
- language model for information retrieval
- health information
- translation model
- context sensitive
- text mining
- user generated content
- generative model
- word clouds
- social networks