LT4SG@SMM4H24: Tweets Classification for Digital Epidemiology of Childhood Health Outcomes Using Pre-Trained Language Models.
Dasun AthukoralageThushari AtapattuMenasha ThilakaratneKatrina FalknerPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- pre trained
- n gram
- language modelling
- probabilistic model
- statistical language models
- retrieval model
- document retrieval
- speech recognition
- information retrieval
- query expansion
- public health
- smoothing methods
- test collection
- language models for information retrieval
- text classification
- classification accuracy
- decision trees
- image classification
- supervised learning
- machine learning
- feature vectors
- pattern recognition
- support vector
- training data
- feature extraction
- training set
- named entities
- feature space
- document ranking
- feature selection
- learning algorithm