Leveraging Weakly Annotated Data for Hate Speech Detection in Code-Mixed Hinglish: A Feasibility-Driven Transfer Learning Approach with Large Language Models.
Sargam Yadav, Abhishek Kaushik, Kevin McDaid
Published in: CoRR (2024)
Keyphrases
- language model
- transfer learning
- speech recognition
- word error rate
- language modeling
- n-gram
- knowledge transfer
- probabilistic model
- cross domain learning
- reinforcement learning
- cross domain
- active learning
- speech signal
- labeled data
- information retrieval
- spoken term detection
- machine learning
- retrieval model
- query expansion
- test collection
- language models for information retrieval
- smoothing methods
- text categorization
- text classification
- automatic speech recognition
- error rate
- statistical language models
- collaborative filtering
- semi-supervised learning
- structure learning
- text mining
- machine learning algorithms
- target domain
- context sensitive
- cross lingual
- Bayesian networks