OffLanDat: A Community Based Implicit Offensive Language Dataset Generated by Large Language Model Through Prompt Engineering.
Amit DasMostafa RahgouyDongji FengZheng ZhangTathagata BhattacharyaNilanjana RaychawdharyMary SandageLauramarie PopeGerry V. DozierCheryl D. SealsPublished in: CoRR (2024)
Keyphrases
- language model
- language modeling
- n gram
- retrieval model
- language modelling
- probabilistic model
- document retrieval
- query expansion
- information retrieval
- speech recognition
- test collection
- mixture model
- query terms
- natural language
- language model for information retrieval
- context sensitive
- statistical language models
- vector space model
- translation model
- relevance model
- language models for information retrieval
- ad hoc information retrieval
- word error rate
- retrieval effectiveness
- machine learning
- statistical model
- cross language retrieval
- context dependent
- natural language processing
- model selection