Mitigating Boundary Ambiguity and Inherent Bias for Text Classification in the Era of Large Language Models.
Zhenyi LuJie TianWei WeiXiaoye QuYu ChengWenfeng XieDangyang ChenPublished in: CoRR (2024)
Keyphrases
- language model
- text classification
- language modeling
- n gram
- speech recognition
- information retrieval
- probabilistic model
- query expansion
- language modelling
- text categorization
- test collection
- bag of words
- feature selection
- retrieval model
- sentiment analysis
- naive bayes
- context sensitive
- document retrieval
- machine learning
- text documents
- statistical language modeling
- multi label
- cross lingual
- vector space model
- text mining
- document ranking
- query terms
- text classifiers
- ad hoc information retrieval
- statistical language models
- knn
- spoken term detection
- translation model
- word segmentation
- pseudo relevance feedback
- semantic features
- term frequency
- information extraction
- information retrieval systems