Impact of data quality for automatic issue classification using pre-trained language models.
Giuseppe Colavito, Filippo Lanubile, Nicole Novielli, Luigi Quaranta
Published in: J. Syst. Softw. (2024)
Keyphrases
- language model
- data quality
- language modeling
- pre-trained
- speech recognition
- query expansion
- pattern recognition
- n-gram
- language modelling
- support vector
- document retrieval
- test collection
- text classification
- information retrieval
- decision trees
- classification accuracy
- probabilistic model
- statistical language models
- feature vectors
- feature extraction
- feature selection
- retrieval model
- language models for information retrieval
- relevance model
- smoothing methods
- neural network
- supervised learning
- training data
- image classification
- semi-supervised
- data model
- database