Automated occupation coding with hierarchical features: a data-centric approach to classification with pre-trained language models.
Parisa SafikhaniHayastan AvetisyanDennis Föste-EggersDavid BroneskePublished in: Discov. Artif. Intell. (2023)
Keyphrases
- language model
- data centric
- language modeling
- feature vectors
- classification accuracy
- pre trained
- feature space
- feature extraction
- speech recognition
- smoothing methods
- co occurrence
- pattern recognition
- n gram
- data driven
- probabilistic model
- decision trees
- machine learning
- image classification
- data management
- image features
- information retrieval
- supervised learning
- text classification
- information management
- bayesian networks
- text mining
- low level
- training examples
- wireless sensor networks
- support vector
- information integration
- feature selection
- data sets