MaChAmp at SemEval-2023 tasks 2, 3, 4, 5, 7, 8, 9, 10, 11, and 12: On the Effectiveness of Intermediate Training on an Uncurated Collection of Datasets.
Rob van der GootPublished in: SemEval@ACL (2023)
Keyphrases
- database
- ground truth labels
- amazon mechanical turk
- training set
- training dataset
- supervised learning
- benchmark datasets
- artificial intelligence
- text classification tasks
- document collections
- training samples
- training and testing data
- training phase
- data mining tasks
- training process
- data mining algorithms
- probabilistic model
- hidden markov models
- active learning
- object recognition
- feature selection
- neural network
- data sets