MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture.
Wancong ZhangIeshan VaidyaPublished in: CoRR (2021)
Keyphrases
- management system
- training set
- real time
- cross validation
- feedforward artificial neural networks
- avoid overfitting
- multi layer
- decision trees
- online learning
- test set
- fault diagnosis
- supply chain
- improved algorithm
- fuzzy logic
- camera parameters
- training algorithm
- focal length
- network architecture
- training error
- artificial intelligence