Regularization

Regularization techniques reduce overfitting.

Overview

Dataset

Metrics

Papers

Methods

Data Augmentation.
Adjust Loss Function.
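- Adding Penalty Terms (L1 norm, L2 norm).
- Weight Decay: w_new = w_old - η * (∇L + 2λ * w_old), where ∇L is the gradient of the original loss function, η is the learning rate, and λ is the penalty strength. This is the gradient step for the penalized loss L + λ * ||w||².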
Noise Injection.
Mixup.
Larger Learning Rate.
Smaller Batches.
Neural-network Specific.
- Dropout.
- Batch Normalization (BatchNorm).
Early Stopping.
Multi-task Learning.
Ensembles.
Label Smoothing.
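
Three of the methods listed above (the weight decay update, dropout, and label smoothing) can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the function names, the parameters `lr`, `lam`, `p_drop`, and `eps`, and the use of Python lists as vectors are all assumptions made for the example. The dropout shown is the common "inverted" variant, and the label smoothing shown mixes the target with a uniform distribution over classes.

```python
import random


def weight_decay_step(w, grad, lr, lam):
    """One gradient step on the penalized loss L(w) + lam * ||w||^2.

    Computes w_new = w_old - lr * (grad + 2 * lam * w_old), where `grad`
    is the gradient of the original (unpenalized) loss at w_old.
    """
    return [wi - lr * (gi + 2.0 * lam * wi) for wi, gi in zip(w, grad)]


def dropout(activations, p_drop, rng):
    """Inverted dropout (training mode).

    Each unit is zeroed with probability p_drop; surviving units are
    scaled by 1 / (1 - p_drop) so the expected activation is unchanged.
    """
    if not 0.0 <= p_drop < 1.0:
        raise ValueError("p_drop must be in [0, 1)")
    keep = 1.0 - p_drop
    return [a / keep if rng.random() < keep else 0.0 for a in activations]


def smooth_labels(one_hot, eps):
    """Label smoothing: blend a one-hot target with a uniform distribution."""
    k = len(one_hot)
    return [(1.0 - eps) * y + eps / k for y in one_hot]


if __name__ == "__main__":
    # With a zero loss gradient, the step only shrinks the weights toward 0.
    print(weight_decay_step([1.0, -2.0], [0.0, 0.0], lr=0.1, lam=0.5))

    # Fixed seed (hypothetical choice) so the dropout mask is reproducible.
    print(dropout([1.0, 1.0, 1.0, 1.0], p_drop=0.5, rng=random.Random(0)))

    # The smoothed target still sums to 1.
    print(smooth_labels([0.0, 0.0, 1.0], eps=0.1))
```

With a zero loss gradient, the weight decay step multiplies every weight by (1 - 2 * lr * lam), which is why the penalty is described as "decaying" the weights.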

Paper List

References

https://www.youtube.com/watch?v=ZF_g531UYec&list=PL2Yggtk_pK69nyeIgJsjPN0traCIyMJ_f&index=18
