Preface
This tutorial including the gentle introduction to self-training and a classification task implemented with python.
Referee:
A Gentle Introduction to Self-Training and Semi-Supervised Learning;
Theoretical analysis of self-training with deep networks on unlabeled data.
- Introduction
“Self-training algorithms train a model to fit pseudolabels predicted by another previously-learned model.”
On a conceptual level, self-training works like this:
Picture Original
Recent work also extents this method to enforce stability of predictions under input transformations such as adversarial perturbations and data augmentation.
- Python implementation
Dataset
This data set is intended to be used for binary classification, and contains data for 14.6k+ surgeries. The attributes are measurements like bmi, while the target variable, complication, records whether the patient suffered complications as a result of surgery.