All terms
Training
Checkpoint
A saved snapshot of a model's parameters at a point during or after training.
Definition
A checkpoint is a saved copy of a model's parameters, used to resume an interrupted run, evaluate progress partway through training, or release a finished model. Runs typically write checkpoints at regular intervals so work is not lost on failure. Released model weights are essentially a published checkpoint, frozen at the end of training.