Training

Self-Play

Training a system on experience it generates itself, sometimes against copies of itself.

Definition

Self-play trains a system by having it generate its own experience, sometimes by competing against copies of itself, and then learning from the outcomes. It became well known in game-playing AI, where a system improves by playing many games against itself. The same idea is now applied to language models, where it is used to generate training data and reasoning traces.
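
As a rough illustration of the game-playing case, the sketch below trains a single value table by having it play both sides of a toy game. Everything in it is an assumption made for the example rather than part of the definition: the game (a Nim variant in which players remove 1 to 3 stones and whoever takes the last stone wins), the tabular Q-learning update with a negated "opponent's best reply" target, and the names train_self_play, greedy_move, and legal_moves.

```python
import random
from collections import defaultdict

MAX_TAKE = 3  # assumed rule: a player removes 1 to 3 stones per turn


def legal_moves(stones):
    """Moves available to whoever is about to play."""
    return range(1, min(MAX_TAKE, stones) + 1)


def greedy_move(q, stones):
    """Pick the move the shared table currently rates highest."""
    return max(legal_moves(stones), key=lambda a: q[(stones, a)])


def train_self_play(start_stones=10, episodes=20000, alpha=0.5,
                    epsilon=0.2, seed=0):
    """Learn move values by letting one table play both sides.

    q[(stones, move)] estimates the outcome for the player making the
    move: +1 for an eventual win, -1 for an eventual loss.
    """
    rng = random.Random(seed)
    q = defaultdict(float)
    for _ in range(episodes):
        stones = start_stones
        # Each pass through this loop is one turn. The two "players"
        # are the same table, consulted from the mover's point of view.
        while stones > 0:
            if rng.random() < epsilon:
                move = rng.choice(list(legal_moves(stones)))  # explore
            else:
                move = greedy_move(q, stones)                 # exploit
            remaining = stones - move
            if remaining == 0:
                target = 1.0  # the mover took the last stone and wins
            else:
                # The opponent (a copy of the same policy) moves next,
                # so its best outcome is the mover's worst.
                target = -max(q[(remaining, a)]
                              for a in legal_moves(remaining))
            q[(stones, move)] += alpha * (target - q[(stones, move)])
            stones = remaining
    return q


if __name__ == "__main__":
    q = train_self_play()
    print({s: greedy_move(q, s) for s in range(1, 11)})
```

No external opponent or labeled data appears anywhere in the loop: all experience comes from the system's own moves. In this toy game the known winning strategy is to leave the opponent a multiple of four stones, so the learned greedy moves can be compared against that as a sanity check.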