Training
Self-Play
Training a system on experience it generates itself, sometimes against copies of itself.
Definition
Self-play trains a system on experience it generates itself, sometimes by competing against copies of itself, and updates it based on the outcomes. The approach became well known through game-playing AI such as AlphaGo and AlphaZero, where a system improves by playing a very large number of games against itself. The same idea now informs how training data and reasoning traces are generated for language models.
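As a rough illustration of the loop described above, the sketch below has one value table play both sides of a toy game and learn from who wins. Everything specific here is an assumption made for the example rather than how any particular system works: the game (single-pile Nim, where taking the last stone wins), the tabular Q-learning update with a negated "opponent's best reply" target, and the names `train_self_play`, `greedy_move`, `MAX_PILE`, and `MAX_TAKE`.

```python
import random

MAX_PILE = 12   # largest starting pile (assumed for this toy game)
MAX_TAKE = 3    # a move removes between 1 and MAX_TAKE stones


def legal_moves(pile):
    """Moves available when `pile` stones remain."""
    return range(1, min(MAX_TAKE, pile) + 1)


def greedy_move(q, pile):
    """Highest-valued move for the player about to act (ties go to the smallest move)."""
    return max(legal_moves(pile), key=lambda take: q[(pile, take)])


def train_self_play(episodes=20000, alpha=0.5, epsilon=0.2, seed=0):
    """Learn move values by letting a single table play both sides of every game."""
    rng = random.Random(seed)
    # q[(pile, take)] estimates the outcome (+1 win, -1 loss) for the player who moves.
    q = {(p, t): 0.0 for p in range(1, MAX_PILE + 1) for t in legal_moves(p)}
    for _ in range(episodes):
        pile = rng.randint(1, MAX_PILE)
        while pile > 0:
            # Both "players" are the same policy: epsilon-greedy over the shared table.
            if rng.random() < epsilon:
                take = rng.choice(list(legal_moves(pile)))
            else:
                take = greedy_move(q, pile)
            remaining = pile - take
            if remaining == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The opponent is a copy of the mover, so the position is worth
                # the negative of the opponent's best estimated value.
                target = -max(q[(remaining, t)] for t in legal_moves(remaining))
            q[(pile, take)] += alpha * (target - q[(pile, take)])
            pile = remaining  # the copy now moves from the new position
    return q


if __name__ == "__main__":
    q = train_self_play()
    for pile in (5, 6, 7):
        print(pile, "->", greedy_move(q, pile))
```

No human games or labels are supplied: all experience comes from the table playing against itself, and in this toy setting the learned greedy policy should recover the known strategy of leaving the opponent a multiple of four stones. Larger systems replace the table with a neural network and typically add search, but the generate-then-learn loop has the same shape.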