Skip to main content
All terms
Inference & Serving

Min-P Sampling

Sampling that sets a probability floor relative to the most likely token's probability.

Definition

Min-P sampling sets a dynamic cutoff by multiplying the top token's probability by a small constant and discarding every token below that threshold. The floor scales with model confidence: when the top token is highly probable, few alternatives survive; when it is uncertain, many remain eligible. This often yields text that is both more coherent and more varied than fixed top-k or top-p alone.