
FP4

A 4-bit number format that squeezes models into very little memory for fast inference.

Definition

FP4 is a 4-bit floating-point format: each number is stored in just four bits, so it can take only 16 distinct bit patterns. That cuts weight memory to roughly a quarter of what FP16 needs and speeds up the math on chips with native FP4 support, in exchange for some precision. It sits at the aggressive end of low-precision formats: narrower than FP8 and the same width as INT4.
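
To make "four bits per number" concrete, here is a minimal Python sketch. It assumes the E2M1 layout (1 sign bit, 2 exponent bits, 1 mantissa bit), which is one common FP4 encoding; the definition above does not say which layout is meant. The function names are hypothetical and exist only for illustration, and the rounding rule is simplified.

```python
# Illustrative sketch only. Assumes the E2M1 layout: 1 sign, 2 exponent, 1 mantissa bit.
# All names here are hypothetical helpers, not part of any library.

# The eight non-negative magnitudes E2M1 can represent, indexed by the low 3 bits.
E2M1_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]


def decode_e2m1(code: int) -> float:
    """Decode a 4-bit code (0..15) into the float it represents."""
    sign = -1.0 if (code >> 3) & 1 else 1.0
    exponent = (code >> 1) & 0b11
    mantissa = code & 1
    if exponent == 0:
        # Subnormal range: no implicit leading 1, so the value is 0 or 0.5.
        magnitude = mantissa * 0.5
    else:
        # Normal range: (1 + mantissa/2) * 2^(exponent - 1), using a bias of 1.
        magnitude = (1.0 + mantissa / 2.0) * 2.0 ** (exponent - 1)
    return sign * magnitude


def quantize_e2m1(x: float) -> int:
    """Round x to the nearest representable value, clamping to +/-6.

    Simplification: ties go to the smaller magnitude; real hardware
    typically uses round-to-nearest-even.
    """
    sign_bit = 1 if x < 0 else 0
    magnitude = min(abs(x), 6.0)
    index = min(range(8), key=lambda i: abs(E2M1_MAGNITUDES[i] - magnitude))
    return (sign_bit << 3) | index


def pack_pair(high: int, low: int) -> int:
    """Pack two 4-bit codes into one byte, which is where the memory saving comes from."""
    return ((high & 0xF) << 4) | (low & 0xF)


if __name__ == "__main__":
    for value in (2.4, -0.3, 100.0):
        code = quantize_e2m1(value)
        print(f"{value:>6} -> code {code:04b} -> {decode_e2m1(code)}")
```

Under these assumptions, 2.4 is stored as 2.0, -0.3 as -0.5, and 100.0 is clamped to 6.0, which shows the precision being traded away. Because the representable range is so narrow, FP4 schemes in practice usually pair the 4-bit codes with a shared scale factor for each small block of values; that scaling step is left out of this sketch.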