All terms
Hardware & Systems
FP4
A 4-bit number format that squeezes models into very little memory for fast inference.
Definition
FP4 is a 4-bit floating-point format — a way of storing each number using only four bits. It packs models into very little memory and speeds up the math on supported chips, in exchange for some precision. It sits at the aggressive end of low-precision formats, alongside FP8 and INT4.