Skip to main content
All terms
Models & Products

GPT-4o

OpenAI's omni model that handles text, audio, and image in one network.

Definition

GPT-4o, where the 'o' stands for 'omni', is a model from OpenAI that processes text, audio, and image inputs through a single network rather than chaining separate systems. This design speeds up audio responses and lets it reason across the different input types together. Available in ChatGPT and the API, it became a primary general-purpose model, balancing capability with speed and lower cost than earlier GPT-4 variants.