GPT-4o Mini

OpenAI · fast

Fast and affordable small model for lightweight tasks and high-throughput use cases.

Context Window: 128K tokens
Max Output: 16K tokens
Input Price: $0.15 per 1M tokens
Output Price: $0.60 per 1M tokens
Speed: 183 tokens/sec
Released: Jul 2024 (2024-07-18)
Blended Cost: $0.38 per 1M tokens
Value Score: 192.0 (quality per $)

Capabilities

Chat, Vision, Function Calling, Code Generation

Benchmarks

Quality Index: 72
MMLU Pro: 82
HumanEval (Coding): 87.2
MATH: 70.2
Arena ELO: 1216

About GPT-4o Mini

GPT-4o Mini is a small, fast model from OpenAI, released on July 18, 2024. It supports a 128K-token context window and can generate up to 16K output tokens per response.

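At $0.15 per million input tokens and $0.60 per million output tokens, GPT-4o Mini has a blended cost of $0.38 per 1M tokens, which is consistent with a simple average of the two prices ($0.375, rounded). Its value score of 192.0 matches the Quality Index (72) divided by that unrounded blended cost, so a higher score means more benchmark quality per dollar.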
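The pricing figures above can be checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the blended cost is a 1:1 average of the input and output prices and that the value score is the Quality Index divided by the unrounded blended cost. Both assumptions reproduce the published numbers, but the page does not state the formulas. The function names (`blended_cost`, `value_score`, `request_cost`) are hypothetical helpers, not part of any API.

```python
# Published figures from this page (USD per 1M tokens).
INPUT_PRICE = 0.15
OUTPUT_PRICE = 0.60
QUALITY_INDEX = 72


def blended_cost(input_price: float, output_price: float) -> float:
    """Assumed 1:1 blend: the simple mean of input and output price."""
    return (input_price + output_price) / 2


def value_score(quality: float, blended: float) -> float:
    """Assumed definition: quality points per blended dollar."""
    return quality / blended


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-token prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


blended = blended_cost(INPUT_PRICE, OUTPUT_PRICE)
print(f"blended cost: ${blended:.3f} per 1M tokens")               # 0.375
print(f"value score:  {value_score(QUALITY_INDEX, blended):.1f}")  # 192.0
print(f"10K in + 1K out: ${request_cost(10_000, 1_000):.4f}")      # 0.0021
```

For real workloads the input-to-output ratio is rarely 1:1, so `request_cost` with your own token counts is usually a better estimate than the blended figure.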

Using GPT-4o Mini with Swfte

Access GPT-4o Mini through Swfte Connect, our unified LLM gateway. Connect gives you a single API for 50+ models, with automatic routing, cost optimization, and fallback handling. You can also try GPT-4o Mini in our AI Playground before integrating.