Gemini 2.0 Flash

Google | fast

Google's fastest model. Optimized for speed and efficiency with strong coding and reasoning.

Context Window: 1M tokens
Max Output: 8K tokens
Input Price: $0.1 per 1M tokens
Output Price: $0.4 per 1M tokens
Speed: 244 tokens/sec
Released: Feb 2025 (2025-02-05)
Blended Cost: $0.25 per 1M tokens
Value Score: 296.0 (quality per $)

Capabilities

Chat, Vision, Code Generation, Function Calling, Search, Audio

Benchmarks

Quality Index: 74
MMLU Pro: 83.8
HumanEval (Coding): 86.5
MATH: 73.1
Arena ELO: 1240

About Gemini 2.0 Flash

Gemini 2.0 Flash is a speed-focused AI model from Google, released on February 5, 2025. It supports a context window of 1M tokens and can generate up to 8K output tokens per response.

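At $0.1 per million input tokens and $0.4 per million output tokens, its blended cost comes to $0.25 per 1M tokens, which matches the simple average of the two prices and makes it one of the more affordable models available. Its value score of 296.0 (quality per dollar) is consistent with the Quality Index of 74 divided by that blended cost.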
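The pricing figures above can be reproduced with a few lines of arithmetic. The sketch below is illustrative only: it assumes the blended cost is an equal-weight average of the input and output prices and that the value score is the Quality Index divided by the blended cost. The page does not state either formula, but both match the listed numbers. The helper names `blended_cost`, `value_score`, and `request_cost` are hypothetical.

```python
# Figures taken from the listing on this page.
INPUT_PRICE = 0.10    # USD per 1M input tokens
OUTPUT_PRICE = 0.40   # USD per 1M output tokens
QUALITY_INDEX = 74


def blended_cost(input_price: float, output_price: float,
                 input_share: float = 0.5) -> float:
    """Weighted average price per 1M tokens.

    input_share=0.5 (equal weighting) is an assumption that happens to
    reproduce the listed $0.25; real workloads may be weighted differently.
    """
    return input_share * input_price + (1 - input_share) * output_price


def value_score(quality_index: float, blended: float) -> float:
    """Quality per dollar, assumed here to be quality index / blended cost."""
    return quality_index / blended


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE,
                 output_price: float = OUTPUT_PRICE) -> float:
    """Estimated USD cost of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    blended = blended_cost(INPUT_PRICE, OUTPUT_PRICE)
    print(round(blended, 2))                               # 0.25
    print(round(value_score(QUALITY_INDEX, blended), 1))   # 296.0
    # A 100K-token prompt with a 2K-token reply:
    print(round(request_cost(100_000, 2_000), 4))          # 0.0108
```

For input-heavy workloads such as long-document summarization, raising `input_share` toward 1.0 pulls the effective blended cost closer to the $0.1 input price.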

Using Gemini 2.0 Flash with Swfte

Access Gemini 2.0 Flash through Swfte Connect, our unified LLM gateway. Connect gives you a single API for 50+ models, with automatic routing, cost optimization, and fallback handling. You can also try Gemini 2.0 Flash in our AI Playground before integrating.