How much does Gemini 2.0 Flash cost?

Gemini 2.0 Flash costs $0.10 per million input tokens and $0.40 per million output tokens. The blended average, the simple mean of the two prices, is $0.25 per million tokens.
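To make the pricing arithmetic concrete, here is a minimal sketch of how the blended figure and a per-request cost could be computed from the prices quoted above. The function names are hypothetical, and the blended price is assumed to be a plain average of the input and output prices (which matches the $0.25 figure); real workloads with a different input/output mix will see a different effective rate.

```python
# Prices in USD per one million tokens, as quoted on this page.
INPUT_PRICE_PER_M = 0.10
OUTPUT_PRICE_PER_M = 0.40


def blended_price(input_price: float, output_price: float) -> float:
    """Simple mean of the two prices (assumes an equal input/output token mix)."""
    return (input_price + output_price) / 2


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the quoted per-million prices."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    print(f"Blended price: ${blended_price(INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M):.2f} per 1M tokens")
    # Hypothetical request: 10,000 prompt tokens and 1,000 generated tokens.
    print(f"Request cost:  ${request_cost(10_000, 1_000):.4f}")
```

Because output tokens cost four times as much as input tokens, generation-heavy requests move the effective rate toward $0.40 per million, while prompt-heavy requests move it toward $0.10.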

What is the context window of Gemini 2.0 Flash?

Gemini 2.0 Flash supports a context window of 1M tokens (1,000,000 tokens) and can generate up to 8K output tokens per request.
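As a rough illustration of how these two limits interact, the sketch below checks whether a planned request stays within them. It uses the rounded figures quoted on this page (the provider's exact limits may differ slightly), and it assumes the prompt and the generated output share the same context window. The function name is hypothetical.

```python
# Rounded limits as quoted on this page; exact provider limits may differ.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 8_000


def fits_limits(prompt_tokens: int, output_tokens: int) -> bool:
    """Return True if the request respects both the output cap and the window.

    Assumption: prompt and output tokens count against the same context window.
    """
    if prompt_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    within_output_cap = output_tokens <= MAX_OUTPUT_TOKENS
    within_window = prompt_tokens + output_tokens <= CONTEXT_WINDOW_TOKENS
    return within_output_cap and within_window


if __name__ == "__main__":
    print(fits_limits(900_000, 8_000))  # large prompt, full-length output
    print(fits_limits(995_000, 8_000))  # prompt plus output exceeds the window
    print(fits_limits(1_000, 9_000))    # output request above the per-request cap
```

Token counts depend on the model's tokenizer, so a check like this is only as accurate as the counts passed into it.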

What is Gemini 2.0 Flash best for?

Gemini 2.0 Flash is best for workloads where speed and low cost matter most. It supports chat, vision, code generation, function calling, search, and audio.

Gemini 2.0 Flash — Pricing, Benchmarks & Specs

Gemini 2.0 Flash

Google | fast

Google's fastest model. Optimized for speed and efficiency with strong coding and reasoning.

Context Window

1M tokens

Max Output

8K tokens

Input Price

$0.10

per 1M tokens

Output Price

$0.40

per 1M tokens

Speed

244

tokens/sec

Released

Feb 2025

2025-02-05

Blended Cost

$0.25

per 1M tokens

Value Score

296.0

quality per $

Capabilities

Chat, Vision, Code Generation, Function Calling, Search, Audio

Benchmarks

Quality Index

MMLU Pro

83.8

HumanEval (Coding)

86.5

MATH

73.1

Arena ELO

1240

About Gemini 2.0 Flash

Gemini 2.0 Flash is a fast AI model from Google, released on February 5, 2025. It supports a context window of 1M tokens (1,000,000 tokens) and can generate up to 8K output tokens per request.

At $0.10 per million input tokens and $0.40 per million output tokens, its blended cost of $0.25 per 1M tokens makes it one of the most affordable models available. Its value score of 296.0, a measure of quality per dollar, reflects the balance of quality and cost.

Using Gemini 2.0 Flash with Swfte

Access Gemini 2.0 Flash through Swfte Connect, our unified LLM gateway. Connect gives you a single API for 50+ models, with automatic routing, cost optimization, and fallback handling. You can also try Gemini 2.0 Flash in our AI Playground before integrating.