Cost of Translation: AI Model Pricing Compared (May 2026)
Translation is the highest-volume LLM workload in production today. We price the canonical web-copy translation (EN-ES, ~1K in / 1.2K out) across every major LLM, and compare it to DeepL Pro on the same workload.
The reference scenario
- Task: Translate 1,000 input tokens to 1,200 output tokens (typical EN→ES web copy)
- Input tokens per call: 1,000
- Output tokens per call: 1,200
- Monthly volume: 100,000 translations (e-commerce / SaaS localization workload)
- Total tokens / month: 220M
Output tokens exceed input tokens because Spanish averages ~20% longer than English at the token level.
Cost across 10 models, sorted cheapest first
| Rank | Model | Per call | Per month | vs cheapest |
|---|---|---|---|---|
| 1 | DeepSeek V4 Flash | $0.000476 | $47.60 | — |
| 2 | Gemini 2.0 Flash | $0.000580 | $58.00 | 1.2x |
| 3 | Claude 3.5 Haiku | $0.0056 | $560 | 11.8x |
| 4 | DeepSeek V4 Pro | $0.0059 | $592 | 12.4x |
| 5 | Qwen 3.6 Plus | $0.0081 | $812 | 17.1x |
| 6 | Gemini 3.1 Pro | $0.0161 | $1,610 | 33.8x |
| 7 | Claude Sonnet 4 | $0.0210 | $2,100 | 44.1x |
| 8 | Claude Opus 4.7 | $0.0350 | $3,500 | 73.5x |
| 9 | GPT-5.5 | $0.0410 | $4,100 | 86.1x |
| 10 | GPT-5.5 Pro | $0.2460 | $24,600 | 516.8x |
DeepL Pro reference on the same workload (per-character pricing, ~5K chars per call): ~$0.000125 per call / ~$12.50 per month. Not in the table because the pricing model is different (per-character, not per-token).
Monthly spend at 100K translations
DeepSeek V4 Flash #................................... $47.60 Gemini 2.0 Flash #................................... $58.00 Claude 3.5 Haiku #................................... $560 DeepSeek V4 Pro #................................... $592 Qwen 3.6 Plus #................................... $812 Gemini 3.1 Pro ##.................................. $1,610 Claude Sonnet 4 ###................................. $2,100 Claude Opus 4.7 #####............................... $3,500 GPT-5.5 ######.............................. $4,100 GPT-5.5 Pro #################################### $24,600
Per-call cost
DeepSeek V4 Flash #............................. $0.000476 Gemini 2.0 Flash #............................. $0.000580 Claude 3.5 Haiku #............................. $0.0056 DeepSeek V4 Pro #............................. $0.0059 Qwen 3.6 Plus #............................. $0.0081 Gemini 3.1 Pro ##............................ $0.0161 Claude Sonnet 4 ###........................... $0.0210 Claude Opus 4.7 ####.......................... $0.0350 GPT-5.5 #####......................... $0.0410 GPT-5.5 Pro ############################## $0.2460
Which model wins for translation?
For purely European languages (EN-ES, EN-FR, EN-DE, EN-IT): DeepL Pro is the value leader and quality leader. It is purpose-built for translation, has explicit glossary support, and the per-character pricing is competitive with the cheapest LLM tier. Most teams shipping European localization at scale use DeepL by default.
Recommended LLM pick: Gemini 3.1 Pro is our top LLM-based pick. The multilingual coverage is best-in-class, the long context allows you to send a brand-voice glossary alongside the source text, and the $3.50 / $10.50 per 1M tokens pricing is reasonable. Runner-up: DeepSeek V4 Pro, which is roughly 6x cheaper than Gemini 3.1 Pro and very strong for EN-ZH and CJK language pairs.
When to use a cheap model
- Templated copy: product descriptions, FAQs, terms-of-service updates
- High-volume bulk localization where human review is in the loop
- Internal tooling translation (admin UIs, error messages)
- Short strings (under 200 tokens each)
- Languages well-represented in the model (EN-ES, EN-PT, EN-ZH)
When to use a frontier model
- Marketing copy where brand voice matters
- Legal / contractual / medical translation
- Low-resource languages (Swahili, Bengali, Tagalog, Yoruba)
- Translation with embedded context (subtitles, dialog, conversational tone)
- Translation + transcreation (cultural adaptation, not just literal)
DeepL is not on the token table — but should be in your stack
DeepL prices per character, not per token. On a typical 5K-char EN-ES translation, DeepL Pro costs around $0.000125, putting it in the same ballpark as DeepSeek V4 Flash. The advantage of DeepL is purpose-built quality on European pairs and an explicit glossary product. The disadvantage is no instruction following — you cannot ask DeepL to "translate in a friendly tone, prefer Latin American Spanish, and keep technical terms in English." For that, an LLM is the only option.
Related
- Token Cost Calculator
- Cheap vs Expensive Model Comparison
- Model-Mixing Cost Savings
- AI Model Leaderboard
Pricing data sourced from official provider pages and OpenRouter, May 2026-05-06. DeepL pricing reflects the Pro plan published rate. Effective production cost will be 1.5-2x higher after retries, system prompts, and priority-tier surcharges.