AI Model Pricing Index
Compare API pricing across every major AI provider. Sortable table, historical trends, and an interactive cost calculator to estimate your monthly spend.
Models Tracked: 326
Providers: 52
Cheapest Input: $0.02
Price Range: 8824x
Full Pricing Table
| Model | Provider | Input / 1M tokens | Output / 1M tokens | Blended / 1M tokens | Quality | Value | Context |
|---|---|---|---|---|---|---|---|
Open-source | Mistral AI | $0.02 | $0.04 | $0.03 | 72 | 2400.0 | 131K |
Open-source | | $0.02 | $0.04 | $0.03 | 50 | 1666.7 | 33K |
Open-source | Meta | $0.02 | $0.05 | $0.04 | 65 | 1857.1 | 16K |
Open-source | Meta | $0.03 | $0.04 | $0.04 | 65 | 1857.1 | 8K |
Open-source | Meta | $0.02 | $0.06 | $0.04 | 50 | 1250.0 | 131K |
Hard reasoning | sao10k | $0.04 | $0.05 | $0.04 | 50 | 1111.1 | 8K |
Open-source | Meta | $0.05 | $0.05 | $0.05 | 50 | 1020.4 | 131K |
Open-source | | $0.04 | $0.08 | $0.06 | 65 | 1083.3 | 131K |
Open-source | | $0.03 | $0.09 | $0.06 | 65 | 1083.3 | 8K |
Speed & cost | gryphe | $0.06 | $0.06 | $0.06 | 58 | 966.7 | 4K |
Code generation | Alibaba Cloud | $0.03 | $0.09 | $0.06 | 50 | 833.3 | 33K |
Speed & cost | ibm-granite | $0.02 | $0.11 | $0.06 | 62 | 976.4 | 131K |
Open-source | Mistral AI | $0.05 | $0.08 | $0.07 | 72 | 1107.7 | 33K |
Open-source | Mistral AI | $0.03 | $0.11 | $0.07 | 72 | 1028.6 | 131K |
Speed & cost | OpenAI | $0.03 | $0.11 | $0.07 | 50 | 714.3 | 131K |
Open-source | Alibaba Cloud | $0.04 | $0.10 | $0.07 | 50 | 714.3 | 33K |
Speed & cost | liquid | $0.03 | $0.12 | $0.07 | 50 | 666.7 | 33K |
Open-source | Alibaba Cloud | $0.03 | $0.13 | $0.08 | 50 | 615.4 | 131K |
Open-source | | $0.04 | $0.13 | $0.09 | 74 | 870.6 | 131K |
Open-source | Alibaba Cloud | $0.07 | $0.10 | $0.09 | 82 | 959.1 | 262K |
Speed & cost | Amazon | $0.04 | $0.14 | $0.09 | 62 | 708.6 | 128K |
Open-source | Cohere | $0.04 | $0.15 | $0.09 | 50 | 533.3 | 128K |
Speed & cost | arcee | $0.04 | $0.15 | $0.10 | 50 | 512.8 | 131K |
Open-source | Alibaba Cloud | $0.05 | $0.15 | $0.10 | 82 | 820.0 | 256K |
Speed & cost | nvidia | $0.04 | $0.16 | $0.10 | 62 | 620.0 | 131K |
Speed & cost | rekaai | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 16K |
Speed & cost | Mistral AI | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 131K |
Z.ai: GLM 4 32B OSS Open-source | z-ai | $0.10 | $0.10 | $0.10 | 58 | 580.0 | 128K |
Speed & cost | microsoft | $0.07 | $0.14 | $0.10 | 65 | 634.1 | 16K |
Open-source | Meta | $0.03 | $0.20 | $0.11 | 50 | 440.5 | 60K |
Speed & cost | OpenAI | $0.04 | $0.19 | $0.11 | 50 | 436.7 | 131K |
Open-source | | $0.08 | $0.16 | $0.12 | 74 | 616.7 | 131K |
Speed & cost | nvidia | $0.05 | $0.20 | $0.13 | 62 | 496.0 | 262K |
Open-source | allenai | $0.05 | $0.20 | $0.13 | 50 | 400.0 | 128K |
Open-source | Mistral AI | $0.07 | $0.20 | $0.14 | 72 | 523.6 | 128K |
Search + citations | nousresearch | $0.14 | $0.14 | $0.14 | 65 | 464.3 | 8K |
Open-source | Alibaba Cloud | $0.06 | $0.24 | $0.15 | 82 | 546.7 | 41K |
Speed & cost | essentialai | $0.15 | $0.15 | $0.15 | 58 | 386.7 | 33K |
Speed & cost | Mistral AI | $0.15 | $0.15 | $0.15 | 58 | 386.7 | 262K |
Speed & cost | Amazon | $0.06 | $0.24 | $0.15 | 58 | 386.7 | 300K |
Open-source | Mistral AI | $0.11 | $0.19 | $0.15 | 58 | 386.7 | 3K |
Speed & cost | bytedance | $0.10 | $0.20 | $0.15 | 58 | 386.7 | 128K |
Speed & cost | rekaai | $0.10 | $0.20 | $0.15 | 58 | 386.7 | 66K |
Open-source | Alibaba Cloud | $0.08 | $0.24 | $0.16 | 82 | 512.5 | 41K |
Speed & cost | Alibaba Cloud | $0.07 | $0.26 | $0.16 | 82 | 504.6 | 1M |
Code generation | Alibaba Cloud | $0.07 | $0.27 | $0.17 | 82 | 482.4 | 160K |
Hard reasoning | baidu | $0.07 | $0.28 | $0.18 | 58 | 331.4 | 131K |
Speed & cost | baidu | $0.07 | $0.28 | $0.18 | 58 | 331.4 | 120K |
Speed & cost | arcee | $0.18 | $0.18 | $0.18 | 58 | 322.2 | 131K |
Open-source | Meta | $0.18 | $0.18 | $0.18 | 58 | 322.2 | 164K |
Open-source | Alibaba Cloud | $0.08 | $0.28 | $0.18 | 82 | 455.6 | 41K |
Speed & cost | | $0.07 | $0.30 | $0.19 | 73 | 389.3 | 1M |
Speed & cost | bytedance | $0.07 | $0.30 | $0.19 | 58 | 309.3 | 262K |
Speed & cost | OpenAI | $0.07 | $0.30 | $0.19 | 58 | 309.3 | 131K |
Speed & cost | Meta | $0.08 | $0.30 | $0.19 | 74 | 389.5 | 328K |
Speed & cost | xiaomi | $0.09 | $0.29 | $0.19 | 58 | 305.3 | 262K |
Open-source | Alibaba Cloud | $0.09 | $0.30 | $0.20 | 82 | 420.5 | 262K |
Open-source | Meta | $0.05 | $0.34 | $0.20 | 58 | 296.7 | 80K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 72 | 360.0 | 33K |
Speed & cost | stepfun | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 262K |
Speed & cost | Mistral AI | $0.20 | $0.20 | $0.20 | 58 | 290.0 | 262K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 32K |
Open-source | Mistral AI | $0.10 | $0.30 | $0.20 | 58 | 290.0 | 131K |
Cheap-and-fast cascade tier | DeepSeek | $0.14 | $0.28 | $0.21 | 85 | 404.8 | 1M |
Open-source | Meta | $0.10 | $0.32 | $0.21 | 74 | 352.4 | 131K |
Open-source | Alibaba Cloud | $0.05 | $0.40 | $0.23 | 82 | 364.4 | 41K |
Speed & cost | OpenAI | $0.05 | $0.40 | $0.23 | 72 | 320.0 | 400K |
Speed & cost | z-ai | $0.06 | $0.40 | $0.23 | 58 | 252.2 | 203K |
Hard reasoning | Alibaba Cloud | $0.08 | $0.40 | $0.24 | 82 | 341.7 | 131K |
Speed & cost | | $0.10 | $0.40 | $0.25 | 80 | 320.0 | 1M |
Speed & cost | | $0.10 | $0.40 | $0.25 | 80 | 320.0 | 1M |
Open-source | nvidia | $0.10 | $0.40 | $0.25 | 74 | 296.0 | 131K |
Speed & cost | | $0.10 | $0.40 | $0.25 | 73 | 292.0 | 1M |
Speed & cost | OpenAI | $0.10 | $0.40 | $0.25 | 72 | 288.0 | 1M |
Speed & cost | bytedance | $0.10 | $0.40 | $0.25 | 58 | 232.0 | 262K |
Open-source | Alibaba Cloud | $0.12 | $0.39 | $0.26 | 82 | 321.6 | 33K |
Open-source | Alibaba Cloud | $0.10 | $0.42 | $0.26 | 82 | 315.4 | 131K |
Open-source | | $0.13 | $0.40 | $0.27 | 76 | 286.8 | 262K |
Search + citations | nousresearch | $0.13 | $0.40 | $0.27 | 58 | 218.9 | 131K |
Open-source | | $0.14 | $0.40 | $0.27 | 76 | 281.5 | 262K |
Search + citations | Alibaba Cloud | $0.09 | $0.45 | $0.27 | 58 | 214.8 | 131K |
Open-source | Alibaba Cloud | $0.14 | $0.41 | $0.27 | 58 | 212.5 | 131K |
Hard reasoning | DeepSeek | $0.29 | $0.29 | $0.29 | 91 | 313.8 | 33K |
Open-source | Alibaba Cloud | $0.08 | $0.50 | $0.29 | 82 | 282.8 | 131K |
Search + citations | nousresearch | $0.30 | $0.30 | $0.30 | 74 | 246.7 | 131K |
Speed & cost | nvidia | $0.10 | $0.50 | $0.30 | 58 | 193.3 | 262K |
Speed & cost | thedrummer | $0.17 | $0.43 | $0.30 | 58 | 193.3 | 33K |
Open-source | nex-agi | $0.14 | $0.50 | $0.32 | 86 | 270.9 | 131K |
Open-source | DeepSeek | $0.26 | $0.38 | $0.32 | 87 | 271.9 | 164K |
Open-source | Alibaba Cloud | $0.13 | $0.52 | $0.33 | 82 | 252.3 | 131K |
Hard reasoning | allenai | $0.15 | $0.50 | $0.33 | 58 | 178.5 | 66K |
Open-source | DeepSeek | $0.27 | $0.41 | $0.34 | 86 | 252.9 | 164K |
Speed & cost | xAI | $0.20 | $0.50 | $0.35 | 58 | 165.7 | 2M |
Speed & cost | xAI | $0.20 | $0.50 | $0.35 | 58 | 165.7 | 2M |
Speed & cost | baidu | $0.14 | $0.56 | $0.35 | 58 | 165.7 | 30K |
Speed & cost | tencent | $0.14 | $0.57 | $0.35 | 58 | 163.4 | 131K |
Hard reasoning | Alibaba Cloud | $0.15 | $0.58 | $0.36 | 58 | 158.9 | 131K |
Open-source | Meta | $0.15 | $0.60 | $0.38 | 82 | 218.7 | 1M |
Search + citations | OpenAI | $0.15 | $0.60 | $0.38 | 80 | 213.3 | 128K |
Speed & cost | OpenAI | $0.15 | $0.60 | $0.38 | 80 | 213.3 | 128K |
Speed & cost | OpenAI | $0.15 | $0.60 | $0.38 | 80 | 213.3 | 128K |
Open-source | Mistral AI | $0.15 | $0.60 | $0.38 | 72 | 192.0 | 262K |
Speed & cost | upstage | $0.15 | $0.60 | $0.38 | 58 | 154.7 | 128K |
Open-source | Cohere | $0.15 | $0.60 | $0.38 | 58 | 154.7 | 128K |
Speed & cost | xAI | $0.30 | $0.50 | $0.40 | 82 | 205.0 | 131K |
Speed & cost | xAI | $0.30 | $0.50 | $0.40 | 82 | 205.0 | 131K |
Open-source | Meta | $0.40 | $0.40 | $0.40 | 74 | 185.0 | 131K |
Speed & cost | thedrummer | $0.40 | $0.40 | $0.40 | 66 | 165.0 | 33K |
Speed & cost | nvidia | $0.20 | $0.60 | $0.40 | 62 | 155.0 | 131K |
Open-source | allenai | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 66K |
Speed & cost | thedrummer | $0.30 | $0.50 | $0.40 | 58 | 145.0 | 131K |
Open-source | Alibaba Cloud | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 128K |
Open-source | Mistral AI | $0.20 | $0.60 | $0.40 | 58 | 145.0 | 33K |
Code generation | Alibaba Cloud | $0.12 | $0.75 | $0.43 | 82 | 188.5 | 262K |
Hard reasoning | Alibaba Cloud | $0.10 | $0.78 | $0.44 | 82 | 186.9 | 131K |
Open-source | DeepSeek | $0.15 | $0.75 | $0.45 | 86 | 191.1 | 33K |
Open-source | DeepSeek | $0.20 | $0.77 | $0.48 | 86 | 177.3 | 164K |
Open-source | z-ai | $0.13 | $0.85 | $0.49 | 58 | 118.4 | 131K |
Open-source | DeepSeek | $0.21 | $0.79 | $0.50 | 86 | 172.0 | 164K |
Speed & cost | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Speed & cost | meituan | $0.20 | $0.80 | $0.50 | 58 | 116.0 | 131K |
Speed & cost | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Code generation | inception | $0.25 | $0.75 | $0.50 | 58 | 116.0 | 128K |
Hard reasoning | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Open-source | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Open-source | Alibaba Cloud | $0.26 | $0.78 | $0.52 | 58 | 111.5 | 1M |
Hard reasoning | arcee | $0.22 | $0.85 | $0.54 | 58 | 108.4 | 262K |
Open-source | Alibaba Cloud | $0.20 | $0.88 | $0.54 | 82 | 151.9 | 262K |
Open-source | Mistral AI | $0.54 | $0.54 | $0.54 | 72 | 133.3 | 33K |
Speed & cost | undi95 | $0.45 | $0.65 | $0.55 | 66 | 120.0 | 6K |
Speed & cost | minimax | $0.12 | $0.99 | $0.55 | 58 | 104.7 | 197K |
Code generation | Alibaba Cloud | $0.20 | $0.97 | $0.58 | 82 | 140.2 | 1M |
Open-source | Alibaba Cloud | $0.09 | $1.10 | $0.60 | 82 | 137.8 | 262K |
Code generation | Mistral AI | $0.30 | $0.90 | $0.60 | 78 | 130.0 | 256K |
Open-source | z-ai | $0.30 | $0.90 | $0.60 | 58 | 96.7 | 131K |
Open-source | DeepSeek | $0.32 | $0.89 | $0.60 | 86 | 142.1 | 164K |
Code generation | Alibaba Cloud | $0.22 | $1.00 | $0.61 | 82 | 134.4 | 262K |
Speed & cost | minimax | $0.27 | $0.95 | $0.61 | 58 | 95.1 | 197K |
Speed & cost | microsoft | $0.62 | $0.62 | $0.62 | 62 | 100.0 | 66K |
Open-source | Meta | $0.51 | $0.74 | $0.63 | 66 | 105.6 | 8K |
Speed & cost | minimax | $0.26 | $1.00 | $0.63 | 58 | 92.4 | 197K |
Code generation | arcee | $0.50 | $0.80 | $0.65 | 66 | 101.5 | 33K |
Open-source | | $0.65 | $0.65 | $0.65 | 65 | 100.0 | 8K |
Speed & cost | prime-intellect | $0.20 | $1.10 | $0.65 | 58 | 89.2 | 131K |
Speed & cost | minimax | $0.20 | $1.10 | $0.65 | 58 | 89.2 | 1M |
Speed & cost | thedrummer | $0.55 | $0.80 | $0.68 | 66 | 97.8 | 33K |
Speed & cost | baidu | $0.28 | $1.10 | $0.69 | 58 | 84.1 | 123K |
Hard reasoning | sao10k | $0.65 | $0.75 | $0.70 | 66 | 94.3 | 131K |
Hard reasoning | tngtech | $0.30 | $1.10 | $0.70 | 91 | 130.0 | 164K |
Speed & cost | OpenAI | $0.20 | $1.25 | $0.72 | 72 | 99.3 | 400K |
Open-source | Alibaba Cloud | $0.16 | $1.30 | $0.73 | 82 | 112.1 | 262K |
Hard reasoning | Alibaba Cloud | $0.12 | $1.36 | $0.74 | 82 | 110.7 | 131K |
Hard reasoning | DeepSeek | $0.70 | $0.80 | $0.75 | 91 | 121.3 | 131K |
Speed & cost | Anthropic | $0.25 | $1.25 | $0.75 | 72 | 96.0 | 200K |
Code generation | kwaipilot | $0.30 | $1.20 | $0.75 | 58 | 77.3 | 256K |
Speed & cost | minimax | $0.30 | $1.20 | $0.75 | 58 | 77.3 | 205K |
Speed & cost | minimax | $0.30 | $1.20 | $0.75 | 58 | 77.3 | 66K |
Open-source | DeepSeek | $0.40 | $1.20 | $0.80 | 86 | 107.5 | 164K |
Open-source | Alibaba Cloud | $0.80 | $0.80 | $0.80 | 66 | 82.5 | 33K |
Hard reasoning | Alibaba Cloud | $0.15 | $1.50 | $0.82 | 82 | 99.7 | 131K |
Code generation | Alibaba Cloud | $0.66 | $1.00 | $0.83 | 66 | 79.5 | 33K |
Speed & cost | baidu | $0.42 | $1.25 | $0.83 | 66 | 79.0 | 123K |
Hard reasoning | Alibaba Cloud | $0.13 | $1.56 | $0.84 | 82 | 97.0 | 131K |
Hard reasoning | sao10k | $0.85 | $0.85 | $0.85 | 66 | 77.6 | 131K |
Speed & cost | xAI | $0.20 | $1.50 | $0.85 | 58 | 68.2 | 256K |
Speed & cost | mancer | $0.75 | $1.00 | $0.88 | 66 | 75.4 | 8K |
Speed & cost | | $0.25 | $1.50 | $0.88 | 62 | 70.9 | 1M |
Open-source | Alibaba Cloud | $0.20 | $1.56 | $0.88 | 82 | 93.4 | 262K |
Open-source | Alibaba Cloud | $0.26 | $1.56 | $0.91 | 82 | 90.1 | 1M |
Speed & cost | arcee | $0.75 | $1.20 | $0.97 | 66 | 67.7 | 131K |
Open-source | Mistral AI | $0.50 | $1.50 | $1.00 | 85 | 85.0 | 262K |
Speed & cost | OpenAI | $0.40 | $1.60 | $1.00 | 80 | 80.0 | 1M |
Speed & cost | morph | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 82K |
Speed & cost | eleutherai | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 4K |
Open-source | alfredpros | $0.80 | $1.20 | $1.00 | 66 | 66.0 | 4K |
Search + citations | Perplexity | $1.00 | $1.00 | $1.00 | 66 | 66.0 | 127K |
Search + citations | nousresearch | $1.00 | $1.00 | $1.00 | 66 | 66.0 | 131K |
Speed & cost | OpenAI | $0.50 | $1.50 | $1.00 | 66 | 66.0 | 16K |
Speed & cost | aion | $0.70 | $1.40 | $1.05 | 66 | 62.9 | 131K |
Speed & cost | relace | $0.85 | $1.25 | $1.05 | 66 | 62.9 | 256K |
Speed & cost | Moonshot AI | $0.38 | $1.72 | $1.05 | 89 | 84.7 | 262K |
Open-source | z-ai | $0.39 | $1.75 | $1.07 | 66 | 61.7 | 203K |
Speed & cost | OpenAI | $0.25 | $2.00 | $1.13 | 83 | 73.8 | 400K |
Speed & cost | bytedance | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 262K |
Speed & cost | bytedance | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 262K |
Code generation | OpenAI | $0.25 | $2.00 | $1.13 | 58 | 51.6 | 400K |
Open-source | Alibaba Cloud | $0.46 | $1.82 | $1.14 | 82 | 72.1 | 131K |
Open-source | z-ai | $0.39 | $1.90 | $1.15 | 66 | 57.6 | 205K |
Open-source | Alibaba Cloud | $0.26 | $2.08 | $1.17 | 82 | 70.1 | 262K |
Open-source | nvidia | $1.20 | $1.20 | $1.20 | 74 | 61.7 | 131K |
Speed & cost | xiaomi | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 262K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 262K |
Speed & cost | Moonshot AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | z-ai | $0.60 | $1.80 | $1.20 | 66 | 55.0 | 66K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | Mistral AI | $0.40 | $2.00 | $1.20 | 66 | 55.0 | 131K |
Open-source | nvidia | $0.60 | $1.80 | $1.20 | 66 | 55.0 | 131K |
Speed & cost | aion | $0.80 | $1.60 | $1.20 | 66 | 55.0 | 131K |
Open-source | aion | $0.80 | $1.60 | $1.20 | 65 | 54.2 | 33K |
Hard reasoning | Moonshot AI | $0.47 | $2.00 | $1.23 | 66 | 53.4 | 131K |
General purpose | deepcogito | $1.25 | $1.25 | $1.25 | 74 | 59.2 | 128K |
Hard reasoning | DeepSeek | $0.45 | $2.15 | $1.30 | 91 | 70.0 | 164K |
Speed & cost | minimax | $0.40 | $2.20 | $1.30 | 66 | 50.8 | 1M |
Open-source | Alibaba Cloud | $0.52 | $2.08 | $1.30 | 66 | 50.8 | 131K |
Open-source | Alibaba Cloud | $0.39 | $2.34 | $1.36 | 82 | 60.1 | 262K |
Image generation | | $0.30 | $2.50 | $1.40 | 80 | 57.1 | 33K |
Speed & cost | | $0.30 | $2.50 | $1.40 | 80 | 57.1 | 1M |
Speed & cost | morph | $0.90 | $1.90 | $1.40 | 66 | 47.1 | 262K |
Speed & cost | Amazon | $0.30 | $2.50 | $1.40 | 58 | 41.4 | 1M |
Open-source | z-ai | $0.60 | $2.20 | $1.40 | 66 | 47.1 | 131K |
Hard reasoning | Alibaba Cloud | $0.26 | $2.60 | $1.43 | 82 | 57.3 | 131K |
Speed & cost | Moonshot AI | $0.57 | $2.30 | $1.43 | 66 | 46.0 | 131K |
Hard reasoning | sao10k | $1.48 | $1.48 | $1.48 | 74 | 50.0 | 8K |
Speed & cost | OpenAI | $0.60 | $2.40 | $1.50 | 66 | 44.0 | 128K |
Speed & cost | OpenAI | $1.00 | $2.00 | $1.50 | 66 | 44.0 | 4K |
Z.ai: GLM 5 OSS Open-source | z-ai | $0.72 | $2.30 | $1.51 | 88 | 58.3 | 80K |
DeepSeek: R1 OSS Hard reasoning | DeepSeek | $0.70 | $2.50 | $1.60 | 91 | 56.9 | 64K |
Speed & cost | | $0.50 | $3.00 | $1.75 | 80 | 45.7 | 1M |
General purpose | OpenAI | $1.50 | $2.00 | $1.75 | 74 | 42.3 | 4K |
Image generation | | $0.50 | $3.00 | $1.75 | 66 | 37.7 | 66K |
xAI: Grok 4.3 New Agentic tasks & real-time info | xAI | $1.25 | $2.50 | $1.88 | 94 | 50.1 | 1M |
Code generation | Alibaba Cloud | $0.65 | $3.25 | $1.95 | 82 | 42.1 | 1M |
Speed & cost | xiaomi | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 1M |
Search + citations | relace | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 256K |
Search + citations | nousresearch | $1.00 | $3.00 | $2.00 | 66 | 33.0 | 131K |
Speed & cost | Amazon | $0.80 | $3.20 | $2.00 | 66 | 33.0 | 300K |
Speed & cost | arcee | $0.90 | $3.30 | $2.10 | 66 | 31.4 | 131K |
Speed & cost | switchpoint | $0.85 | $3.40 | $2.13 | 66 | 31.1 | 131K |
Image generation | OpenAI | $2.50 | $2.00 | $2.25 | 74 | 32.9 | 400K |
Hard reasoning | Alibaba Cloud | $0.78 | $3.90 | $2.34 | 82 | 35.0 | 262K |
Open-source | Alibaba Cloud | $0.78 | $3.90 | $2.34 | 82 | 35.0 | 262K |
Speed & cost | Anthropic | $0.80 | $4.00 | $2.40 | 76 | 31.7 | 200K |
Frontier quality at low cost | Moonshot AI | $0.95 | $4.00 | $2.48 | 93 | 37.6 | 256K |
Open-source | z-ai | $1.20 | $4.00 | $2.60 | 74 | 28.5 | 203K |
Open-source | z-ai | $1.20 | $4.00 | $2.60 | 74 | 28.5 | 203K |
Qwen: Qwen-Max OSS Open-source | Alibaba Cloud | $1.04 | $4.16 | $2.60 | 74 | 28.5 | 33K |
Open-source value leader | DeepSeek | $1.74 | $3.48 | $2.61 | 92 | 35.2 | 1M |
Speed & cost | OpenAI | $0.75 | $4.50 | $2.63 | 83 | 31.6 | 400K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Hard reasoning | OpenAI | $1.10 | $4.40 | $2.75 | 82 | 29.8 | 200K |
Speed & cost | Anthropic | $1.00 | $5.00 | $3.00 | 76 | 25.3 | 200K |
Hard reasoning | sao10k | $3.00 | $3.00 | $3.00 | 74 | 24.7 | 16K |
Open-weight agentic & tool use | z-ai | $1.55 | $4.65 | $3.10 | 90 | 29.0 | 200K |
Speed & cost | writer | $0.60 | $6.00 | $3.30 | 66 | 20.0 | 1M |
General purpose | OpenAI | $3.00 | $4.00 | $3.50 | 74 | 21.1 | 16K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 85 | 21.3 | 131K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 85 | 21.3 | 131K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 85 | 21.3 | 128K |
General purpose | xAI | $2.00 | $6.00 | $4.00 | 74 | 18.5 | 2M |
General purpose | xAI | $2.00 | $6.00 | $4.00 | 93 | 23.3 | 2M |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 74 | 18.5 | 131K |
General purpose | anthracite-org | $3.00 | $5.00 | $4.00 | 74 | 18.5 | 16K |
Open-source | Mistral AI | $2.00 | $6.00 | $4.00 | 72 | 18.0 | 66K |
Deep research | OpenAI | $2.00 | $8.00 | $5.00 | 96 | 19.2 | 200K |
Hard reasoning | OpenAI | $2.00 | $8.00 | $5.00 | 92 | 18.4 | 200K |
General purpose | OpenAI | $2.00 | $8.00 | $5.00 | 89 | 17.8 | 1M |
General purpose | ai21 | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 256K |
Search + citations | Perplexity | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 128K |
Deep research | Perplexity | $2.00 | $8.00 | $5.00 | 74 | 14.8 | 128K |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 128K |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 93 | 16.5 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 90 | 16.0 | 400K |
Speed & cost | | $1.25 | $10.00 | $5.63 | 91 | 16.2 | 1M |
Speed & cost | | $1.25 | $10.00 | $5.63 | 91 | 16.2 | 1M |
Speed & cost | | $1.25 | $10.00 | $5.63 | 91 | 16.2 | 1M |
General purpose | alpindale | $3.75 | $7.50 | $5.63 | 82 | 14.6 | 6K |
Code generation | OpenAI | $1.25 | $10.00 | $5.63 | 74 | 13.2 | 400K |
General purpose | OpenAI | $1.25 | $10.00 | $5.63 | 74 | 13.2 | 128K |
General purpose | aion | $4.00 | $8.00 | $6.00 | 82 | 13.7 | 131K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
Search + citations | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 88 | 14.1 | 128K |
Open-source | Cohere | $2.50 | $10.00 | $6.25 | 84 | 13.4 | 128K |
General purpose | OpenAI | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 128K |
General purpose | Cohere | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 256K |
General purpose | inflection | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 8K |
General purpose | inflection | $2.50 | $10.00 | $6.25 | 74 | 11.8 | 8K |
Speed & cost | | $2.00 | $12.00 | $7.00 | 96 | 13.7 | 1M |
Speed & cost | | $2.00 | $12.00 | $7.00 | 96 | 13.7 | 1M |
Image generation | | $2.00 | $12.00 | $7.00 | 94 | 13.4 | 66K |
General purpose | Amazon | $2.50 | $12.50 | $7.50 | 74 | 9.9 | 1M |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 128K |
Code generation | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
Code generation | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 128K |
General purpose | OpenAI | $1.75 | $14.00 | $7.88 | 93 | 11.8 | 400K |
General purpose | OpenAI | $2.50 | $15.00 | $8.75 | 93 | 10.6 | 1M |
General purpose | xAI | $3.00 | $15.00 | $9.00 | 90 | 10.0 | 131K |
General purpose | xAI | $3.00 | $15.00 | $9.00 | 90 | 10.0 | 131K |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 91 | 10.1 | 1M |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 88 | 9.8 | 1M |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 86 | 9.6 | 200K |
General purpose | Anthropic | $3.00 | $15.00 | $9.00 | 86 | 9.6 | 200K |
Hard reasoning | Anthropic | $3.00 | $15.00 | $9.00 | 86 | 9.6 | 200K |
Search + citations | Perplexity | $3.00 | $15.00 | $9.00 | 74 | 8.2 | 200K |
General purpose | xAI | $3.00 | $15.00 | $9.00 | 74 | 8.2 | 256K |
Search + citations | Perplexity | $3.00 | $15.00 | $9.00 | 74 | 8.2 | 200K |
Multimodal | OpenAI | $10.00 | $10.00 | $10.00 | 88 | 8.8 | 400K |
General purpose | OpenAI | $5.00 | $15.00 | $10.00 | 88 | 8.8 | 128K |
Multimodal | OpenAI | $6.00 | $18.00 | $12.00 | 88 | 7.3 | 128K |
Coding & agentic workflows | Anthropic | $5.00 | $25.00 | $15.00 | 97 | 6.5 | 1M |
General purpose | Anthropic | $5.00 | $25.00 | $15.00 | 95 | 6.3 | 1M |
General purpose | Anthropic | $5.00 | $25.00 | $15.00 | 95 | 6.3 | 200K |
OpenAI: GPT-5.5 New Frontier general purpose | OpenAI | $5.00 | $30.00 | $17.50 | 98 | 5.6 | 1M |
Multimodal | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Complex analysis | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Multimodal | OpenAI | $10.00 | $30.00 | $20.00 | 88 | 4.4 | 128K |
Deep research | OpenAI | $10.00 | $40.00 | $25.00 | 96 | 3.8 | 200K |
Hard reasoning | OpenAI | $15.00 | $60.00 | $37.50 | 88 | 2.3 | 200K |
Multimodal | Anthropic | $15.00 | $75.00 | $45.00 | 94 | 2.1 | 200K |
Multimodal | Anthropic | $15.00 | $75.00 | $45.00 | 94 | 2.1 | 200K |
Complex analysis | OpenAI | $30.00 | $60.00 | $45.00 | 93 | 2.1 | 8K |
Multimodal | OpenAI | $30.00 | $60.00 | $45.00 | 93 | 2.1 | 8K |
Hard reasoning | OpenAI | $20.00 | $80.00 | $50.00 | 96 | 1.9 | 200K |
Complex analysis | OpenAI | $15.00 | $120.00 | $67.50 | 88 | 1.3 | 400K |
Complex analysis | OpenAI | $21.00 | $168.00 | $94.50 | 97 | 1.0 | 400K |
Reasoning at any cost | OpenAI | $30.00 | $180.00 | $105.00 | 99 | 0.9 | 1M |
Complex analysis | OpenAI | $30.00 | $180.00 | $105.00 | 97 | 0.9 | 1M |
Hard reasoning | OpenAI | $150.00 | $600.00 | $375.00 | 93 | 0.2 | 200K |
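The Blended and Value columns are not defined on the page, but the rows are consistent with Blended being the simple average of the input and output prices and Value being Quality divided by that unrounded average. The sketch below checks this reading against two rows and against the 8824x price range in the header. The function names are illustrative, and the $0.017 input price is an assumption taken from the IBM Granite entry in the cost breakdown rather than from the rounded $0.02 shown in this table.

```python
def blended_price(input_per_m: float, output_per_m: float) -> float:
    """Simple average of input and output price (USD per 1M tokens)."""
    return (input_per_m + output_per_m) / 2


def value_score(quality: float, input_per_m: float, output_per_m: float) -> float:
    """Quality points per blended dollar, rounded to one decimal like the table."""
    return round(quality / blended_price(input_per_m, output_per_m), 1)


def price_range(highest_input: float, lowest_input: float) -> int:
    """Ratio of the most expensive to the cheapest input price, rounded."""
    return round(highest_input / lowest_input)


# Mistral AI row: $0.02 in, $0.04 out, Quality 72 (table lists Value 2400.0)
print(value_score(72, 0.02, 0.04))

# ibm-granite row: $0.017 in (displayed as $0.02), $0.11 out, Quality 62
# (table lists Value 976.4)
print(value_score(62, 0.017, 0.11))

# Highest input price in the table is $150.00; $0.017 is the assumed minimum.
print(f"{price_range(150.00, 0.017)}x")
```

Because Value is computed from unrounded prices, two rows that display the same Blended figure can still show slightly different Value scores.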
Estimate Your Monthly Cost
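Enter your typical request shape. Costs below are projected over one month, based on current public list-price API rates. The figures shown here correspond to 500 input tokens and 300 output tokens per request at 100,000 requests per month.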
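The breakdown table's figures are consistent with 500 input tokens and 300 output tokens per request at 100,000 requests per month. Taking that shape as an assumption (it is inferred from the numbers rather than from a documented default), the arithmetic can be sketched as follows. The function and parameter names are illustrative.

```python
TOKENS_PER_MILLION = 1_000_000


def cost_per_request(input_per_m, output_per_m, input_tokens=500, output_tokens=300):
    """USD for one request, given list prices in USD per 1M tokens."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / TOKENS_PER_MILLION


def monthly_cost(input_per_m, output_per_m, requests_per_month=100_000,
                 input_tokens=500, output_tokens=300):
    """USD per month: per-request cost times the monthly request volume."""
    return requests_per_month * cost_per_request(
        input_per_m, output_per_m, input_tokens, output_tokens
    )


# Mistral Nemo at $0.02 in / $0.04 out per 1M tokens
print(f"${cost_per_request(0.02, 0.04):.6f} per request")  # table lists $0.000022
print(f"${monthly_cost(0.02, 0.04):.2f} per month")         # table lists $2.20
```

Changing `input_tokens`, `output_tokens`, or `requests_per_month` rescales every row the same way, so output-heavy workloads shift the ranking toward models with cheap output tokens.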
Cheapest: Mistral: Mistral Nemo, $2.20 per month at this volume
Best value (quality ≥ 80): Qwen: Qwen3 235B A22B Instruct 2507 (Q 82), $6.55 per month at this volume
Most expensive: OpenAI: o1-pro, $25,500 per month at this volume
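The three picks above can be reproduced from per-model prices and Quality scores. The sketch below assumes that "best value" means the lowest monthly total among models with Quality of at least 80, which matches the pick shown but is not stated on the page. It also assumes 500 input and 300 output tokens per request at 100,000 requests per month, and it pairs model names with Quality scores by matching prices between the two tables.

```python
from typing import NamedTuple


class Model(NamedTuple):
    name: str
    input_per_m: float   # USD per 1M input tokens
    output_per_m: float  # USD per 1M output tokens
    quality: int


def monthly_cost(m: Model, requests=100_000, tokens_in=500, tokens_out=300) -> float:
    """USD per month for one model at the assumed request shape."""
    per_request = (tokens_in * m.input_per_m + tokens_out * m.output_per_m) / 1_000_000
    return requests * per_request


# A small sample of rows; Quality values are matched by price and are assumptions.
models = [
    Model("Mistral: Mistral Nemo", 0.02, 0.04, 72),
    Model("Qwen: Qwen3 235B A22B Instruct 2507", 0.071, 0.10, 82),
    Model("Qwen: Qwen3.5-9B", 0.05, 0.15, 82),
    Model("OpenAI: GPT-4o-mini", 0.15, 0.60, 80),
    Model("OpenAI: o1-pro", 150.00, 600.00, 93),
]

cheapest = min(models, key=monthly_cost)
best_value = min((m for m in models if m.quality >= 80), key=monthly_cost)
most_expensive = max(models, key=monthly_cost)

for label, m in [("Cheapest", cheapest), ("Best value", best_value),
                 ("Most expensive", most_expensive)]:
    print(f"{label}: {m.name} (${monthly_cost(m):,.2f}/month)")
```

Raising or lowering the quality floor in the `best_value` filter is the simplest way to trade cost against capability with this approach.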
Save 30-60% with Mixture-of-Routers
Most production traffic is mixed-difficulty. Send the easy 60% to a cheap model and the hard 10% to a frontier model to get comparable quality at a fraction of the cost.
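As a rough illustration of the routing arithmetic, the sketch below computes the monthly bill for a traffic split and compares it with sending everything to one model. The 60% and 10% shares come from the text above. Routing the remaining 30% to a mid-tier model, the choice of models, and the full-traffic monthly totals (taken at an assumed 100,000 requests of 500 input and 300 output tokens) are all assumptions made for illustration.

```python
def routed_monthly_cost(shares: dict[str, float], monthly_costs: dict[str, float]) -> float:
    """Weighted monthly bill when each tier handles a share of the traffic.

    monthly_costs[tier] is what that tier's model would cost if it served
    100% of requests; shares must sum to 1.
    """
    if abs(sum(shares.values()) - 1.0) > 1e-9:
        raise ValueError("traffic shares must sum to 1")
    return sum(share * monthly_costs[tier] for tier, share in shares.items())


def savings_vs(baseline_cost: float, routed_cost: float) -> float:
    """Fractional saving relative to a single-model baseline."""
    return 1 - routed_cost / baseline_cost


# Hypothetical tiers: full-traffic monthly totals in USD.
monthly_costs = {
    "cheap": 2.20,        # e.g. Mistral Nemo in the breakdown table
    "mid": 25.50,         # e.g. GPT-4o-mini in the breakdown table
    "frontier": 1150.00,  # assumed: a $5 in / $30 out model at the same volume
}
shares = {"cheap": 0.60, "mid": 0.30, "frontier": 0.10}  # the 30% mid share is assumed

routed = routed_monthly_cost(shares, monthly_costs)
print(f"routed: ${routed:.2f}/month")
print(f"saving vs all-frontier: {savings_vs(monthly_costs['frontier'], routed):.0%}")
```

The saving is sensitive to the baseline and to how traffic actually splits, so the printed figure is specific to these assumed inputs and should not be read as confirming the 30-60% range.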
Full breakdown by model
Sorted cheapest to most expensive
| Model | Cost / request | Input cost / mo | Output cost / mo | Total / mo |
|---|---|---|---|---|
Mistral: Mistral Nemo $0.02 in / $0.04 out per 1M | $0.000022 | $1.00 | $1.20 | $2.20 |
Google: Gemma 3n 4B $0.02 in / $0.04 out per 1M | $0.000022 | $1.00 | $1.20 | $2.20 |
Meta: Llama 3.1 8B Instruct $0.02 in / $0.05 out per 1M | $0.000025 | $1.00 | $1.50 | $2.50 |
Meta: Llama 3 8B Instruct $0.03 in / $0.04 out per 1M | $0.000027 | $1.50 | $1.20 | $2.70 |
Llama Guard 3 8B $0.02 in / $0.06 out per 1M | $0.000028 | $1.00 | $1.80 | $2.80 |
Sao10K: Llama 3 8B Lunaris $0.04 in / $0.05 out per 1M | $0.000035 | $2.00 | $1.50 | $3.50 |
Meta: Llama 3.2 11B Vision Instruct $0.049 in / $0.049 out per 1M | $0.000039 | $2.45 | $1.47 | $3.92 |
IBM: Granite 4.0 Micro $0.017 in / $0.11 out per 1M | $0.000042 | $0.85 | $3.30 | $4.15 |
Google: Gemma 2 9B $0.03 in / $0.09 out per 1M | $0.000042 | $1.50 | $2.70 | $4.20 |
Qwen: Qwen2.5 Coder 7B Instruct $0.03 in / $0.09 out per 1M | $0.000042 | $1.50 | $2.70 | $4.20 |
Google: Gemma 3 4B $0.04 in / $0.08 out per 1M | $0.000044 | $2.00 | $2.40 | $4.40 |
Mistral: Mistral Small 3.1 24B $0.03 in / $0.11 out per 1M | $0.000048 | $1.50 | $3.30 | $4.80 |
MythoMax 13B $0.06 in / $0.06 out per 1M | $0.000048 | $3.00 | $1.80 | $4.80 |
OpenAI: gpt-oss-20b $0.03 in / $0.11 out per 1M | $0.000048 | $1.50 | $3.30 | $4.80 |
Mistral: Mistral Small 3 $0.05 in / $0.08 out per 1M | $0.000049 | $2.50 | $2.40 | $4.90 |
Qwen: Qwen2.5 7B Instruct $0.04 in / $0.1 out per 1M | $0.000050 | $2.00 | $3.00 | $5.00 |
LiquidAI: LFM2-24B-A2B $0.03 in / $0.12 out per 1M | $0.000051 | $1.50 | $3.60 | $5.10 |
Qwen: Qwen-Turbo $0.0325 in / $0.13 out per 1M | $0.000055 | $1.63 | $3.90 | $5.53 |
Google: Gemma 3 12B $0.04 in / $0.13 out per 1M | $0.000059 | $2.00 | $3.90 | $5.90 |
Amazon: Nova Micro 1.0 $0.035 in / $0.14 out per 1M | $0.000060 | $1.75 | $4.20 | $5.95 |
Cohere: Command R7B (12-2024) $0.0375 in / $0.15 out per 1M | $0.000064 | $1.88 | $4.50 | $6.38 |
Qwen: Qwen3 235B A22B Instruct 2507 $0.071 in / $0.1 out per 1M | $0.000065 | $3.55 | $3.00 | $6.55 |
Arcee AI: Trinity Mini $0.045 in / $0.15 out per 1M | $0.000068 | $2.25 | $4.50 | $6.75 |
NVIDIA: Nemotron Nano 9B V2 $0.04 in / $0.16 out per 1M | $0.000068 | $2.00 | $4.80 | $6.80 |
Qwen: Qwen3.5-9B $0.05 in / $0.15 out per 1M | $0.000070 | $2.50 | $4.50 | $7.00 |
Meta: Llama 3.2 1B Instruct $0.027 in / $0.2 out per 1M | $0.000073 | $1.35 | $6.00 | $7.35 |
Microsoft: Phi 4 $0.065 in / $0.14 out per 1M | $0.000075 | $3.25 | $4.20 | $7.45 |
OpenAI: gpt-oss-120b $0.039 in / $0.19 out per 1M | $0.000077 | $1.95 | $5.70 | $7.65 |
Reka Edge $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
Mistral: Ministral 3 3B 2512 $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
Z.ai: GLM 4 32B $0.1 in / $0.1 out per 1M | $0.000080 | $5.00 | $3.00 | $8.00 |
NVIDIA: Nemotron 3 Nano 30B A3B $0.05 in / $0.2 out per 1M | $0.000085 | $2.50 | $6.00 | $8.50 |
AllenAI: Olmo 2 32B Instruct $0.05 in / $0.2 out per 1M | $0.000085 | $2.50 | $6.00 | $8.50 |
Google: Gemma 3 27B $0.08 in / $0.16 out per 1M | $0.000088 | $4.00 | $4.80 | $8.80 |
Mistral: Mistral Small 3.2 24B $0.075 in / $0.2 out per 1M | $0.000097 | $3.75 | $6.00 | $9.75 |
Qwen: Qwen3 14B $0.06 in / $0.24 out per 1M | $0.000102 | $3.00 | $7.20 | $10.20 |
Amazon: Nova Lite 1.0 $0.06 in / $0.24 out per 1M | $0.000102 | $3.00 | $7.20 | $10.20 |
ByteDance: UI-TARS 7B $0.1 in / $0.2 out per 1M | $0.000110 | $5.00 | $6.00 | $11.00 |
Reka Flash 3 $0.1 in / $0.2 out per 1M | $0.000110 | $5.00 | $6.00 | $11.00 |
Qwen: Qwen3.5-Flash $0.065 in / $0.26 out per 1M | $0.000111 | $3.25 | $7.80 | $11.05 |
Qwen: Qwen3 32B $0.08 in / $0.24 out per 1M | $0.000112 | $4.00 | $7.20 | $11.20 |
Mistral: Mistral 7B Instruct v0.1 $0.11 in / $0.19 out per 1M | $0.000112 | $5.50 | $5.70 | $11.20 |
NousResearch: Hermes 2 Pro - Llama-3 8B $0.14 in / $0.14 out per 1M | $0.000112 | $7.00 | $4.20 | $11.20 |
Qwen: Qwen3 Coder 30B A3B Instruct $0.07 in / $0.27 out per 1M | $0.000116 | $3.50 | $8.10 | $11.60 |
Baidu: ERNIE 4.5 21B A3B Thinking $0.07 in / $0.28 out per 1M | $0.000119 | $3.50 | $8.40 | $11.90 |
Baidu: ERNIE 4.5 21B A3B $0.07 in / $0.28 out per 1M | $0.000119 | $3.50 | $8.40 | $11.90 |
EssentialAI: Rnj 1 Instruct $0.15 in / $0.15 out per 1M | $0.000120 | $7.50 | $4.50 | $12.00 |
Mistral: Ministral 3 8B 2512 $0.15 in / $0.15 out per 1M | $0.000120 | $7.50 | $4.50 | $12.00 |
Qwen: Qwen3 30B A3B $0.08 in / $0.28 out per 1M | $0.000124 | $4.00 | $8.40 | $12.40 |
Google: Gemini 2.0 Flash Lite $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
ByteDance Seed: Seed 1.6 Flash $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
OpenAI: gpt-oss-safeguard-20b $0.075 in / $0.3 out per 1M | $0.000128 | $3.75 | $9.00 | $12.75 |
Meta: Llama 3.2 3B Instruct $0.051 in / $0.34 out per 1M | $0.000128 | $2.55 | $10.20 | $12.75 |
Meta: Llama 4 Scout $0.08 in / $0.3 out per 1M | $0.000130 | $4.00 | $9.00 | $13.00 |
Xiaomi: MiMo-V2-Flash $0.09 in / $0.29 out per 1M | $0.000132 | $4.50 | $8.70 | $13.20 |
Qwen: Qwen3 30B A3B Instruct 2507 $0.09 in / $0.3 out per 1M | $0.000135 | $4.50 | $9.00 | $13.50 |
Mistral: Mistral Small Creative $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
StepFun: Step 3.5 Flash $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Mistral: Voxtral Small 24B 2507 $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Mistral: Devstral Small 1.1 $0.1 in / $0.3 out per 1M | $0.000140 | $5.00 | $9.00 | $14.00 |
Arcee AI: Spotlight $0.18 in / $0.18 out per 1M | $0.000144 | $9.00 | $5.40 | $14.40 |
Meta: Llama Guard 4 12B $0.18 in / $0.18 out per 1M | $0.000144 | $9.00 | $5.40 | $14.40 |
Qwen: Qwen3 8B $0.05 in / $0.4 out per 1M | $0.000145 | $2.50 | $12.00 | $14.50 |
OpenAI: GPT-5 Nano $0.05 in / $0.4 out per 1M | $0.000145 | $2.50 | $12.00 | $14.50 |
Meta: Llama 3.3 70B Instruct $0.1 in / $0.32 out per 1M | $0.000146 | $5.00 | $9.60 | $14.60 |
Z.ai: GLM 4.7 Flash $0.06 in / $0.4 out per 1M | $0.000150 | $3.00 | $12.00 | $15.00 |
DeepSeek: DeepSeek V4 Flash $0.14 in / $0.28 out per 1M | $0.000154 | $7.00 | $8.40 | $15.40 |
Qwen: Qwen3 30B A3B Thinking 2507 $0.08 in / $0.4 out per 1M | $0.000160 | $4.00 | $12.00 | $16.00 |
Mistral: Ministral 3 14B 2512 $0.2 in / $0.2 out per 1M | $0.000160 | $10.00 | $6.00 | $16.00 |
Google: Gemini 2.5 Flash Lite Preview 09-2025 $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Google: Gemini 2.5 Flash Lite $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Google: Gemini 2.0 Flash $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
OpenAI: GPT-4.1 Nano $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
ByteDance Seed: Seed-2.0-Mini $0.1 in / $0.4 out per 1M | $0.000170 | $5.00 | $12.00 | $17.00 |
Qwen: Qwen3 VL 32B Instruct $0.104 in / $0.416 out per 1M | $0.000177 | $5.20 | $12.48 | $17.68 |
Qwen2.5 72B Instruct $0.12 in / $0.39 out per 1M | $0.000177 | $6.00 | $11.70 | $17.70 |
Tongyi DeepResearch 30B A3B $0.09 in / $0.45 out per 1M | $0.000180 | $4.50 | $13.50 | $18.00 |
Google: Gemma 4 26B A4B $0.13 in / $0.4 out per 1M | $0.000185 | $6.50 | $12.00 | $18.50 |
Nous: Hermes 4 70B $0.13 in / $0.4 out per 1M | $0.000185 | $6.50 | $12.00 | $18.50 |
Qwen: Qwen3 VL 8B Instruct $0.08 in / $0.5 out per 1M | $0.000190 | $4.00 | $15.00 | $19.00 |
Google: Gemma 4 31B $0.14 in / $0.4 out per 1M | $0.000190 | $7.00 | $12.00 | $19.00 |
Qwen: Qwen VL Plus $0.1365 in / $0.4095 out per 1M | $0.000191 | $6.83 | $12.29 | $19.11 |
NVIDIA: Nemotron 3 Super $0.1 in / $0.5 out per 1M | $0.000200 | $5.00 | $15.00 | $20.00 |
TheDrummer: Rocinante 12B $0.17 in / $0.43 out per 1M | $0.000214 | $8.50 | $12.90 | $21.40 |
Nex AGI: DeepSeek V3.1 Nex N1 $0.135 in / $0.5 out per 1M | $0.000218 | $6.75 | $15.00 | $21.75 |
Qwen: Qwen3 VL 30B A3B Instruct $0.13 in / $0.52 out per 1M | $0.000221 | $6.50 | $15.60 | $22.10 |
AllenAI: Olmo 3 32B Think $0.15 in / $0.5 out per 1M | $0.000225 | $7.50 | $15.00 | $22.50 |
DeepSeek: R1 Distill Qwen 32B $0.29 in / $0.29 out per 1M | $0.000232 | $14.50 | $8.70 | $23.20 |
Baidu: ERNIE 4.5 VL 28B A3B $0.14 in / $0.56 out per 1M | $0.000238 | $7.00 | $16.80 | $23.80 |
Nous: Hermes 3 70B Instruct $0.3 in / $0.3 out per 1M | $0.000240 | $15.00 | $9.00 | $24.00 |
Tencent: Hunyuan A13B Instruct $0.14 in / $0.57 out per 1M | $0.000241 | $7.00 | $17.10 | $24.10 |
DeepSeek: DeepSeek V3.2 $0.26 in / $0.38 out per 1M | $0.000244 | $13.00 | $11.40 | $24.40 |
Qwen: QwQ 32B $0.15 in / $0.58 out per 1M | $0.000249 | $7.50 | $17.40 | $24.90 |
xAI: Grok 4.1 Fast $0.2 in / $0.5 out per 1M | $0.000250 | $10.00 | $15.00 | $25.00 |
xAI: Grok 4 Fast $0.2 in / $0.5 out per 1M | $0.000250 | $10.00 | $15.00 | $25.00 |
Meta: Llama 4 Maverick $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
OpenAI: GPT-4o-mini Search Preview $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
OpenAI: GPT-4o-mini (2024-07-18) $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
OpenAI: GPT-4o-mini $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Mistral: Mistral Small 4 $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Upstage: Solar Pro 3 $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
Cohere: Command R (08-2024) $0.15 in / $0.6 out per 1M | $0.000255 | $7.50 | $18.00 | $25.50 |
DeepSeek: DeepSeek V3.2 Exp $0.27 in / $0.41 out per 1M | $0.000258 | $13.50 | $12.30 | $25.80 |
NVIDIA: Nemotron Nano 12B 2 VL $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
AllenAI: Olmo 3.1 32B Instruct $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Qwen: Qwen2.5 VL 32B Instruct $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Mistral: Saba $0.2 in / $0.6 out per 1M | $0.000280 | $10.00 | $18.00 | $28.00 |
Qwen: Qwen3 Next 80B A3B Thinking $0.0975 in / $0.78 out per 1M | $0.000283 | $4.88 | $23.40 | $28.28 |
Qwen: Qwen3 Coder Next $0.12 in / $0.75 out per 1M | $0.000285 | $6.00 | $22.50 | $28.50 |
DeepSeek: DeepSeek V3.1 $0.15 in / $0.75 out per 1M | $0.000300 | $7.50 | $22.50 | $30.00 |
xAI: Grok 3 Mini $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
xAI: Grok 3 Mini Beta $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
TheDrummer: Cydonia 24B V4.1 $0.3 in / $0.5 out per 1M | $0.000300 | $15.00 | $15.00 | $30.00 |
Meta: Llama 3.1 70B Instruct $0.4 in / $0.4 out per 1M | $0.000320 | $20.00 | $12.00 | $32.00 |
TheDrummer: UnslopNemo 12B $0.4 in / $0.4 out per 1M | $0.000320 | $20.00 | $12.00 | $32.00 |
Z.ai: GLM 4.5 Air $0.13 in / $0.85 out per 1M | $0.000320 | $6.50 | $25.50 | $32.00 |
DeepSeek: DeepSeek V3 0324 $0.2 in / $0.77 out per 1M | $0.000331 | $10.00 | $23.10 | $33.10 |
Meituan: LongCat Flash Chat $0.2 in / $0.8 out per 1M | $0.000340 | $10.00 | $24.00 | $34.00 |
DeepSeek: DeepSeek V3.1 Terminus $0.21 in / $0.79 out per 1M | $0.000342 | $10.50 | $23.70 | $34.20 |
Inception: Mercury 2 $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Inception: Mercury $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
Inception: Mercury Coder $0.25 in / $0.75 out per 1M | $0.000350 | $12.50 | $22.50 | $35.00 |
MiniMax: MiniMax M2.5 $0.118 in / $0.99 out per 1M | $0.000356 | $5.90 | $29.70 | $35.60 |
Qwen: Qwen3 VL 235B A22B Instruct $0.2 in / $0.88 out per 1M | $0.000364 | $10.00 | $26.40 | $36.40 |
Qwen: Qwen Plus 0728 (thinking) $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Qwen: Qwen Plus 0728 $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Qwen: Qwen-Plus $0.26 in / $0.78 out per 1M | $0.000364 | $13.00 | $23.40 | $36.40 |
Arcee AI: Trinity Large Thinking $0.22 in / $0.85 out per 1M | $0.000365 | $11.00 | $25.50 | $36.50 |
Qwen: Qwen3 Next 80B A3B Instruct $0.09 in / $1.1 out per 1M | $0.000375 | $4.50 | $33.00 | $37.50 |
Qwen: Qwen3 Coder Flash $0.195 in / $0.975 out per 1M | $0.000390 | $9.75 | $29.25 | $39.00 |
Qwen: Qwen3 Coder 480B A35B $0.22 in / $1 out per 1M | $0.000410 | $11.00 | $30.00 | $41.00 |
Mistral: Codestral 2508 $0.3 in / $0.9 out per 1M | $0.000420 | $15.00 | $27.00 | $42.00 |
ReMM SLERP 13B $0.45 in / $0.65 out per 1M | $0.000420 | $22.50 | $19.50 | $42.00 |
MiniMax: MiniMax M2.1 $0.27 in / $0.95 out per 1M | $0.000420 | $13.50 | $28.50 | $42.00 |
Z.ai: GLM 4.6V $0.3 in / $0.9 out per 1M | $0.000420 | $15.00 | $27.00 | $42.00 |
DeepSeek: DeepSeek V3 $0.32 in / $0.89 out per 1M | $0.000427 | $16.00 | $26.70 | $42.70 |
MiniMax: MiniMax M2 $0.255 in / $1 out per 1M | $0.000427 | $12.75 | $30.00 | $42.75 |
Prime Intellect: INTELLECT-3 $0.2 in / $1.1 out per 1M | $0.000430 | $10.00 | $33.00 | $43.00 |
MiniMax: MiniMax-01 $0.2 in / $1.1 out per 1M | $0.000430 | $10.00 | $33.00 | $43.00 |
Mistral: Mixtral 8x7B Instruct $0.54 in / $0.54 out per 1M | $0.000432 | $27.00 | $16.20 | $43.20 |
Qwen: Qwen3 VL 8B Thinking $0.117 in / $1.365 out per 1M | $0.000468 | $5.85 | $40.95 | $46.80 |
Baidu: ERNIE 4.5 300B A47B $0.28 in / $1.1 out per 1M | $0.000470 | $14.00 | $33.00 | $47.00 |
Qwen: Qwen3.5-35B-A3B $0.1625 in / $1.3 out per 1M | $0.000471 | $8.13 | $39.00 | $47.13 |
OpenAI: GPT-5.4 Nano $0.2 in / $1.25 out per 1M | $0.000475 | $10.00 | $37.50 | $47.50 |
Meta: Llama 3 70B Instruct $0.51 in / $0.74 out per 1M | $0.000477 | $25.50 | $22.20 | $47.70 |
TNG: DeepSeek R1T2 Chimera $0.3 in / $1.1 out per 1M | $0.000480 | $15.00 | $33.00 | $48.00 |
Arcee AI: Coder Large $0.5 in / $0.8 out per 1M | $0.000490 | $25.00 | $24.00 | $49.00 |
WizardLM-2 8x22B $0.62 in / $0.62 out per 1M | $0.000496 | $31.00 | $18.60 | $49.60 |
Anthropic: Claude 3 Haiku $0.25 in / $1.25 out per 1M | $0.000500 | $12.50 | $37.50 | $50.00 |
Kwaipilot: KAT-Coder-Pro V2 $0.3 in / $1.2 out per 1M | $0.000510 | $15.00 | $36.00 | $51.00 |
MiniMax: MiniMax M2.7 $0.3 in / $1.2 out per 1M | $0.000510 | $15.00 | $36.00 | $51.00 |
MiniMax: MiniMax M2-her $0.3 in / $1.2 out per 1M | $0.000510 | $15.00 | $36.00 | $51.00 |
TheDrummer: Skyfall 36B V2 $0.55 in / $0.8 out per 1M | $0.000515 | $27.50 | $24.00 | $51.50 |
Google: Gemma 2 27B $0.65 in / $0.65 out per 1M | $0.000520 | $32.50 | $19.50 | $52.00 |
Qwen: Qwen3 235B A22B Thinking 2507 $0.1495 in / $1.495 out per 1M | $0.000523 | $7.47 | $44.85 | $52.33 |
Qwen: Qwen3 VL 30B A3B Thinking $0.13 in / $1.56 out per 1M | $0.000533 | $6.50 | $46.80 | $53.30 |
Sao10K: Llama 3.3 Euryale 70B $0.65 in / $0.75 out per 1M | $0.000550 | $32.50 | $22.50 | $55.00 |
xAI: Grok Code Fast 1 $0.2 in / $1.5 out per 1M | $0.000550 | $10.00 | $45.00 | $55.00 |
DeepSeek: DeepSeek V3.2 Speciale $0.4 in / $1.2 out per 1M | $0.000560 | $20.00 | $36.00 | $56.00 |
Qwen: Qwen3.5-27B $0.195 in / $1.56 out per 1M | $0.000566 | $9.75 | $46.80 | $56.55 |
Google: Gemini 3.1 Flash Lite Preview $0.25 in / $1.5 out per 1M | $0.000575 | $12.50 | $45.00 | $57.50 |
Baidu: ERNIE 4.5 VL 424B A47B $0.42 in / $1.25 out per 1M | $0.000585 | $21.00 | $37.50 | $58.50 |
DeepSeek: R1 Distill Llama 70B $0.7 in / $0.8 out per 1M | $0.000590 | $35.00 | $24.00 | $59.00 |
Qwen: Qwen3.5 Plus 2026-02-15 $0.26 in / $1.56 out per 1M | $0.000598 | $13.00 | $46.80 | $59.80 |
Qwen2.5 Coder 32B Instruct $0.66 in / $1 out per 1M | $0.000630 | $33.00 | $30.00 | $63.00 |
Qwen: Qwen2.5 VL 72B Instruct $0.8 in / $0.8 out per 1M | $0.000640 | $40.00 | $24.00 | $64.00 |
Mancer: Weaver (alpha) $0.75 in / $1 out per 1M | $0.000675 | $37.50 | $30.00 | $67.50 |
OpenAI: GPT-4.1 Mini $0.4 in / $1.6 out per 1M | $0.000680 | $20.00 | $48.00 | $68.00 |
Sao10K: Llama 3.1 Euryale 70B v2.2 $0.85 in / $0.85 out per 1M | $0.000680 | $42.50 | $25.50 | $68.00 |
Mistral: Mistral Large 3 2512 $0.5 in / $1.5 out per 1M | $0.000700 | $25.00 | $45.00 | $70.00 |
OpenAI: GPT-3.5 Turbo $0.5 in / $1.5 out per 1M | $0.000700 | $25.00 | $45.00 | $70.00 |
MoonshotAI: Kimi K2.5 $0.3827 in / $1.72 out per 1M | $0.000707 | $19.13 | $51.60 | $70.73 |
Z.ai: GLM 4.7 $0.39 in / $1.75 out per 1M | $0.000720 | $19.50 | $52.50 | $72.00 |
OpenAI: GPT-5 Mini $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
ByteDance Seed: Seed-2.0-Lite $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
ByteDance Seed: Seed 1.6 $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
OpenAI: GPT-5.1-Codex-Mini $0.25 in / $2 out per 1M | $0.000725 | $12.50 | $60.00 | $72.50 |
Arcee AI: Virtuoso Large $0.75 in / $1.2 out per 1M | $0.000735 | $37.50 | $36.00 | $73.50 |
Qwen: Qwen3.5-122B-A10B $0.26 in / $2.08 out per 1M | $0.000754 | $13.00 | $62.40 | $75.40 |
Morph: Morph V3 Fast $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
EleutherAI: Llemma 7b $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
AlfredPros: CodeLLaMa 7B Instruct Solidity $0.8 in / $1.2 out per 1M | $0.000760 | $40.00 | $36.00 | $76.00 |
Z.ai: GLM 4.6 $0.39 in / $1.9 out per 1M | $0.000765 | $19.50 | $57.00 | $76.50 |
AionLabs: Aion-1.0-Mini $0.7 in / $1.4 out per 1M | $0.000770 | $35.00 | $42.00 | $77.00 |
Qwen: Qwen3 235B A22B $0.455 in / $1.82 out per 1M | $0.000773 | $22.75 | $54.60 | $77.35 |
Xiaomi: MiMo-V2-Omni $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Devstral 2 2512 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Relace: Relace Apply 3 $0.85 in / $1.25 out per 1M | $0.000800 | $42.50 | $37.50 | $80.00 |
MoonshotAI: Kimi K2 0905 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Mistral Medium 3.1 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Devstral Medium $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Mistral: Mistral Medium 3 $0.4 in / $2 out per 1M | $0.000800 | $20.00 | $60.00 | $80.00 |
Perplexity: Sonar $1 in / $1 out per 1M | $0.000800 | $50.00 | $30.00 | $80.00 |
Nous: Hermes 3 405B Instruct $1 in / $1 out per 1M | $0.000800 | $50.00 | $30.00 | $80.00 |
MoonshotAI: Kimi K2 Thinking $0.47 in / $2 out per 1M | $0.000835 | $23.50 | $60.00 | $83.50 |
Z.ai: GLM 4.5V $0.6 in / $1.8 out per 1M | $0.000840 | $30.00 | $54.00 | $84.00 |
NVIDIA: Llama 3.1 Nemotron Ultra 253B v1 $0.6 in / $1.8 out per 1M | $0.000840 | $30.00 | $54.00 | $84.00 |
MiniMax: MiniMax M1 $0.4 in / $2.2 out per 1M | $0.000860 | $20.00 | $66.00 | $86.00 |
DeepSeek: R1 0528 $0.45 in / $2.15 out per 1M | $0.000870 | $22.50 | $64.50 | $87.00 |
AionLabs: Aion-2.0 $0.8 in / $1.6 out per 1M | $0.000880 | $40.00 | $48.00 | $88.00 |
AionLabs: Aion-RP 1.0 (8B) $0.8 in / $1.6 out per 1M | $0.000880 | $40.00 | $48.00 | $88.00 |
Qwen: Qwen VL Max $0.52 in / $2.08 out per 1M | $0.000884 | $26.00 | $62.40 | $88.40 |
Qwen: Qwen3.5 397B A17B $0.39 in / $2.34 out per 1M | $0.000897 | $19.50 | $70.20 | $89.70 |
Google: Nano Banana (Gemini 2.5 Flash Image) $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Google: Gemini 2.5 Flash $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Amazon: Nova 2 Lite $0.3 in / $2.5 out per 1M | $0.000900 | $15.00 | $75.00 | $90.00 |
Qwen: Qwen3 VL 235B A22B Thinking $0.26 in / $2.6 out per 1M | $0.000910 | $13.00 | $78.00 | $91.00 |
NVIDIA: Llama 3.1 Nemotron 70B Instruct $1.2 in / $1.2 out per 1M | $0.000960 | $60.00 | $36.00 | $96.00 |
Z.ai: GLM 4.5 $0.6 in / $2.2 out per 1M | $0.000960 | $30.00 | $66.00 | $96.00 |
MoonshotAI: Kimi K2 0711 $0.57 in / $2.3 out per 1M | $0.000975 | $28.50 | $69.00 | $97.50 |
Deep Cogito: Cogito v2.1 671B $1.25 in / $1.25 out per 1M | $0.001000 | $62.50 | $37.50 | $100.00 |
OpenAI: GPT Audio Mini $0.6 in / $2.4 out per 1M | $0.001020 | $30.00 | $72.00 | $102.00 |
Morph: Morph V3 Large $0.9 in / $1.9 out per 1M | $0.001020 | $45.00 | $57.00 | $102.00 |
Z.ai: GLM 5 $0.72 in / $2.3 out per 1M | $0.001050 | $36.00 | $69.00 | $105.00 |
DeepSeek: R1 $0.7 in / $2.5 out per 1M | $0.001100 | $35.00 | $75.00 | $110.00 |
OpenAI: GPT-3.5 Turbo (older v0613) $1 in / $2 out per 1M | $0.001100 | $50.00 | $60.00 | $110.00 |
Google: Gemini 3 Flash Preview $0.5 in / $3 out per 1M | $0.001150 | $25.00 | $90.00 | $115.00 |
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) $0.5 in / $3 out per 1M | $0.001150 | $25.00 | $90.00 | $115.00 |
Sao10k: Llama 3 Euryale 70B v2.1 $1.48 in / $1.48 out per 1M | $0.001184 | $74.00 | $44.40 | $118.40 |
Qwen: Qwen3 Coder Plus $0.65 in / $3.25 out per 1M | $0.001300 | $32.50 | $97.50 | $130.00 |
OpenAI: GPT-3.5 Turbo Instruct $1.5 in / $2 out per 1M | $0.001350 | $75.00 | $60.00 | $135.00 |
Amazon: Nova Pro 1.0 $0.8 in / $3.2 out per 1M | $0.001360 | $40.00 | $96.00 | $136.00 |
xAI: Grok 4.3 $1.25 in / $2.5 out per 1M | $0.001375 | $62.50 | $75.00 | $137.50 |
Xiaomi: MiMo-V2-Pro $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Relace: Relace Search $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Nous: Hermes 4 405B $1 in / $3 out per 1M | $0.001400 | $50.00 | $90.00 | $140.00 |
Arcee AI: Maestro Reasoning $0.9 in / $3.3 out per 1M | $0.001440 | $45.00 | $99.00 | $144.00 |
Switchpoint Router $0.85 in / $3.4 out per 1M | $0.001445 | $42.50 | $102.00 | $144.50 |
Qwen: Qwen3 Max Thinking $0.78 in / $3.9 out per 1M | $0.001560 | $39.00 | $117.00 | $156.00 |
Qwen: Qwen3 Max $0.78 in / $3.9 out per 1M | $0.001560 | $39.00 | $117.00 | $156.00 |
Anthropic: Claude 3.5 Haiku $0.8 in / $4 out per 1M | $0.001600 | $40.00 | $120.00 | $160.00 |
MoonshotAI: Kimi K2.6 $0.95 in / $4 out per 1M | $0.001675 | $47.50 | $120.00 | $167.50 |
OpenAI: GPT-5.4 Mini $0.75 in / $4.5 out per 1M | $0.001725 | $37.50 | $135.00 | $172.50 |
Qwen: Qwen-Max $1.04 in / $4.16 out per 1M | $0.001768 | $52.00 | $124.80 | $176.80 |
Z.ai: GLM 5V Turbo $1.2 in / $4 out per 1M | $0.001800 | $60.00 | $120.00 | $180.00 |
Z.ai: GLM 5 Turbo $1.2 in / $4 out per 1M | $0.001800 | $60.00 | $120.00 | $180.00 |
OpenAI: GPT-5 Image Mini $2.5 in / $2 out per 1M | $0.001850 | $125.00 | $60.00 | $185.00 |
OpenAI: o4 Mini High $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o4 Mini $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o3 Mini High $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
OpenAI: o3 Mini $1.1 in / $4.4 out per 1M | $0.001870 | $55.00 | $132.00 | $187.00 |
DeepSeek: DeepSeek V4 Pro $1.74 in / $3.48 out per 1M | $0.001914 | $87.00 | $104.40 | $191.40 |
Anthropic: Claude Haiku 4.5 $1 in / $5 out per 1M | $0.002000 | $50.00 | $150.00 | $200.00 |
Writer: Palmyra X5 $0.6 in / $6 out per 1M | $0.002100 | $30.00 | $180.00 | $210.00 |
Z.ai: GLM 5.1 $1.55 in / $4.65 out per 1M | $0.002170 | $77.50 | $139.50 | $217.00 |
Sao10K: Llama 3.1 70B Hanami x1 $3 in / $3 out per 1M | $0.002400 | $150.00 | $90.00 | $240.00 |
OpenAI: GPT-3.5 Turbo 16k $3 in / $4 out per 1M | $0.002700 | $150.00 | $120.00 | $270.00 |
Mistral Large 2411 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral Large 2407 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral Large $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
xAI: Grok 4.20 Multi-Agent $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
xAI: Grok 4.20 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral: Pixtral Large 2411 $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Mistral: Mixtral 8x22B Instruct $2 in / $6 out per 1M | $0.002800 | $100.00 | $180.00 | $280.00 |
Magnum v4 72B $3 in / $5 out per 1M | $0.003000 | $150.00 | $150.00 | $300.00 |
OpenAI: o4 Mini Deep Research $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
OpenAI: o3 $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
OpenAI: GPT-4.1 $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
AI21: Jamba Large 1.7 $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
Perplexity: Sonar Reasoning Pro $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
Perplexity: Sonar Deep Research $2 in / $8 out per 1M | $0.003400 | $100.00 | $240.00 | $340.00 |
OpenAI: GPT-5.1-Codex-Max $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1 Chat $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5.1-Codex $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro Preview 06-05 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Google: Gemini 2.5 Pro Preview 05-06 $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 Codex $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
OpenAI: GPT-5 Chat $1.25 in / $10 out per 1M | $0.003625 | $62.50 | $300.00 | $362.50 |
Goliath 120B $3.75 in / $7.5 out per 1M | $0.004125 | $187.50 | $225.00 | $412.50 |
OpenAI: GPT-4o Audio $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o Search Preview $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o (2024-11-20) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o (2024-08-06) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT-4o $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Cohere: Command R+ (08-2024) $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
OpenAI: GPT Audio $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Cohere: Command A $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Inflection: Inflection 3 Pi $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
Inflection: Inflection 3 Productivity $2.5 in / $10 out per 1M | $0.004250 | $125.00 | $300.00 | $425.00 |
AionLabs: Aion-1.0 $4 in / $8 out per 1M | $0.004400 | $200.00 | $240.00 | $440.00 |
Google: Gemini 3.1 Pro Preview Custom Tools $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Google: Gemini 3.1 Pro Preview $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) $2 in / $12 out per 1M | $0.004600 | $100.00 | $360.00 | $460.00 |
Amazon: Nova Premier 1.0 $2.5 in / $12.5 out per 1M | $0.005000 | $125.00 | $375.00 | $500.00 |
OpenAI: GPT-5.3 Chat $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.3-Codex $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2-Codex $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2 Chat $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.2 $1.75 in / $14 out per 1M | $0.005075 | $87.50 | $420.00 | $507.50 |
OpenAI: GPT-5.4 $2.5 in / $15 out per 1M | $0.005750 | $125.00 | $450.00 | $575.00 |
xAI: Grok 3 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
xAI: Grok 3 Beta $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude Sonnet 4.6 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude Sonnet 4.5 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude Sonnet 4 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude 3.7 Sonnet $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Anthropic: Claude 3.7 Sonnet (thinking) $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Perplexity: Sonar Pro Search $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
xAI: Grok 4 $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
Perplexity: Sonar Pro $3 in / $15 out per 1M | $0.006000 | $150.00 | $450.00 | $600.00 |
OpenAI: GPT-4o (2024-05-13) $5 in / $15 out per 1M | $0.007000 | $250.00 | $450.00 | $700.00 |
OpenAI: GPT-5 Image $10 in / $10 out per 1M | $0.008000 | $500.00 | $300.00 | $800.00 |
OpenAI: GPT-4o (extended) $6 in / $18 out per 1M | $0.008400 | $300.00 | $540.00 | $840.00 |
Anthropic: Claude Opus 4.7 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
Anthropic: Claude Opus 4.6 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
Anthropic: Claude Opus 4.5 $5 in / $25 out per 1M | $0.0100 | $250.00 | $750.00 | $1000.00 |
OpenAI: GPT-5.5 $5 in / $30 out per 1M | $0.0115 | $250.00 | $900.00 | $1150.00 |
OpenAI: GPT-4 Turbo $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: GPT-4 Turbo Preview $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: GPT-4 Turbo (older v1106) $10 in / $30 out per 1M | $0.0140 | $500.00 | $900.00 | $1400.00 |
OpenAI: o3 Deep Research $10 in / $40 out per 1M | $0.0170 | $500.00 | $1200.00 | $1700.00 |
OpenAI: o1 $15 in / $60 out per 1M | $0.0255 | $750.00 | $1800.00 | $2550.00 |
Anthropic: Claude Opus 4.1 $15 in / $75 out per 1M | $0.0300 | $750.00 | $2250.00 | $3000.00 |
Anthropic: Claude Opus 4 $15 in / $75 out per 1M | $0.0300 | $750.00 | $2250.00 | $3000.00 |
OpenAI: GPT-4 (older v0314) $30 in / $60 out per 1M | $0.0330 | $1500.00 | $1800.00 | $3300.00 |
OpenAI: GPT-4 $30 in / $60 out per 1M | $0.0330 | $1500.00 | $1800.00 | $3300.00 |
OpenAI: o3 Pro $20 in / $80 out per 1M | $0.0340 | $1000.00 | $2400.00 | $3400.00 |
OpenAI: GPT-5 Pro $15 in / $120 out per 1M | $0.0435 | $750.00 | $3600.00 | $4350.00 |
OpenAI: GPT-5.2 Pro $21 in / $168 out per 1M | $0.0609 | $1050.00 | $5040.00 | $6090.00 |
OpenAI: GPT-5.5 Pro $30 in / $180 out per 1M | $0.0690 | $1500.00 | $5400.00 | $6900.00 |
OpenAI: GPT-5.4 Pro $30 in / $180 out per 1M | $0.0690 | $1500.00 | $5400.00 | $6900.00 |
OpenAI: o1-pro $150 in / $600 out per 1M | $0.2550 | $7500.00 | $18000.00 | $25500.00 |
List-price estimate. Real bills typically run 1.3-1.7x higher after retries, system-prompt re-sends, and tool-call round-trips. See per-million-tokens true cost for the adders.
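The arithmetic behind each row can be sketched as follows. The rows above appear consistent with a workload of 100,000 requests per month at 500 input and 300 output tokens per request; that workload is inferred from the numbers rather than stated on the page, so treat the defaults below as assumptions. The function names are illustrative only.

```python
def request_cost(in_price, out_price, in_tokens=500, out_tokens=300):
    """List-price cost in USD of one request; prices are USD per 1M tokens."""
    return (in_tokens * in_price + out_tokens * out_price) / 1_000_000


def monthly_cost(in_price, out_price, requests=100_000, in_tokens=500, out_tokens=300):
    """Return (input, output, total) monthly list-price cost in USD."""
    input_cost = requests * in_tokens * in_price / 1_000_000
    output_cost = requests * out_tokens * out_price / 1_000_000
    return input_cost, output_cost, input_cost + output_cost


# GPT-4o-mini row: $0.15 in / $0.6 out per 1M
per_request = request_cost(0.15, 0.60)
input_cost, output_cost, total = monthly_cost(0.15, 0.60)
# Reproduces the table row: $0.000255 | $7.50 | $18.00 | $25.50
print(f"${per_request:.6f} | ${input_cost:.2f} | ${output_cost:.2f} | ${total:.2f}")

# Apply the 1.3x to 1.7x real-bill range from the note above.
low, high = total * 1.3, total * 1.7
print(f"Likely real bill: ${low:.2f} to ${high:.2f}")
```

Changing `requests`, `in_tokens`, or `out_tokens` to match your own traffic gives a closer estimate than the default workload.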
Understanding AI API Pricing in 2026
AI model pricing has changed dramatically. Since GPT-4 launched in March 2023 at $30 per million input tokens, prices have fallen by over 90%, driven by competition from Anthropic, Google, and open-source challengers like DeepSeek and Meta's Llama.
Even between two mainstream models the gap is 150x: Google's Gemini 2.0 Flash costs $0.10 per 1M input tokens, while Claude Opus 4 costs $15 per 1M input tokens, and the full table above spans a wider range still. The key insight is that price doesn't always correlate with quality: DeepSeek V3 earns a quality score of 86 at just $0.27/1M tokens, while some premium models charge 50x more for marginal quality gains.
How to Optimize AI API Costs
The most effective strategy is model routing: sending simple queries to cheap, fast models and complex queries to premium models. A gateway like Swfte Connect automates this, typically reducing costs by 30-60% without sacrificing quality.
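A minimal sketch of the routing idea, assuming a toy keyword-and-length heuristic. The `COMPLEX_HINTS` list, the `route` function, and the 400-character threshold are hypothetical and are not how any particular gateway decides; the two list prices come from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    in_price: float   # USD per 1M input tokens
    out_price: float  # USD per 1M output tokens

    def cost(self, in_tokens: int, out_tokens: int) -> float:
        """List-price cost in USD of one request."""
        return (in_tokens * self.in_price + out_tokens * self.out_price) / 1_000_000


# List prices taken from the pricing table above.
CHEAP = Model("GPT-4o-mini", 0.15, 0.60)
PREMIUM = Model("Claude Sonnet 4.5", 3.00, 15.00)

# Hypothetical markers of a "complex" request; tune these for your own traffic.
COMPLEX_HINTS = ("prove", "refactor", "step by step", "analyze")


def route(prompt: str, max_simple_chars: int = 400) -> Model:
    """Toy heuristic: long prompts, or prompts containing a hint, go to the premium model."""
    text = prompt.lower()
    if len(prompt) > max_simple_chars or any(hint in text for hint in COMPLEX_HINTS):
        return PREMIUM
    return CHEAP


def routing_savings(simple_share: float, in_tokens: int = 500, out_tokens: int = 300) -> float:
    """Fraction saved versus sending every request to the premium model."""
    premium = PREMIUM.cost(in_tokens, out_tokens)
    cheap = CHEAP.cost(in_tokens, out_tokens)
    blended = simple_share * cheap + (1 - simple_share) * premium
    return 1 - blended / premium


print(route("What is the capital of France?").name)
print(route("Refactor this module and explain each change step by step").name)
print(f"{routing_savings(0.5):.1%} saved when half of traffic is simple")
```

With this pair of models, the saving is driven almost entirely by the share of traffic that can safely go to the cheaper model, so measuring that share on real requests matters more than the exact heuristic.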
Other strategies include using cached input pricing (offered by providers such as Google and DeepSeek), batching requests to reduce per-call overhead, and self-hosting open-source models for predictable workloads.
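To see how much cached input pricing can matter, here is a rough sketch of the effective input price when part of each prompt is billed at a cached rate. The 75% discount and 80% cached share are hypothetical placeholders; actual cached rates and eligibility rules vary by provider and should be checked against each provider's pricing page.

```python
def effective_input_price(list_price, cached_share, cache_discount):
    """Average USD per 1M input tokens when part of the prompt is billed at a cached rate.

    cached_share:   fraction of input tokens served from cache (0..1)
    cache_discount: fraction taken off the list price for cached tokens (0..1)
    """
    if not (0 <= cached_share <= 1 and 0 <= cache_discount <= 1):
        raise ValueError("cached_share and cache_discount must be between 0 and 1")
    cached_price = list_price * (1 - cache_discount)
    return cached_share * cached_price + (1 - cached_share) * list_price


# Illustrative only: $0.30 list price (Gemini 2.5 Flash in the table above),
# 80% of each prompt is a repeated system prompt, hypothetical 75% cached discount.
price = effective_input_price(0.30, cached_share=0.8, cache_discount=0.75)
print(f"${price:.3f} per 1M input tokens")
```

The benefit grows with the share of repeated context, which is why long, stable system prompts are the usual target for caching.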
Pricing Trends to Watch
- Price compression continues: Expect another 50%+ reduction across flagship models by end of 2026
- Reasoning premium: Models with extended thinking (o3, R1) cost more due to higher compute per request
- Open-source pressure: Llama 4 and DeepSeek are forcing closed providers to cut prices faster
- Cached pricing: More providers are offering discounted rates for repeated context
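The reasoning premium in the list above often shows up in token counts rather than list prices: in the table, o3 and GPT-4.1 both list at $2 in / $8 out per 1M. A rough sketch, assuming thinking tokens are billed at the output rate (check each provider's billing documentation) and using a hypothetical 2,000 reasoning tokens per request:

```python
def request_cost(in_price, out_price, in_tokens, out_tokens, reasoning_tokens=0):
    """USD cost of one request; reasoning tokens are assumed to be billed as output."""
    billed_output = out_tokens + reasoning_tokens
    return (in_tokens * in_price + billed_output * out_price) / 1_000_000


# Both models list at $2 in / $8 out per 1M in the table above.
plain = request_cost(2.0, 8.0, in_tokens=500, out_tokens=300)
# Hypothetical: the reasoning model spends 2,000 extra tokens thinking.
reasoning = request_cost(2.0, 8.0, in_tokens=500, out_tokens=300, reasoning_tokens=2_000)

print(f"plain: ${plain:.4f}  reasoning: ${reasoning:.4f}  ratio: {reasoning / plain:.1f}x")
```

Under these assumptions the same list price yields a bill several times larger per request, so comparing reasoning models on per-token price alone can be misleading.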