AI Model Directory

Detailed specs, pricing, and benchmarks for every major AI model.

OpenAI

GPT-4o

OpenAI's flagship multimodal model with vision, code generation, and function calling. Excellent all-round performance.

85

Quality

$6.25

Blended/1M

109

tok/s

Best for: General purpose
OpenAI

GPT-4o Mini

Fast and affordable small model for lightweight tasks and high-throughput use cases.

72

Quality

$0.38

Blended/1M

183

tok/s

Best for: High throughput
OpenAI

o3 Mini

OpenAI's compact reasoning model with extended thinking capabilities for complex problem solving.

88

Quality

$2.75

Blended/1M

155

tok/s

Best for: Reasoning & math
OpenAI

o3

OpenAI's most powerful reasoning model. State-of-the-art on MATH, coding, and science benchmarks.

96

Quality

$25.00

Blended/1M

68

tok/s

Best for: Hard reasoning
OpenAI

GPT-4.1

OpenAI's latest flagship with 1M token context, improved instruction following and coding.

89

Quality

$5.00

Blended/1M

120

tok/s

Best for: Long context
Anthropic

Claude Opus 4

Anthropic's most capable model. Excels at complex analysis, nuanced writing, and extended agentic tasks.

95

Quality

$45.00

Blended/1M

52

tok/s

Best for: Complex analysis
Anthropic

Claude Sonnet 4

Anthropic's balanced model with excellent coding and reasoning. Best price-to-performance ratio.

88

Quality

$9.00

Blended/1M

95

tok/s

Best for: Coding & balance
Anthropic

Claude 3.5 Haiku

Anthropic's fastest model. Ultra-low latency for real-time applications and high-volume tasks.

75

Quality

$2.40

Blended/1M

172

tok/s

Best for: Speed & cost
Google

Gemini 2.5 Pro

Google's thinking model with native tool use, 1M context window, and strong multimodal capabilities.

92

Quality

$5.63

Blended/1M

87

tok/s

Best for: Multimodal + value
Google

Gemini 2.0 Flash

Google's fastest model. Optimized for speed and efficiency with strong coding and reasoning.

74

Quality

$0.25

Blended/1M

244

tok/s

Best for: Fastest + cheapest
Meta
OSS

Llama 4 Maverick

Meta's mixture-of-experts model with 17B active parameters and 128 experts. Strong multimodal and multilingual performance.

80

Quality

$0.40

Blended/1M

135

tok/s

Best for: Open-source value
Meta
OSS

Llama 4 Scout

Meta's efficient MoE model with 16 experts. 10M token context window and strong multilingual support.

71

Quality

$0.28

Blended/1M

198

tok/s

Best for: Longest context
Mistral AI

Mistral Large 2

Mistral's flagship 123B model with strong multilingual and coding performance. Supports 128K context.

79

Quality

$4.00

Blended/1M

78

tok/s

Best for: Multilingual
Mistral AI

Codestral

Mistral's dedicated code model. Optimized for code generation, completion, and review across 80+ languages.

76

Quality

$0.60

Blended/1M

195

tok/s

Best for: Code generation
DeepSeek
OSS

DeepSeek V3

671B MoE model with 37B active parameters. Outstanding price-performance ratio and coding ability.

86

Quality

$0.69

Blended/1M

62

tok/s

Best for: Best open-source value
DeepSeek
OSS

DeepSeek R1

DeepSeek's reasoning model. Competitive with o3 on math and coding at a fraction of the cost.

91

Quality

$1.37

Blended/1M

35

tok/s

Best for: Cheap reasoning
xAI

Grok 3

xAI's flagship model with strong reasoning and real-time information access. Trained on the Colossus cluster.

87

Quality

$9.00

Blended/1M

82

tok/s

Best for: Real-time info
xAI

Grok 3 Mini

xAI's efficient reasoning model with thinking capabilities at a lower cost point.

78

Quality

$0.40

Blended/1M

165

tok/s

Best for: Budget reasoning
Cohere

Command R+

Cohere's flagship for enterprise RAG. Optimized for retrieval-augmented generation and tool use.

68

Quality

$6.25

Blended/1M

72

tok/s

Best for: Enterprise RAG
Amazon

Amazon Nova Pro

Amazon's capable multimodal model. Strong balance of accuracy, speed, and cost for diverse tasks.

70

Quality

$2.00

Blended/1M

110

tok/s

Best for: AWS ecosystem
Alibaba Cloud
OSS

Qwen 2.5 72B

Alibaba's flagship open-source model. Competitive with GPT-4o class models on benchmarks at a fraction of the cost.

80

Quality

$0.60

Blended/1M

85

tok/s

Best for: Open-source flagship
Alibaba Cloud
OSS

Qwen 2.5 Coder 32B

Specialized coding model from Alibaba. Top open-source code model on HumanEval and SWE-Bench.

74

Quality

$0.30

Blended/1M

125

tok/s

Best for: Open-source coding
Perplexity

Sonar Pro

Perplexity's search-augmented model. Combines LLM reasoning with real-time web search and citations.

78

Quality

$9.00

Blended/1M

65

tok/s

Best for: Search + citations