/ THE FRONTIER MODELS INDEX™ · VOLUME I · Q2 2026

Which AI model should you actually use?

A rating system for every major frontier AI model. 59 models rated across six capability dimensions — General Reasoning, Code Generation, Math & STEM, Tool Use & Agency, Multimodal, and Safety & Alignment. Sourced from public benchmarks (SWE-Bench, GPQA-Diamond, AIME, MMLU-Pro, TAU-bench, HarmBench), red-team disclosures, and FASCIA's proprietary evaluation suite. Updated quarterly.

59 models rated 6 capability dimensions 30+ benchmark sources Q2 2026 edition

Grade

Model

Score

Capability

Claude Opus 4.6

Frontier · Anthropic · Anthropic · Apr 2026

Claude Sonnet 4.6

Frontier · Anthropic · Anthropic · Mar 2026

Claude Haiku 4.5

Frontier · Anthropic · Anthropic · Q1 2026

Frontier · OpenAI · OpenAI · Sep 2025

Frontier · OpenAI · OpenAI · Sep 2025

Reasoning · OpenAI · OpenAI · Jan 2025

Reasoning · OpenAI · OpenAI · Jan 2025

Reasoning · OpenAI · OpenAI · Apr 2025

Frontier · Google · Google DeepMind · Q1 2026

Gemini 2.5 Flash

Frontier · Google · Google DeepMind · Q1 2026

Frontier · xAI · xAI · Jul 2025

Frontier · xAI · xAI · Feb 2025

Llama 4 Behemoth

Open-Weights · Meta · Meta AI · Apr 2025

Llama 4 Maverick

Open-Weights · Meta · Meta AI · Apr 2025

Open-Weights · Meta · Meta AI · Apr 2025

Open-Weights · Meta · Meta AI · Dec 2024

Open-Weights · Meta · Meta AI · Jul 2024

Mistral Large 2

Frontier · Mistral · Mistral AI · Jul 2024

Mistral Medium 3

Frontier · Mistral · Mistral AI · May 2025

Open-Weights · Mistral · Mistral AI · Apr 2024

Reasoning · DeepSeek · DeepSeek · Jan 2025

Open-Weights · DeepSeek · DeepSeek · Dec 2024

Open-Weights · DeepSeek · DeepSeek · Q1 2026

Open-Weights · Alibaba · Alibaba · Q1 2026

Specialty · Code · Alibaba · Q3 2024

Enterprise · Cohere · Cohere · Apr 2024

Enterprise · Cohere · Cohere · Mar 2024

Jamba 1.5 Large

Architecture · AI21 · AI21 Labs · Aug 2024

Small Model · Microsoft · Microsoft Research · Dec 2024

Small Model · Microsoft · Microsoft Research · Aug 2024

Frontier · Amazon · Amazon · Dec 2024

Frontier · Amazon · Amazon · Dec 2024

Frontier · Reka · Reka AI · Apr 2024

Frontier · Moonshot · Moonshot AI · Q2 2025

Frontier · ByteDance · ByteDance · 2024 (multiple updates)

Frontier · 01.AI · 01.AI (Kai-Fu Lee) · May 2024

GitHub Copilot (current generation)

Specialty · Code · GitHub / OpenAI · 2021+ continuous

Cursor (Composer model)

Specialty · Code · Anysphere · 2023+ continuous

Specialty · Code · Replit · 2024+

Midjourney V6 / V7

Multimodal · Image · Midjourney · V6 Dec 2023; V7 2025

Multimodal · Image · OpenAI · Oct 2023

Multimodal · Image · Black Forest Labs · Aug 2024

Stable Diffusion 3.5

Multimodal · Image · Stability AI · Oct 2024

Multimodal · Video · OpenAI · Dec 2024 (public)

Multimodal · Video · Google DeepMind · 2025

Runway Gen-3 / Gen-4

Multimodal · Video · Runway · Gen-3 Jun 2024; Gen-4 2025

ElevenLabs (V3)

Multimodal · Audio / TTS · ElevenLabs · V3 2024

Multimodal · Music · Suno · V3 2024; V4 2024-2025

Multimodal · Music · Uncharted Labs · Apr 2024

Inflection 2.5 (Deprecated)

Deprecated · Inflection AI (Microsoft acquihire) · Mar 2024

Stability AI base models (Pre-2025)

At-Risk · Stability AI · Multiple 2022-2024

Perplexity Sonar

Specialty · Search · Perplexity · 2024 continuous

Multimodal · Mistral · Mistral AI · Nov 2024

Open-Weights · Tencent · Tencent · Nov 2024

Open-Weights · Sea Group · Sea Group AI · 2024

Open-Weights · Cohere · Cohere for AI · Oct 2024

Open-Weights · Google · Google DeepMind · Q1 2026

Open-Weights · TII · Technology Innovation Institute (UAE) · Dec 2024

Open-Weights · AI2 · Allen Institute for AI · Nov 2024

The right model scales. The wrong one breaks.

Every AI lab claims their model is the best. The benchmarks tell a different story. The Frontier Models Index makes the actual capability visible — so engineering teams, procurement, and AI strategy leaders can pick models that ship, and avoid the ones that fail in production.

See the methodology Institutional API