AI Foundation Models

Definition of AI Foundation Models

AI foundation models are large-scale machine learning or deep learning models trained on vast datasets, enabling them to be adapted for a wide range of tasks across various domains, such as text, images, audio, and more.[9]

Model Comparison Table

Below is a comparison of top AI foundation models based on key benchmarks like GPQA (Graduate-Level Google-Proof Q&A) and SWE-bench (Software Engineering tasks). Data sourced from recent leaderboards as of 2025.

Rank Model Developer GPQA (%) SWE-bench (%)
1 Gemini 3 Pro Google 91.9 76.2
2 Claude Opus 4.5 Anthropic 87.0 80.9
3 GPT-5.2 OpenAI 92.4 80.0
4 Gemini 3 Flash Google 90.4 78.0
5 GLM-4.7 Zhipu AI 85.7 73.8
6 MiniMax M2.1 MiniMax 81.0 67.0
7 GLM-4.6 Zhipu AI 81.0 68.0
8 MiMo-V2-Flash Xiaomi 83.7 73.4
9 Gemini 2.5 Pro Google 83.0 63.2
10 GLM-4.5 Zhipu AI 79.1 64.2
11 Claude Opus 4 Anthropic 79.6 72.5
12 DeepSeek-V3.2-Speciale DeepSeek - 73.1
13 Claude Opus 4.1 Anthropic 80.9 74.5
19 GPT-5 OpenAI 85.7 74.9
- Grok 4 xAI 87.0 -

Prominent AI Foundation Models

Below is a curated list of notable foundation models, categorized by primary modality. This includes large language models (LLMs), vision models, multimodal models, and others. The list draws from various sources and focuses on well-known examples, including their developers where available.

Language Models
Vision Models
Multimodal Models
Other Modalities (e.g., Audio, Robotics, Specialized)

For a more exhaustive list, repositories like Awesome-Foundation-Models on GitHub provide hundreds of models with paper references.[5][12] Rankings, such as Forrester's top 10, highlight models like Google Gemini as leaders based on offering, strategy, and market presence.[2][11]