OpenAI |
GPT-1
|
June, 2018
|
Active
|
Trained with BookCorpus 4.5 GB of text, from 7,000 unpublished books of various genres.
|
|
GPT-2
|
February, 2019
|
Active
|
Trained with WebText: 40 GB of text, 8 million documents, from 45 million webpages upvoted on Reddit.
|
|
GPT-3
|
May, 2020
|
Active
|
Trained with 499 billion tokens consisting of CommonCrawl (570 GB), WebText, English Wikipedia, and two books corpora
(Books1 and Books2)
|
|
GPT-3.5
|
March, 2022
|
Active
|
Undisclosed
|
|
GPT-4
|
March, 2023
|
Active
|
Undisclosed
|
|
GPT-4o
|
May, 2024
|
Active
|
Undisclosed
|
|
GPT-4.5
|
February, 2025
|
Active
|
Undisclosed
|
|
GPT-4.1
|
April, 2025
|
Active
|
Undisclosed
|
|
GPT-5
|
August, 2025
|
Active
|
Undisclosed
|
|
GPT-5.1
|
November, 2025
|
Active
|
Incremental GPT-5 update listed in the ChatGPT model timeline on Wikipedia.
|
|
GPT-5.2
|
December, 2025
|
Active
|
Follow-up GPT-5 release listed in the ChatGPT model timeline on Wikipedia.
|
|
GPT-5.3
|
2026
|
Active
|
GPT-5 branch update listed in the ChatGPT model list on Wikipedia.
|
|
GPT-5.3-Codex
|
2026
|
Active
|
Coding-specialized GPT-5.3 variant listed on the ChatGPT page.
|
|
GPT-5.4
|
March, 2026
|
Active
|
Enterprise-focused GPT-5 update listed in the ChatGPT timeline on Wikipedia.
|
|
GPT-5.5
|
2026
|
Active
|
Most capable GPT-5 branch release listed as the current engine on Wikipedia.
|
|
GPT-4 Turbo
|
November, 2023
|
Discontinued
|
Earlier lower-cost GPT-4 generation that preceded GPT-4.1 and GPT-4o families.
|
|
GPT-4o mini
|
July, 2024
|
Active
|
Cost-efficient GPT-4o variant for high-throughput and latency-sensitive use cases.
|
|
o1-preview
|
September, 2024
|
Discontinued
|
Early reasoning-model preview that introduced explicit long-thought style inference.
|
|
o1-mini
|
September, 2024
|
Active
|
Smaller reasoning-focused model optimized for lower cost.
|
|
o1
|
December, 2024
|
Active
|
Production reasoning model for complex coding, math and multi-step planning.
|
|
o3-mini
|
January, 2025
|
Active
|
Fast and efficient reasoning model for routine analytical workloads.
|
|
o3
|
2025
|
Active
|
Higher-capability reasoning model tier for difficult multi-step tasks.
|
|
o4-mini
|
2025
|
Active
|
Latest mini reasoning family focused on strong capability at low cost and latency.
|
|
gpt-oss-20b
|
2026
|
Active
|
Open-weight OpenAI model listed in Amazon Bedrock model cards.
|
|
gpt-oss-120b
|
2026
|
Active
|
Larger open-weight OpenAI model listed in Amazon Bedrock model cards.
|
Google |
LaMDA
|
May, 2022
|
Active
|
Currently, LaMDA is not available to the public but is accessible to select developers for testing and refinement.
|
|
Bard
|
March, 2023
|
Discontinued
|
Google's first experimental chatbot service based on LaMDA
|
|
PaLM
|
April, 2022
|
Discontinued
|
Pathways Language Model family that preceded PaLM 2 and Gemini.
|
|
PaLM 2
|
May, 2023
|
Discontinued
|
Successor to PaLM with stronger multilingual, reasoning and coding capabilities.
|
|
Gemini 1.0 Nano
|
December, 2023
|
Discontinued
|
Designed for on-device tasks and first available in Google's Pixel 8 Pro
|
|
Gemini 1.0 Pro
|
December, 2023
|
Discontinued
|
Designed for a diverse range of tasks
|
|
Gemini 1.0 Ultra
|
February, 2024
|
Discontinued
|
Google's most powerful offering in the Gemini 1.0 family
|
|
Gemini 1.5 Pro
|
February, 2024
|
Discontinued
|
As a successor to the 1.0 series of models, 1.5 Pro offers significantly increased context size
(up to 1 million tokens). It is designed to be the most capable model in the Gemini 1.5 family.
|
|
Gemini 1.5 Flash
|
May, 2024
|
Discontinued
|
Faster 1.5 variant announced at Google I/O 2024 for lower latency and cost.
|
|
Gemini 2.0 Flash
|
January, 2025
|
Active
|
Developed by Google with a focus on multimodality, agentic capabilities, and speed
|
|
Gemini 2.0 Flash Thinking
|
December, 2024
|
Active
|
Reasoning-oriented Gemini variant designed for longer multi-step problem solving.
|
|
Gemini 2.0 Pro (Experimental)
|
February, 2025
|
Active
|
Experimental higher-capability Gemini 2.0 tier for advanced tasks.
|
|
Gemini 2.0 Flash-Lite
|
February, 2025
|
Active
|
First-ever Gemini Flash-Lite model designed for cost-efficiency and speed
|
|
Gemini 2.5 Pro
|
March, 2025
|
Active
|
Introduced first as 2.5 Pro Experimental and later made generally available in June 2025.
|
|
Gemini 2.5 Flash
|
April, 2025
|
Active
|
Default Gemini model announced at I/O 2025, optimized for speed with strong multimodal capability.
|
|
Gemini 2.5 Flash-Lite
|
June, 2025
|
Active
|
Cost-efficient Flash variant introduced in the June 2025 Gemini 2.5 family expansion.
|
|
Gemini 2.5 Flash Image (Nano Banana)
|
August, 2025
|
Active
|
Image generation/editing model publicly released as "Nano Banana" and later identified as Gemini 2.5 Flash Image.
|
|
Gemma
|
February, 2024
|
Active
|
Lightweight open model family released by Google for local and research usage.
|
|
Gemma 2
|
June, 2024
|
Active
|
Improved open model generation with stronger quality and efficiency.
|
|
Gemma 3
|
2025
|
Active
|
Latest Gemma family iteration focused on stronger reasoning and multimodal support.
|
|
Gemini 3 Pro
|
November, 2025
|
Active
|
New Gemini 3 flagship generation that replaced 2.5 Pro in Google announcements.
|
|
Gemini 3 Deep Think
|
November, 2025
|
Active
|
Reasoning-focused Gemini 3 variant based on the 2.5 Pro Deep Think mode.
|
|
Gemini 3 Flash
|
December, 2025
|
Active
|
Frontier-speed Gemini 3 model optimized for low latency and high throughput.
|
|
Gemini 3.1 Pro
|
February, 2026
|
Active
|
Incremental Pro update for more complex reasoning and enterprise workloads.
|
|
Gemini 3.1 Flash-Lite
|
March, 2026
|
Active
|
Flash-Lite 3.1 release targeted at intelligence at scale with low-cost serving.
|
|
Gemini Robotics
|
March, 2025
|
Active
|
Vision-language-action model built on Gemini 2.0 for robotics control tasks.
|
|
Gemma 4
|
April, 2026
|
Active
|
Gemma generation designed for stronger reasoning and agentic workflows.
|
Meta |
OPT
|
May, 2022
|
Non-commercial
|
GPT-3 architecture with some adaptations from Megatron. Uniquely, the training logbook written by the team was published.
Corpus size 180 billion tokens.
|
|
Galactica
|
November, 2022
|
Discontinued
|
Research model family focused on scientific text generation; withdrawn shortly after release.
|
|
Llama 1
|
February, 2023
|
Discontinued
|
Corpus size 1.4 trillion tokens
|
|
Chameleon
|
June, 2024
|
Active
|
Early-fusion multimodal model architecture from Meta Research for text+image generation.
|
|
Llama 2
|
July, 2023
|
Discontinued
|
Corpus size 2 trillion tokens
|
|
Code Llama
|
August, 2023
|
Discontinued
|
Code-specialized Llama 2 derivative tuned for code completion and infilling.
|
|
Llama 3
|
April, 2024
|
Active
|
Corpus size 15 trillion tokens
|
|
Llama 3.1
|
July, 2024
|
Active
|
Corpus size 15.6 trillion tokens
|
|
Llama 3.2
|
September, 2024
|
Active
|
Corpus size 9 trillion tokens
|
|
Llama 3.3
|
December, 2024
|
Active
|
Corpus size 15 trillion tokens
|
|
Llama 4
|
April, 2025
|
Active
|
Corpus size 40 trillion tokens
|
|
Llama 4 Maverick
|
April, 2025
|
Active
|
Llama 4 released variant (17B active parameters, 128-expert MoE) listed as stable on Wikipedia.
|
|
Llama 4 Scout
|
April, 2025
|
Active
|
Llama 4 released variant (17B active parameters, 16-expert MoE) with very long context support.
|
|
Llama 4 Behemoth
|
2025
|
Announced
|
Announced by Meta but not publicly released at the time Scout and Maverick shipped.
|
Microsoft |
Copilot
|
September, 2023
|
Active
|
Based on Microsoft's Prometheus model, which is based on OpenAI's GPT-4 series
|
|
Phi-1
|
June, 2023
|
Discontinued
|
Small language model (1.3B) trained with textbook-quality data.
|
|
Phi-1.5
|
September, 2023
|
Discontinued
|
Improved 1.3B small model with stronger reasoning and coding than Phi-1.
|
|
Phi-2
|
December, 2023
|
Active
|
1.4T tokens, 2.7B parameters, strong benchmark performance for its size.
|
|
Phi-3
|
April, 2024
|
Active
|
Phi-3-mini, Phi-3-small, Phi-3-medium and Phi-3-vision.
|
|
Phi-3.5
|
August, 2024
|
Active
|
Mid-cycle Phi update with stronger long-context and multilingual performance.
|
|
Phi-4
|
December, 2024
|
Active
|
Next-generation Phi family tuned for reasoning and agent workloads.
|
|
Phi-4-mini
|
2025
|
Active
|
Lower-latency, lower-cost Phi-4 variant for edge and high-throughput scenarios.
|
|
Phi-4-multimodal
|
2025
|
Active
|
Multimodal Phi-4 variant supporting image+text understanding and generation.
|
|
Phi-4-reasoning
|
2025
|
Active
|
Reasoning-focused Phi-4 variant optimized for multi-step analytical tasks.
|
|
MAI-Thinking-1
|
June, 2026
|
Active
|
Microsoft AI flagship reasoning model. Medium-sized model that ranks among the strongest in its weight class, matching leading models on key software engineering benchmarks, showing advanced mathematical reasoning, and preferred over Claude Sonnet 4.6 in blind side-by-side human evaluations. Trained from the ground up on clean data without distillation from third-party models.
|
|
MAI-Code-1-Flash
|
June, 2026
|
Active
|
Inference-efficient agentic coding model tailored for and deeply integrated into GitHub Copilot, VS Code and the Microsoft stack. With 5 billion active parameters, it is positioned as comparable to Claude Haiku-class models at lower cost.
|
|
MAI-Image-2.5
|
June, 2026
|
Active
|
Image generation and editing model supporting world-class text-to-image and image editing tasks. Microsoft states it surpasses the Arena score of Nano Banana Pro.
|
|
MAI-Image-2.5-Flash
|
June, 2026
|
Active
|
Ultra-efficient Flash variant of MAI-Image-2.5 for lower-cost text-to-image and image editing workloads.
|
|
MAI-Transcribe-1.5
|
June, 2026
|
Active
|
Speech-to-text model described by Microsoft as state-of-the-art in transcription accuracy, five times faster than competing models, with built-in support for domain-specific terminology across 43 languages.
|
|
MAI-Voice-2
|
June, 2026
|
Active
|
High-quality multilingual speech generation model across 15 languages, capable of adapting to a voice from a short sample and including safeguards against misuse.
|
|
MAI-Voice-2-Flash
|
June, 2026
|
Announced
|
Lower-cost, ultra-efficient upcoming variant of MAI-Voice-2 announced as coming soon.
|
Anthropic |
Claude 1
|
March, 2023
|
Discontinued
|
First public Claude generation focused on helpful and safer assistant behavior.
|
|
Claude 2
|
July, 2023
|
Discontinued
|
Major capability and context window increase over Claude 1.
|
|
Claude Instant 1.2
|
2023
|
Discontinued
|
Lower-latency Claude 2-era variant used for fast and lower-cost responses.
|
|
Claude 2.1
|
November, 2023
|
Discontinued
|
Improved reliability and reduced hallucinations.
|
|
Claude 3 (Haiku, Sonnet, Opus)
|
March, 2024
|
Active
|
Multimodal family balancing speed, cost and reasoning quality.
|
|
Claude 3.5 Sonnet
|
June, 2024
|
Active
|
Strong coding and agentic performance with better latency/cost profile.
|
|
Claude 3.5 Haiku
|
October, 2024
|
Active
|
Fast and cost-efficient model for high-throughput use cases.
|
|
Claude 3.7 Sonnet
|
February, 2025
|
Active
|
Hybrid reasoning model with improved long-horizon task execution.
|
|
Claude Sonnet 4
|
May, 2025
|
Active
|
New generation tuned for coding, tool use and production workflows.
|
|
Claude Opus 4
|
May, 2025
|
Active
|
Highest-capability Claude tier for complex reasoning and agent tasks.
|
|
Claude Opus 4.1
|
April, 2026
|
Active
|
Incremental Opus 4 update listed in the Claude model timeline on Wikipedia.
|
|
Claude Haiku 4.5
|
October, 2025
|
Active
|
Haiku branch update focused on speed and cost efficiency.
|
|
Claude Sonnet 4.5
|
2025
|
Active
|
Sonnet branch update listed in the model versions table on Wikipedia.
|
|
Claude Opus 4.5
|
2025
|
Active
|
Opus branch update listed in the model versions table on Wikipedia.
|
|
Claude Sonnet 4.6
|
February, 2026
|
Active
|
Sonnet branch update listed as a stable release in the Wikipedia infobox.
|
|
Claude Opus 4.6
|
2026
|
Active
|
Opus branch update listed in the model versions table on Wikipedia.
|
|
Claude Opus 4.7
|
April, 2026
|
Active
|
Latest Opus stable release listed in the Wikipedia infobox timeline.
|
|
Claude Opus 4.8
|
2026
|
Active
|
Newer Opus release listed in Amazon Bedrock model cards.
|
|
Claude Mythos Preview
|
2026
|
Preview
|
Preview Claude model listed in Amazon Bedrock model cards.
|
|
Claude Fable 5
|
9 June 2026
|
Active
|
A Mythos-class model that we've made safe for general use.
|
|
Claude Mythos 5 (preview)
|
9 June 2026
|
Limited availability
|
Ready for a small group of cyberdefenders and infrastructure providers. It's the same underlying model as Fable 5, but with the safeguards lifted in some areas.
|
Mistral AI |
Mistral 7B
|
September, 2023
|
Active
|
Open-weights base model that accelerated the open model ecosystem.
|
|
Mixtral 8x7B
|
December, 2023
|
Active
|
Sparse MoE model with strong quality/latency tradeoff.
|
|
Mixtral 8x22B
|
April, 2024
|
Active
|
Larger MoE model for higher reasoning and generation quality.
|
|
Mistral Large 2
|
July, 2024
|
Active
|
Flagship frontier model offered through API and cloud partners.
|
|
Pixtral 12B
|
September, 2024
|
Active
|
Multimodal model line for image + text tasks.
|
|
Mistral Large (24.02)
|
February, 2024
|
Discontinued
|
First Mistral Large release for enterprise and multilingual/coding tasks.
|
|
Mistral Small
|
February, 2024
|
Discontinued
|
Smaller companion model launched alongside Mistral Large (24.02).
|
|
Codestral 22B
|
May, 2024
|
Discontinued
|
First Mistral code-focused open-weight model family.
|
|
Mathstral 7B
|
July, 2024
|
Active
|
STEM and mathematical reasoning-focused model.
|
|
Codestral Mamba 7B
|
July, 2024
|
Active
|
Code model variant built on Mamba architecture for longer-context generation.
|
|
Ministral 3B
|
October, 2024
|
Active
|
Small dense model in the Ministral line.
|
|
Ministral 8B
|
October, 2024
|
Active
|
Larger dense Ministral variant for stronger capability at moderate cost.
|
|
Pixtral Large (24.11)
|
November, 2024
|
Active
|
Large multimodal model combining a visual encoder with Mistral Large 2.
|
|
Mistral Large 2 (24.11)
|
November, 2024
|
Active
|
Refreshed Mistral Large 2 release in the model timeline.
|
|
Mistral Small 3
|
January, 2025
|
Discontinued
|
24B-parameter small model generation preceding 3.1 and 3.2 refreshes.
|
|
Codestral 25.01
|
January, 2025
|
Discontinued
|
Codestral model refresh for coding workloads.
|
|
Mistral Small 3.1
|
March, 2025
|
Discontinued
|
Smaller and more efficient successor to Mistral Small 3.
|
|
Devstral Small (25.05)
|
May, 2025
|
Discontinued
|
Early Devstral branch for agentic coding tasks.
|
|
Mistral Small 3.2
|
June, 2025
|
Discontinued
|
Refresh release of Mistral Small 3.1.
|
|
Devstral Medium 1.0
|
July, 2025
|
Active
|
Agentic coding model in the Devstral lineup.
|
|
Devstral Small 1.1 (25.07)
|
July, 2025
|
Active
|
Updated small Devstral variant for coding and tool use.
|
|
Codestral 25.08
|
August, 2025
|
Active
|
Updated Codestral release in the enterprise coding stack.
|
|
Mistral Large 3
|
December, 2025
|
Active
|
Sparse MoE flagship generation succeeding Mistral Large 2.
|
|
Ministral 3
|
December, 2025
|
Active
|
Dense small-model family (3B/8B/14B) released with Mistral Large 3.
|
|
Devstral 2
|
December, 2025
|
Active
|
New Devstral generation focused on stronger coding performance.
|
|
Devstral Small 2
|
December, 2025
|
Active
|
Compact coding model released alongside Devstral 2.
|
|
Mistral Medium 3.5
|
2026
|
Active
|
Medium-tier model listed in the current product lineup.
|
|
Mistral Small 4
|
2026
|
Active
|
Latest small model generation in the Mistral product line.
|
|
Magistral Small 2509
|
2025
|
Active
|
Reasoning-focused Mistral release listed in Amazon Bedrock model cards.
|
xAI |
Grok-1
|
November, 2023
|
Discontinued
|
First Grok generation integrated into X ecosystem products.
|
|
Grok-1.5
|
May, 2024
|
Discontinued
|
Improved reasoning and long-context capabilities (128k context window).
|
|
Grok-2
|
August, 2024
|
Discontinued
|
Stronger general model with improved coding and tool use; predecessor to Grok 3.
|
|
Grok-2 mini
|
August, 2024
|
Discontinued
|
Smaller Grok-2 variant focused on speed and efficiency.
|
|
Grok-2.5
|
August, 2025
|
Discontinued
|
Source-available Grok update released after Grok-2 and before Grok 4 family.
|
|
Grok-3
|
February, 2025
|
Discontinued
|
Major flagship release with reasoning modes and DeepSearch.
|
|
Grok-3 mini
|
February, 2025
|
Discontinued
|
Faster and smaller Grok-3 variant released alongside Grok-3.
|
|
Grok 4
|
July, 2025
|
Active
|
Successor to Grok 3; flagship generation introduced with Grok 4 Heavy.
|
|
Grok 4 Fast
|
September, 2025
|
Active
|
Enterprise-focused variant tuned for lower latency and reduced token usage.
|
|
Grok 4.1
|
November, 2025
|
Active
|
Incremental Grok 4 update aimed at better reasoning and lower hallucination rates.
|
|
Grok 4.1 Fast
|
November, 2025
|
Active
|
Fast variant of Grok 4.1 optimized for tool-calling and agentic workflows.
|
|
Grok 4.3 Beta
|
April, 2026
|
Active
|
Latest stable track listed in Wikipedia infobox timeline.
|
Cohere |
Command R
|
March, 2024
|
Active
|
Enterprise-focused model optimized for retrieval and tool use.
|
|
Command R+
|
April, 2024
|
Active
|
Higher-capability tier in the Command R family.
|
|
Aya 23
|
2024
|
Active
|
Open multilingual model family from Cohere For AI.
|
Alibaba (Qwen) |
Qwen 1.5
|
February, 2024
|
Active
|
Major open model line update with broad size variants.
|
|
Qwen 2
|
June, 2024
|
Active
|
Improved multilingual and coding performance.
|
|
Qwen 2.5
|
September, 2024
|
Active
|
Strong open model family with broad ecosystem adoption.
|
|
Qwen 3
|
April, 2025
|
Active
|
Next generation Qwen family with stronger reasoning variants.
|
|
QwQ-32B
|
November, 2024
|
Active
|
Reasoning-focused model line released with open weights.
|
|
Qwen2.5-Coder
|
November, 2024
|
Active
|
Code generation family for software engineering workloads.
|
|
Qwen2.5-VL
|
January, 2025
|
Active
|
Vision-language family with multimodal understanding across image and text.
|
|
Qwen2.5-Omni
|
March, 2025
|
Active
|
Multimodal model supporting text, image, video and audio interactions.
|
|
Qwen3-Coder
|
July, 2025
|
Active
|
Advanced coding-focused Qwen3 branch for agentic software development.
|
|
Qwen3-Omni
|
September, 2025
|
Active
|
Omni branch of Qwen3 targeting unified multimodal I/O.
|
|
Qwen3-VL
|
September, 2025
|
Active
|
Vision-language continuation in the Qwen3 generation.
|
|
Qwen3-Coder-Next
|
February, 2026
|
Active
|
Hybrid coding model positioned as a next-step update in the coder branch.
|
|
Qwen3.5
|
February, 2026
|
Active
|
Open-weights Qwen3.5 release focused on complex task completion.
|
|
Qwen3.5-Plus
|
February, 2026
|
Active
|
Higher-capability proprietary tier in the Qwen3.5 generation.
|
|
Qwen3.6-35B-A3B
|
April, 2026
|
Active
|
Open model release in the Qwen3.6 line (MoE-style activated parameters).
|
|
Qwen3.6-27B
|
April, 2026
|
Active
|
Qwen3.6 line release listed in the stable release timeline.
|
|
Qwen3.7 Plus
|
May, 2026
|
Active
|
Latest stable-track Qwen3.7 Plus release listed in the infobox timeline.
|
|
Qwen3.7 Max
|
May, 2026
|
Active
|
Latest flagship stable-track Qwen3.7 Max release.
|
DeepSeek |
DeepSeek Coder
|
November, 2023
|
Active
|
First DeepSeek model family focused on code generation tasks.
|
|
DeepSeek-V2
|
May, 2024
|
Active
|
Efficient MoE model focused on cost/performance.
|
|
DeepSeek-V2.5
|
September, 2024
|
Active
|
Unified update combining general and coding capabilities from the V2 line.
|
|
DeepSeek-V3
|
December, 2024
|
Active
|
Upgraded base model with strong benchmark results.
|
|
DeepSeek-V3-0324
|
March, 2025
|
Active
|
MIT-licensed refresh of the V3 line released in March 2025.
|
|
DeepSeek-V3.1
|
August, 2025
|
Active
|
Hybrid thinking/non-thinking architecture with improved coding benchmarks.
|
|
DeepSeek-V3.1-Terminus
|
September, 2025
|
Active
|
Incremental V3.1 update released as the Terminus variant.
|
|
DeepSeek-V3.2-Exp
|
September, 2025
|
Discontinued
|
Experimental V3.2 preview before final V3.2 release.
|
|
DeepSeek-V3.2
|
December, 2025
|
Active
|
V3.2 production release in the DeepSeek V3 family.
|
|
DeepSeek-VL2
|
November, 2025
|
Active
|
Vision-language model line extending DeepSeek multimodal capabilities.
|
|
DeepSeek-R1-Lite-Preview
|
November, 2024
|
Discontinued
|
Early preview release of the R1 reasoning line via chat.
|
|
DeepSeek-R1
|
January, 2025
|
Active
|
Reasoning-focused model family emphasizing chain-of-thought quality.
|
|
DeepSeek-R1-0528
|
May, 2025
|
Active
|
Updated R1 release under MIT license with improved reasoning behavior.
|
|
DeepSeek-V4-Pro (Preview)
|
April, 2026
|
Preview
|
Preview of the V4 series with 1M context window in the April 2026 announcement.
|
|
DeepSeek-V4-Flash (Preview)
|
April, 2026
|
Preview
|
Faster V4 preview variant released alongside V4-Pro.
|
AWS |
Titan
|
April, 2023
|
Active
|
Earlier Amazon Bedrock model family. Titan includes first-generation AWS foundation
model lines such as Titan Text, Titan Embeddings, and Titan Image.
|
|
Nova
|
December, 2024
|
Active
|
Newer Amazon Bedrock model family (for example Nova Micro, Lite, Pro, and Premier),
positioned as the latest AWS generation relative to Titan.
|
|
Nova Pro
|
2024
|
Active
|
Nova text/multimodal model variant explicitly listed in Amazon Bedrock model cards.
|
AI21 Labs |
Jamba 1.5 Mini
|
2024
|
Active
|
AI21 hybrid model available in Amazon Bedrock.
|
|
Jamba 1.5 Large
|
2024
|
Active
|
Larger AI21 Jamba variant listed in Amazon Bedrock model cards.
|
MiniMax |
MiniMax M2
|
2025
|
Active
|
MiniMax model family listed in Amazon Bedrock model cards.
|
|
MiniMax M2.1
|
2025
|
Active
|
Incremental MiniMax M2 line release in Bedrock model cards.
|
|
MiniMax M2.5
|
2026
|
Active
|
Latest MiniMax model listed in Amazon Bedrock model cards.
|
|
MiniMax M3
|
2026
|
Active
|
New MiniMax M3 generation listed in Amazon Bedrock model cards.
|
Moonshot AI |
Kimi K2 Thinking
|
2026
|
Active
|
Reasoning-focused Kimi model listed in Amazon Bedrock model cards.
|
|
Kimi K2.5
|
2026
|
Active
|
Updated Kimi family release listed in Amazon Bedrock model cards.
|
|
Kimi K2.7 Code
|
2026
|
Active
|
Coding-specialized Kimi release listed in the Moonshot model timeline.
|
NVIDIA |
NVIDIA Nemotron Nano 9B v2
|
2026
|
Active
|
Compact Nemotron model listed in Amazon Bedrock model cards.
|
|
NVIDIA Nemotron 3 Super 120B
|
2026
|
Active
|
Large NVIDIA Nemotron model listed in Amazon Bedrock model cards.
|
Writer |
Palmyra X4
|
2025
|
Active
|
Writer enterprise model listed in Amazon Bedrock model cards.
|
|
Palmyra X5
|
2026
|
Active
|
Newer Writer Palmyra release listed in Amazon Bedrock model cards.
|
Z.AI |
GLM 4.7
|
2026
|
Active
|
Z.AI GLM series release listed in Amazon Bedrock model cards.
|
|
GLM 4.7 Flash
|
2026
|
Active
|
Faster GLM variant listed in Amazon Bedrock model cards.
|
|
GLM 5
|
2026
|
Active
|
Latest GLM generation listed in Amazon Bedrock model cards.
|
|
GLM 5.2
|
2026
|
Active
|
Incremental GLM 5 update listed in the Z.AI model timeline.
|
Opencode |
Go
|
2026
|
Active
|
Umbrella routing model that centralizes API access to external LLM providers (Includes models like Kimi, GLM, Qwen, DeepSeek, ...)
|
|
Zen
|
2026
|
Active
|
Umbrella routing model that centralizes API access to external LLM providers (Includes premium models such as Claude (Sonnet, Opus), GPT (OpenAI), Gemini, ...)
|