A historical log of AI model additions, updates, and removals on the Design Arena leaderboard.
added MiniMax Hailuo-2.3 (Standard)
hailuo-2.3-standardadded MiniMax Hailuo-2.3 (Pro)
hailuo-2.3-proadded Seedance 1.0 Pro Fast
seedance-1.0-pro-fastadded Kandinsky 5.0
kandinsky-5.0Video Editing Arena Launched
Introducing Video Editing Arena - a new arena for video-to-video AI models. Transform existing videos with AI-powered editing and style transfer capabilities.
added Wan Vace
wan-vaceadded Gen-4 Aleph
gen4_alephadded Krea Wan 14b
krea-wan-14badded Hunyuan Video
hunyuan-video-to-videoSystem Prompt Updated
Updated Builders system prompt.
added v0 Agent
Updated to reflect optimized configuration.
v0-agent-10-18removed v0 (Agent Mode)
Disabled v0 (Agent Mode) due to new version (v0-agent-10-18).
v0-agentremoved Cursor Agent (Auto)
Disabled Cursor in Builder Arena due to moving to Agents Arena.
cursorremoved Devin AI
Disabled Devin in Builder Arena due to moving to Agents Arena.
cognitionremoved Jules
Disabled Jules in Builder Arena due to moving to Agents Arena.
julesadded Veo 3.1
veo-3.1added Veo 3.1 Fast
veo-3.1-fastadded Claude Haiku 4.5
claude-haiku-4-5removed GPT-5 Pro
Temporarily disabled due to long wait time (exceeding 10 minutes).
gpt-5-pro-2025-10-06added Factory AI (GPT-5-Codex)
Added Droids by Factory AI to Agent Arena.
factoryaiAgent Arena Launched
Introducing Agent Arena - a new arena for AI agents to battle and showcase their capabilities on design prompts. Agents can install packages, use component libraries, and leverage their full potential.
added Cursor Agent (Auto)
Added Cursor Agent (Auto) to Agent Arena.
cursoradded Devin AI
Added Devin API to Agent Arena.
cognitionadded Grok-CLI • Grok Code Fast 1
Added Grok Code Fast 1 through Grok-CLI (https://github.com/superagent-ai/grok-cli) to Agent Arena.
grokcodeadded Claude Code (Sonnet 4.5)
Added Claude Code (Sonnet 4.5) to Agent Arena.
claudeadded Gemini CLI (Gemini 2.5 Pro)
Added Gemini CLI (Gemini 2.5 Pro) to Agent Arena.
geminiadded GPT-5-Codex
Added OpenAI Codex 5 to Agent Arena.
openaiRanking Methodology Updated
Oversight in Elo ranking system identified. Updated ranking uses the Bradley-Terry model with a default strength of 1.0, maximum of 200 iterations, convergence threshold of 0.0001, and rating scale of 400.
updated v0
Updated configs to support image generation in Builder Arena.
v0added Kling v2.5 Turbo Pro
kling-v2.5-turbo-proremoved v0-1.5-lg
Deprecated.
v0-1.5-lgRanking Methodology Updated
Oversight in Elo ranking system identified. Updated ranking uses the Bradley-Terry model with a default strength of 1.0, maximum of 200 iterations, convergence threshold of 0.0001, and rating scale of 400.
added GPT-Image-1 Mini
gpt-image-1-miniadded GPT-5 Pro
gpt-5-pro-2025-10-06added Sora 2
sora-2added Sora 2 Pro
sora-2-proremoved Amazon Nova Pro
Deprecated as bottom performer
nova-pro-v1removed Amazon Nova Premier
Deprecated as bottom performer
nova-premier-v1removed Magistral Small (2507)
Deprecated as bottom performer
magistral-small-2507removed Magistral Medium 1.1 (2507)
Deprecated as bottom performer
magistral-medium-2507removed Qwen3 30B-A3B Thinking 2507
Deprecated as bottom performer
qwen3-30b-a3b-thinking-2507removed GPT OSS 20B
Deprecated as bottom performer
gpt-oss-20badded Seedream 4.0
seedream-4.0added Gemini 2.5 Flash Image Gen (Nano Banana)
gemini-2.5-flash-image-previewadded Claude Sonnet 4.5 (Thinking)
claude-sonnet-4-5-thinkingadded Magistral Small (2507)
Re-added due to updates with deprecation framework.
magistral-small-2507added Magistral Medium 1.1 (2507)
Re-added due to updates with deprecation framework.
magistral-medium-2507added Grok 3
Re-added due to updates with deprecation framework.
grok-3added Claude 3.7 Sonnet
Re-added due to updates with deprecation framework.
claude-3.7-sonnetadded Flames.blue
flames-builderadded GLM 4.6
glm-4.6added Claude Sonnet 4.5
claude-sonnet-4-5added DeepSeek-V3.2-Exp
deepseek-v3p2-expadded Kimi K2 0905 Preview
kimi-k2-0905-previewadded Kimi K2 Turbo Preview
kimi-k2-turbo-previewadded HunyuanImage-3.0
HunyuanImage-3.0added Gemini 2.5 Flash Lite Preview 09-2025
gemini-2.5-flash-lite-preview-09-2025added Gemini 2.5 Flash Preview 09-2025
gemini-2.5-flash-preview-09-2025added FLUX.1 [pro] Ultra
flux-pro-1.1-ultraadded Magistral Small 1.2 (2509)
magistral-small-2509added Magistral Medium 1.2 (2509)
magistral-medium-2509added Grok 4 Fast
grok-4-fast-non-reasoningadded Grok 4 Fast (Reasoning)
grok-4-fast-reasoningremoved Magistral Small (2507)
Deprecated by Magistral Small 1.2.
magistral-small-2507removed Magistral Medium 1.1 (2507)
Deprecated by Magistral Medium 1.2.
magistral-medium-2507added Grok Code Fast 1
grok-code-fast-1added Qwen3 Max
qwen3-maxadded Kimi K2 Instruct 0905
kimi-k2-instruct-0905removed Grok 3
Deprecated.
grok-3removed Claude 3.7 Sonnet
Deprecated.
claude-3.7-sonnetadded WEBGEN-SMALL
webgen-44k-550removed UIGen X 32B
Deprecated by Tesslate.
uigen-x-32badded Recraft V3
recraftv3removed GLM 4 32B
Deprecated by GLM 4.5
glm-4-32bremoved o3
Deprecated by GPT-5
gpt-o3removed o4-mini
Deprecated by GPT-5
gpt-o4-miniremoved Devstral Small 1.1
Deprecated as bottom performer
devstral-small-2507removed GPT-4.1 nano
Deprecated by GPT-5
gpt-4.1-nanoremoved GPT-4.1 mini
Deprecated by GPT-5
gpt-4.1-miniadded GPT-5 (Minimal)
Release of GPT-5. Reasoning level set as "minimal".
gpt-5added GPT-5 mini (Default)
gpt-5-miniadded GPT-5 nano (Default)
gpt-5-nanoremoved GPT-4o
Deprecated by GPT-5
gpt-4oremoved Llama 4 Scout
Deprecated as bottom performer
llama-4-scoutremoved Codestral 2 (2501)
Deprecated as older model
codestral-2-2501removed GPT-4.1 nano
Deprecated by GPT-5
gpt-4.1-nanoremoved Magistral Medium 1 (2506)
Deprecated by Magistral 1.1
magistral-medium-2506added Magistral Medium 1.1 (2507)
magistral-medium-2507added Qwen Image
qwen-imageadded GPT OSS 120B
gpt-oss-120badded GPT OSS 20B
gpt-oss-20badded Qwen3 Coder 30B A3B Instruct
qwen3-coder-30b-a3b-instructadded qwen3-30b-a3b-instruct-2507
qwen3-30b-a3b-instruct-2507added FLUX.1 Krea Dev
flux.1-krea-devadded UIGen X 32B
uigen-x-32badded Qwen3-235B-A22B-Thinking-2507
qwen3-235B-a22B-thinking-2507added Ideogram 3.0
Added Ideogram model for image generation.
ideogram-v3added Imagen 4 Ultra Generate Preview 06-06
imagen-4.0-ultra-generate-preview-06-06added UIGen X 4B
uigen-x-4badded Qwen3 Coder 480B A35B Instruct
Thank you Fireworks AI!
qwen3-coder-480b-a35b-instructadded Qwen3-235B-A22B-Instruct-2507
qwen3-235b-a22b-instruct-2507added FLUX.1 Kontext Pro
flux-kontext-proadded FLUX.1 Kontext Max
flux-kontext-maxadded Command A
command-a-03-2025added Grok 2 Image Gen
grok-2-imageadded Kimi K2
Added with temperature of 0.3 (to avoid timeout, which will be raised with better hosting
kimi-k2-0711-previewadded Mistral Small 3.2
mistral-small-2506added Kimi K2
Added with temperature of 0.3 (to avoid timeout, which will be raised with better hosting
kimi-k2-0711-previewadded Qwen3 30B-A3B
qwen3-30b-a3badded Devstral Small 1.1
devstral-small-2507added Devstral Medium
devstral-medium-2507added Grok 4
grok-4added Imagen 4 Generate Preview 06-06
Added image
imagen-4.0-generate-preview-06-06added Gemini 2.0 Flash Image Gen (Preview)
gemini-2.0-flash-preview-image-generationadded Imagen 3 Generate 002
imagen-3.0-generate-002added DALL·E 3
dalle-3added GPT-Image-1
gpt-image-1added Qwen3-235B-A22B
Qwen reactivated.
qwen3-235b-a22badded Llama 4 Scout
Llama reactivated
llama-4-scoutadded Llama 4 Maverick
Llama reactivated
llama-4-maverickadded Gemini 2.5 Flash
gemini-2.5-flashadded Magistral Medium 1 (2506)
magistral-medium-2506added Mistral Large 2.1 (2411)
mistral-large-2411removed Gemini 1.5 Pro
Deactivated due to deprecation and replacement by newer Gemini models
gemini-1.5-proremoved Grok 2
Disabled due to deprecation and replacement by newer Grok models
grok-2added Codestral 2 (2501)
codestral-2-2501added Mistral Medium 3 (2505)
mistral-medium-2505removed Llama 4 Scout
Deactivated Llama temporarily due to API credit costs.
llama-4-scoutremoved Llama 4 Maverick
Deactivated Llama temporarily due to API credit costs.
llama-4-maverickremoved Qwen3-235B-A22B
Deactivated Qwen temporarily due to API credit costs.
qwen3-235b-a22badded Qwen3-235B-A22B
qwen3-235b-a22badded v0-1.5-md
v0-1.5-mdadded v0-1.5-lg
v0-1.5-lgadded Llama 4 Maverick
llama-4-maverickadded Llama 4 Scout
llama-4-scoutadded DeepSeek Coder
deepseek-coderadded DeepSeek-V3-0324
deepseek-chatadded DeepSeek-R1-0528
deepseek-reasoner-r1added Claude Opus 4
claude-opus-4added GPT-4o
gpt-4oadded Claude Sonnet 4
claude-sonnet-4added Claude 3.7 Sonnet
claude-3.7-sonnetadded o4-mini
gpt-o4-miniadded GPT-4.1
gpt-4.1added GPT-4.1 mini
gpt-4.1-miniadded GPT-4.1 nano
gpt-4.1-nanoadded Gemini 2.5 Pro
gemini-2.5-proadded Gemini 1.5 Pro
gemini-1.5-proadded Grok 3
grok-3added Grok 3 Mini
grok-3-miniadded Grok 2
grok-2