Product

GPT-5.2 Support in Warp & New Terminal-Bench Score

Suraj Gupta

December 11, 2025

Today we’re excited to roll out full support for OpenAI’s GPT-5.2 across Warp. After extensive benchmarking, tuning, and real-world stress testing, GPT-5.2 is now available to all Warp users, and it consistently delivers the best end-to-end coding performance we have seen yet.

Alongside the release, Warp’s agent achieved a new score on Terminal Bench 2.0, ranking #2 overall and setting a new high watermark for terminal-native agentic coding performance.

Under the Hood: Why GPT-5.2 Feels Different

GPT-5.2 improves across every dimension developers care about: planning, speed, reliability, and the ability to “close the loop” on complex tasks.

In testing, we saw several standout improvements:

Stronger planning with less guidance. In Warp’s planning mode, GPT-5.2 now proposes coherent multi-step approaches without requiring highly precise instructions, even for design or structural changes.
Closes the loop with higher reliability. Agents have long been able to verify their own changes, but GPT-5.2 makes this behavior significantly more reliable, reducing dead-ends and producing cleaner end-to-end workflows.
Fast execution. Lower latency and adaptive reasoning make interactions feel more fluid inside the terminal.
Parallel tool use. GPT 5.2 more intelligently uses parallel tool calling to improve the efficiency of open-ended search and multi-file edits.
Noticeably better UX. The team described GPT-5.2 as a smoother, more predictable interaction pattern compared to previous GPT-5 models.

Across the board, Warp’s agent became far more reliable at handling long-horizon tasks that require sustained reasoning and verification, a key ingredient in agentic development.

Warp's strongest Terminal-Bench Score

Paired with Warp, GPT-5.2 achieved a best-in-class score of 61.14% on Terminal Bench, marking Warp’s strongest performance to date.

This reflects not just raw model quality, but the depth of integration between Warp’s agent platform and OpenAI’s latest model.

Partnering with OpenAI

We partnered closely with OpenAI to optimize GPT-5.2 for Warp—fine-tuning prompt structure, tool definitions, and planning heuristics to balance speed, accuracy, and context awareness inside the terminal.

We’re thrilled to see Warp reach another best in class terminal bench score, hitting 61.14% with GPT-5.2! We love how Warp supports builders through the entire software development cycle and are excited to see them continue to push the frontier.

Try 5.2 in Warp

Give GPT-5.2 a try in Warp, we’re excited to hear your feedback!

Start your software factory

Book a demo and we’ll walk you through the workflows that map to your stack.

Illustration for If you want better agent ROI and governance, move your agents to the cloud

Jul 18, 2026 · 4 min

Get agents off your machine

Even though we are living in 2026, it feels like folks have forgotten the lessons of 2006: software belongs in the cloud, not on individuals’ desktops.

BYO_API_KEY with the Claude, Gemini CLI, and Codex logos

May 20, 2026 · 6 min

Bring your own inference to Warp

Today we’re releasing one of the most requested updates from the Warp community: more control over inference.

Oz is the first multi-harness control plane for cloud agents including Claude code, codex, and whatever comes next.

May 19, 2026 · 6 min

A single pane of glass for managing all of your cloud agents

With Oz, engineering teams can now integrate, orchestrate, control, and improve any cloud agent at scale including Claude Code, Codex, Warp Agent, and whatever comes next.

GPT-5.2 Support in Warp & New Terminal-Bench Score

Under the Hood: Why GPT-5.2 Feels Different

Warp's strongest Terminal-Bench Score

Partnering with OpenAI

Try 5.2 in Warp

Start your software factory

Related articles

Get agents off your machine

Bring your own inference to Warp

A single pane of glass for managing all of your cloud agents

Get Warp today

Mac

Linux

Windows