Product

Build vs buy: how to deploy coding agents at scale

Zach Lloyd

March 24, 2026

The consensus amongst engineering leaders I talk to is that they want to deploy cloud coding agents to automate development this year. Their goal is for agents to be first-line contributors across the entire SDLC: from planning and prototyping to coding and testing to deploying and monitoring.

The big question I keep hearing is how. Should they build an in-house orchestration system, or buy one off the shelf from Warp, Cursor, OpenAI, or someone else? Or perhaps some hybrid?

It depends on a bunch of factors, and there is no one size fits all solution. It depends on how complex a company's internal stack is, how much appetite there is for building and maintaining an agent stack, how many harnesses to support, needs around handoff, tracking, sandboxing, etc.

Let me dive in on how I would make this decision and how we are thinking about it as we build Oz, Warp's cloud orchestration platform.

The intelligence infrastructure problem

The way I conceive of the problem (and how Oz is architected) is to help companies deploy intelligence infrastructure that allows for agents to scale in the cloud. Your agent infrastructure should support a bunch of primitives like auditing, governance, sandboxing, etc.

Expanding on these primitives, you probably want:

Control plane: an abstraction layer that governs which models/harnesses are used and why
Orchestration: workflow coordination across models, agents, tools, and systems
Harnesses: what drives the model (e.g. Claude Code, Codex, Oz, etc.)
Context Architecture: persistent memory, context graphs, proprietary data integration, tribal knowledge and feedback loops — the institutional intelligence that compounds over time as agents access it
Evaluation and Observability: continuous benchmarking, performance monitoring, cost optimization, and quality assurance across vendors
Enterprise governance and compliance: policy enforcement, access controls, sandboxing, model risk management, compliance logging, auditability, traceability, guardrails and regulatory alignment

All of these need to be packaged together into a great developer experience (DX). Your engineering team is going to be the primary user of your internal orchestration system, so they need a good admin user experience for configuration and management.

What features you'll need

In terms of features, you'll want your infrastructure to have:

Policies: who can use what model/harness on what repo and when
Tracking for agent runs: what triggered the agent, what did it do, etc.
Continuous improvement: how your agent system improves over time using memory
Handoff: how engineers take the wheel when an agent gets stuck
Collaboration: how engineers work together and how they see what agents are doing, to coordinate and approve their work
Data access: what data stores can agents access, how is that access granted and tracked
Triggers: how are agents triggered, how they integrate with internal and external systems, and how it is programmable. You may also consider an "if this, then that" type system
Artifacts: what files, PRs, branches, plans etc. can agents produce
Metrics: how do you measure the success of agents and improve them over time
Sandboxing: how do you control what the agent can do and make sure they execute securely
Hosting: what compute does the agent run on
And so on…

The MVP trap

In terms of build vs. buy, there's obviously a big surface area of stuff to build here, but there's no need to build it all at once.

For instance, it's easy to build a simple web app that lets developers prompt an agent to run on a cloud machine somewhere. The MVP I see folks often build is an app with a big prompt box and a harness picker that spins up a dev environment, passes a prompt to an agent SDK and then polls for the result. You can build something like this in a few days; a lot of companies are starting here.

Tempting as it is, I would only go down this route if you are committed to building the real thing – because you'll find that as soon as you have the MVP, you'll start to want features like handoff, audit logs, evals, integrations into other apps, agent access to private data, self-improvement, etc., and that starts to become a pretty big effort.

In other words, the MVP is pretty easy, but the remaining pieces to manage agents at scale is actually where the bulk of the work is. For companies with large engineering teams and very complex dev setups (say, Stripe, for example), I think owning this entire stack makes sense. It's going to be a lot of work for them to integrate a third-party stack, and they have the resources to continuously improve their homegrown system.

For most companies though, I might start with an MVP and get a feel for the limits. But I would also explore how you can use other primitives to build on (or maybe just try a soup-to-nuts solution). For instance, you probably don't want to build handoff and mobile access. I doubt you'll want to build evals. I would try to figure out the essential parts that make sense for you to own, which are typically related to the idiosyncrasies of your dev setup, and find ways to buy the rest.

How we're building Oz

For Oz, what we are finding in our discussions with eng leaders is that while some companies want a plug-and-play orchestrator, others want to use parts of our solution, which is why we are building it as a programming stack, not just a product.

Architecture diagram of an agent orchestration platform showing agent triggers, control plane with orchestration and observability, task delegation to agent harnesses, hosting and deployment, and a memory and context layer

For instance, folks want the ability to run an agent within their existing dev setups, and not have to define new Docker environments in order for an agent to do work. They can get this by running a "raw" agent SDK inside their existing container, but if they go that route they have to build all the orchestration primitives themselves. Oz supports a raw-style deployment that comes with tracking features built in, without having to use any of our other orchestration primitives.

Other things we consistently hear are the need for self-hosting agents so that code doesn't flow to third-party systems and a desire to swap multiple harnesses for benchmarking and cost control. They want a memory layer that sits above the harnesses that can self-improve over time. Increasingly, they want not just developers but all knowledge workers to be able to build with a single agent platform.

How to decide

To summarize, the factors to consider are:

Resources to build and maintain an orchestrator – it's easy to do an MVP, but a lot of surface area for a production grade solution that will scale
Complexity of your company's development stack – the more complex it is, the less likely a prebuilt cloud agents solution will serve your needs (and if Oz doesn't fit those needs, we'd love to talk to you). You should think incrementally though and not approach build vs. buy as an all or nothing decision
How fast you want to get up and running – this one actually could go either way. You might be able to build an MVP very fast that is a good short term solution, or you might be able to deploy something like Oz more quickly.
Lock-in aversion: if you don't care about lock-in, you might go all in on a single-harness solution, which will simplify building…but you lose optionality down the road.

If you're an engineering leader weighing these choices I'd love to connect, learn and help you navigate the landscape.

BYO_API_KEY with the Claude, Gemini CLI, and Codex logos

May 20, 2026 · 6 min

Bring your own inference to Warp

Today we’re releasing one of the most requested updates from the Warp community: more control over inference.

Oz is the first multi-harness control plane for cloud agents including Claude code, codex, and whatever comes next.

May 19, 2026 · 6 min

A single pane of glass for managing all of your cloud agents

With Oz, engineering teams can now integrate, orchestrate, control, and improve any cloud agent at scale including Claude Code, Codex, Warp Agent, and whatever comes next.

Apr 28, 2026 · 7 min

Warp is now open-source

Warp is now open-source, and the community can participate in building it using an agent-first workflow managed by Oz, our cloud agent orchestration platform.

Build vs buy: how to deploy coding agents at scale

The intelligence infrastructure problem

What features you'll need

The MVP trap

How we're building Oz

How to decide

Related articles

Bring your own inference to Warp

A single pane of glass for managing all of your cloud agents

Warp is now open-source

Get Warp today

Mac

Linux

Windows