The Next Thing Now Live 45 mins

What is Harness Engineering?

Hosted by
RB
Rob Borley
KS
Kev Smith

Harness Engineering Is Where Software Delivery Becomes a System Problem

For most of the modern software era, we have treated delivery capacity as a human problem. When there was too much work, we added people. When the work became harder to coordinate, we added process. When quality became inconsistent, we added reviews, ceremonies, testing disciplines and governance layers. The industry built an extraordinary amount of structure around one simple constraint: software was produced by human beings, and human beings needed help to produce it well.

That assumption shaped almost everything. Agile gave teams a way to work with uncertainty. Scrum gave organisations a way to plan, estimate and coordinate work. DevOps responded to the friction between building software and running it. Test automation, code reviews, architecture boards and documentation standards all emerged from the same underlying reality. Humans were the production system, so the job of management was to organise, support, constrain and improve human production.

We are making the thing... that makes the thing.

- Kev Smith 

Harness Engineering matters because it signals a change in that reality. The phrase sounds technical, and in one sense it is. It deals with the environment around AI agents, the tools they can use, the context they can access, the rules they must follow and the checks they must pass. Yet the deeper significance is organisational. It asks what happens when software delivery stops being limited by the number of people available to write code and starts being shaped by the quality of the system that surrounds autonomous capability.

Many organisations are still thinking about AI as an individual productivity tool, a faster autocomplete, a better assistant or a way for developers to move through tickets more quickly. That has value, but it is not where the bigger story sits. The bigger story begins when the model is no longer simply helping a person complete a task, but is acting inside a structured environment that allows it to produce, test, validate, document and refine meaningful software artefacts with far less human intervention.

In our recent TNTN: LIVE discussion, Kev described this as “making the thing that makes the thing.” That phrase captures the movement of value away from the direct production of code and towards the design of the production environment itself. The work of the engineer begins to shift from writing every line, fixing every issue and manually pushing every artefact through the delivery process, towards building the constraints, inputs, workflows and feedback loops that allow an agentic system to do useful work reliably.

This is where the conversation around AI in software development often goes wrong. It becomes too focused on the intelligence of the model. We compare benchmarks, context windows, coding scores and reasoning capabilities, as if the smartest model will automatically produce the best outcome. In practice, the model is only one part of the system. The same model can produce shallow, brittle, chaotic output in one environment and useful, tested, maintainable software in another. The difference is the quality of the harness.

A harness gives the agent its working world. It gives it access to the repository, the command line, documentation, architecture decisions, coding standards, test suites, security checks, deployment constraints and organisational context. It defines what the agent is allowed to do, what it must verify, what it should consider and where human judgement is required. It turns raw model capability into something that can operate inside a professional engineering environment.

That point matters because software development has never been about code alone. Good software depends on judgement, architecture, context, constraints, priorities and trade-offs. The code is only the visible artefact of a much larger system of thinking. AI can generate code at remarkable speed, but speed alone does not create trust. Without a harness, speed can simply produce more mess, more inconsistency and more technical debt at a pace that humans can no longer comfortably inspect.

This is why Harness Engineering should be seen as a trust discipline as much as a delivery discipline. The central question for organisations is not whether AI can produce software. It clearly can. The question is whether they can create an environment in which AI-produced software can be relied upon. That requires more than a prompt and more than access to a coding agent. It requires an engineered system of context, validation and control.

The implications for software teams are significant. If agents can produce code, generate tests, update documentation, run analysis and perform refactoring work inside a controlled process, then the human role changes. Human time becomes concentrated around the points of greatest leverage: defining the problem clearly, shaping the architecture, reviewing the quality of the output, improving the harness and deciding whether the result is good enough to move forward. The craft does not vanish, but it moves. It becomes less about constant manual production and more about direction, judgement and system design.

That movement will create discomfort, particularly in organisations that have spent years refining delivery processes around human throughput. Many rituals of software delivery exist because human coordination is difficult. Sprint planning, estimation, story sizing, stand-ups and review cycles all make sense in a world where work is performed by teams of people operating across days and weeks. They may still have value, but their value needs to be re-examined in a world where a feature can be generated, tested, corrected and regenerated in a fraction of the time.

The leadership conversation is therefore more interesting than the technology conversation. Harness Engineering is not simply a new way for developers to work. It is a sign that delivery capacity is becoming less dependent on headcount and more dependent on the maturity of the system around the work. That is a profound change for any organisation that still thinks increased output primarily comes from larger teams.

The uncomfortable truth is that many organisations are not ready for this shift. Their processes are too slow, their documentation is too fragmented, their governance is too manual and their architecture is too implicit. Much of the knowledge required to guide intelligent systems still lives in people’s heads, old tickets, scattered wikis and undocumented conventions. That was already a problem for human teams. It becomes a much larger problem when agents need structured access to that knowledge in order to produce reliable work.

Harness Engineering forces that hidden weakness into the open. If your organisation cannot clearly describe how good software should be produced, an agent will not magically infer it. If your standards are inconsistent, your outputs will be inconsistent. If your architecture is unclear, your generated code will drift. If your validation is weak, speed will amplify risk. The harness reflects the maturity of the organisation that builds it.

This is why Harness Engineering should not be dismissed as another passing term. It names the part of the AI transition that enterprises most need to understand. The model may provide the intelligence, but the harness determines whether that intelligence can be trusted with real work. The organisations that learn this lesson will build faster, but more importantly they will build with greater confidence. Those that do not will either underuse the technology or unleash it into environments too weak to absorb the consequences.

Software delivery is becoming a system problem in a much deeper sense than before. The winners will be the organisations that design the system deliberately.