The Model Is Only 10%: The Real Lesson of the New SDLC

TL;DR

A recent Google whitepaper argues that in AI-assisted development the model accounts for only about 10% of system behavior. The other roughly 90% comes from the harness and context engineering, which changes where organizations should invest in AI tools and infrastructure.

A new Google whitepaper, The New SDLC With Vibe Coding, argues that the real leverage in an AI system lies in the harness and context engineering rather than in the model. That claim challenges the common industry assumption that model choice is the main driver of results, and it has direct implications for how organizations develop and deploy AI systems.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that most of an AI system's performance depends on configuration, scaffolding, and contextual setup. The authors illustrate this with a result on Terminal Bench 2.0, where a single model moved from outside the top 30 to the top 5 when only the harness changed.

In the paper's framing, the harness covers everything around the model: prompts, tools, rules, and observability. Because teams own these pieces directly, the paper shifts the strategic focus from model selection to configuration and context engineering, which it argues are more controllable and more cost-effective over time.
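
One way to picture the split between model and harness is as a data structure. The sketch below is illustrative only and is not taken from the whitepaper: the `Harness` and `Agent` classes, their fields, and the fake `echo_model` are all hypothetical names chosen for this example. It shows that the same model function can behave differently when only the surrounding configuration changes.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical stand-in for a model call: takes a prompt, returns text.
ModelFn = Callable[[str], str]


@dataclass
class Harness:
    """Everything around the model that the team controls (illustrative)."""
    system_prompt: str
    rules: List[str] = field(default_factory=list)
    tools: Dict[str, Callable[[str], str]] = field(default_factory=dict)
    trace: List[str] = field(default_factory=list)  # minimal observability log

    def build_prompt(self, task: str) -> str:
        rules = "\n".join(f"- {r}" for r in self.rules)
        tools = ", ".join(sorted(self.tools)) or "none"
        return f"{self.system_prompt}\nRules:\n{rules}\nTools: {tools}\nTask: {task}"


@dataclass
class Agent:
    """Agent = Model + Harness."""
    model: ModelFn
    harness: Harness

    def run(self, task: str) -> str:
        prompt = self.harness.build_prompt(task)
        self.harness.trace.append(f"prompt_chars={len(prompt)}")
        answer = self.model(prompt)
        self.harness.trace.append(f"answer_chars={len(answer)}")
        return answer


def echo_model(prompt: str) -> str:
    # Fake model for the demo: reports how many rule lines it was given.
    n = prompt.count("\n- ")
    return f"rules_seen={n}"


# Same model, two harnesses.
bare = Agent(echo_model, Harness(system_prompt="You are a coding agent."))
tuned = Agent(echo_model, Harness(
    system_prompt="You are a coding agent.",
    rules=["Run tests before committing", "Never edit generated files"],
    tools={"run_tests": lambda arg: "ok"},
))
```

Calling `bare.run("fix bug")` and `tuned.run("fix bug")` hits the same `echo_model`, yet the outputs differ because the harness fed it different context. A real harness would also include hooks and sandboxes, which are left out here for brevity.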

The authors also discuss the economics of AI development. Vibe coding (quick prompts with minimal review) appears cheap at first but incurs high long-term costs through token inefficiency, maintenance, and security vulnerabilities. Disciplined, structured approaches, which they call agentic engineering, require higher upfront investment but carry lower marginal costs, making them more sustainable.
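
The trade-off described here can be sketched as a simple break-even calculation. This is a minimal model, assuming costs are linear in the number of features shipped; the function names and all dollar figures are invented for illustration and do not come from the whitepaper.

```python
import math


def cumulative_cost(upfront: float, per_feature: float, features: int) -> float:
    """Total spend after shipping `features` features (linear model)."""
    return upfront + per_feature * features


def break_even_features(upfront_a: float, per_feature_a: float,
                        upfront_b: float, per_feature_b: float) -> int:
    """Smallest feature count at which approach B costs no more than A.

    Assumes B has the lower per-feature cost; otherwise B never catches up.
    """
    if per_feature_b >= per_feature_a:
        raise ValueError("approach B must be cheaper per feature than A")
    n = (upfront_b - upfront_a) / (per_feature_a - per_feature_b)
    return max(0, math.ceil(n))


# Made-up numbers: vibe coding has no upfront cost but 5x the cost per feature.
VIBE = {"upfront": 0.0, "per_feature": 500.0}
AGENTIC = {"upfront": 20_000.0, "per_feature": 100.0}

crossover = break_even_features(
    VIBE["upfront"], VIBE["per_feature"],
    AGENTIC["upfront"], AGENTIC["per_feature"],
)
```

With these invented inputs the two approaches cost the same at 50 features, and the structured approach is cheaper from then on. Real costs are unlikely to be linear, so the crossover point is best treated as a rough planning aid.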

At a glance
When: published early 2026
The development: Google’s new whitepaper highlights that the most significant aspect of AI-driven software development is the harness and context, not the AI model itself.
The Model Is Only 10%: The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system, and the scaffolding around it does the real work.

A spectrum, not a binary: the differentiator is how outputs get verified
Vibe Coding: casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted: detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering: formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding, however clever the prompt.
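
The pairing of tests and evals can be sketched as a release gate that refuses to pass unless both kinds of check are present and green. This is an illustrative sketch, not the paper's tooling: `release_gate`, `GateResult`, and the 0.8 eval threshold are assumptions made up for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class GateResult:
    passed: bool
    reasons: List[str]


def release_gate(tests: List[Callable[[], bool]],
                 eval_scores: List[float],
                 min_eval_mean: float = 0.8) -> GateResult:
    """Pass only if deterministic tests AND evals both exist and succeed."""
    reasons: List[str] = []

    # Deterministic side: every test must return True.
    if not tests:
        reasons.append("no deterministic tests supplied")
    else:
        failed = sum(1 for t in tests if not t())
        if failed:
            reasons.append(f"{failed} test(s) failed")

    # Non-deterministic side: mean eval score must clear the threshold.
    if not eval_scores:
        reasons.append("no evals supplied")
    else:
        mean = sum(eval_scores) / len(eval_scores)
        if mean < min_eval_mean:
            reasons.append(f"eval mean {mean:.2f} below {min_eval_mean:.2f}")

    return GateResult(passed=not reasons, reasons=reasons)


ok = release_gate([lambda: True], [0.9, 0.85])
no_evals = release_gate([lambda: True], [])
weak_evals = release_gate([lambda: True], [0.5, 0.6])
```

Here `ok.passed` is true, while `no_evals` and `weak_evals` both fail with a reason attached. Treating a missing eval suite as a failure, rather than a skip, is the design choice that separates this gate from a tests-only CI check.
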
The idea worth building your strategy around
Agent = Model + Harness
Model: ~10%
Harness (prompts · tools · context · hooks · sandboxes · observability): ~90%, and it is your surface area, not the provider’s
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness, with the same model.
“Most agent failures, examined honestly, are configuration failures”: a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding: low CapEx · high OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering: high CapEx · low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success, plus intelligent model routing that sends the easy work to cheap models.
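
Model routing can be as simple as a rule that sends easy tasks to a cheap model and everything else to a stronger one. The sketch below assumes a keyword-and-size heuristic; the tier names, prices, hint words, and file-count threshold are all hypothetical, and a production router would more likely use a classifier or past success rates.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelTier:
    name: str
    cost_per_1k_tokens: float  # made-up prices for illustration


CHEAP = ModelTier("small-model", 0.25)
STRONG = ModelTier("large-model", 2.0)

# Hypothetical signals that a task needs the stronger model.
HARD_HINTS = ("refactor", "migrate", "concurrency", "security")
MAX_FILES_FOR_CHEAP = 3


def route(task: str, files_touched: int) -> ModelTier:
    """Pick a tier from simple, inspectable rules."""
    text = task.lower()
    if files_touched > MAX_FILES_FOR_CHEAP or any(h in text for h in HARD_HINTS):
        return STRONG
    return CHEAP


def estimated_cost(tier: ModelTier, tokens: int) -> float:
    """Rough spend for one call at the tier's price."""
    return tier.cost_per_1k_tokens * tokens / 1000
```

For example, `route("rename a variable", 1)` selects the cheap tier, while `route("Refactor auth module", 2)` selects the strong one. Keeping the rules explicit makes routing decisions easy to log and audit, which fits the paper's emphasis on observability.
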
85% of devs use AI coding agents (51% daily)
41% of all new code is AI-generated
~90% of agent behavior is the harness, not the model
+19% longer on some tasks (METR): verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

Implications for AI Development Strategies

For organizations, the takeaway is to prioritize robust harnesses and context management around AI models rather than focusing solely on acquiring the latest model. On this view, long-term competitive advantage depends on how well teams engineer and maintain their AI environments, which they control directly, rather than on chasing model upgrades.

For decision-makers, this means re-evaluating investments in AI infrastructure, training staff in context engineering, and developing best practices for configuration management. It also highlights that cost efficiency in AI is achieved through disciplined engineering rather than ad-hoc prompt optimization.

Evolution of AI Development Practices and Industry Insights

The whitepaper arrives amid rapid adoption: it reports that 85% of developers were using AI coding agents as of early 2026. Industry attention has so far centered on model performance and new model releases, but the experiments and benchmarks the paper cites suggest that configuration and scaffolding matter more than raw model power.

This perspective aligns with earlier industry observations that many AI failures stem from poor setup or misconfiguration. The paper formalizes this understanding, framing it as a fundamental shift in the software development lifecycle (SDLC) in the AI era.

It also references ongoing discussions about cost management, security, and reliability, emphasizing that controlling the environment yields more predictable and sustainable results than continuously chasing model improvements.

“The model is only about 10% of what determines behavior; the harness and context are 90%. Our focus should shift accordingly.”

— Addy Osmani, co-author of the whitepaper

Unclear Aspects of Implementation and Industry Adoption

It remains unclear how quickly organizations will adopt this new focus on harness and context engineering at scale. The specific best practices, tools, and frameworks for effective implementation are still emerging, and industry-wide standards are yet to be established.

Moreover, the precise quantification of cost savings and performance improvements across different sectors and use cases is still under investigation, meaning that some organizations may experience varied results during their transition.

Next Steps for Organizations and Developers

Organizations should begin by assessing their current AI workflows, focusing on configuration and environment management. Developing or adopting tools for context engineering and observability will be critical. Industry groups and standards bodies may soon publish best practices based on ongoing research.

Further research and case studies are expected to clarify how best to implement these principles at scale, and whether specific frameworks or platforms will emerge as leaders in harness and context management.

Key Questions

Why is the model only 10% of system behavior?

The whitepaper explains that most of the AI system’s performance depends on how the model is integrated, configured, and guided through prompts, tools, and rules, which constitute the harness and context.

How does this shift affect AI investment strategies?

It suggests that organizations should allocate more resources to building robust harnesses and managing context, rather than solely investing in acquiring or upgrading models.

What are the economic implications of this perspective?

While initial setup costs for disciplined engineering are higher, long-term costs decrease due to lower token waste, maintenance, and security risks, making AI development more sustainable.

What remains uncertain about this approach?

It is still unclear how quickly industry-wide adoption will occur, and what specific tools and standards will emerge for effective harness and context engineering.

How can organizations start implementing these insights?

Organizations should evaluate their current AI workflows, focus on improving environment configuration, and invest in tools that support context management and observability.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.