There’s a phrase that comes up in almost every honest conversation about enterprise AI right now: pilot purgatory.
You know the place. A proof-of-concept that went well and
generated genuine excitement. A follow-up pilot, broadly successful. Maybe a
third initiative. The portfolio of “in progress” AI projects grows. The number
of those projects that actually reach production and stay there, at scale,
delivering measurable value does not.
Gartner predicted that 30% of GenAI projects would be
abandoned after the proof-of-concept phase by the end of 2025. McKinsey’s
State of AI 2025 found that nearly two thirds of organizations are still
mostly experimenting, only a third are genuinely scaling AI across business
functions. And only 6% qualify as true AI high performers, achieving 5% or
more EBIT improvement they can directly attribute to AI.
Meanwhile, the IBM CEO Study 2025 found that just
25% of AI initiatives delivered their expected ROI and only 16% have
scaled enterprise-wide, despite the fact that most CEOs remain committed to
accelerating their AI investments. The PwC Global CEO Survey 2025 puts
another data point on this: 56% of CEOs report no significant financial
gain from AI so far.
So what’s actually going on?
In my experience, pilot purgatory rarely happens because the
technology failed. It happens because of a cluster of organizational and
strategic failures that show up consistently, across industries, regardless of
how sophisticated the AI itself is.
I spoke recently with a technology director at a large
retail business. They had fourteen AI initiatives running simultaneously, each
with a sponsor, a team, and a budget. Two years in, two were in production. The
other twelve were somewhere between “active development” and “we don’t talk
about that one anymore.” The problem wasn’t capability or commitment. It was
that fourteen parallel bets had produced fourteen half-finished things, and
nobody had the organizational bandwidth to take any of them properly to scale.
Here’s the counter-intuitive lesson: more AI projects
doesn’t mean more AI progress. In most organizations, it means the opposite. Portfolio
sprawl creates the illusion of momentum while actually diffusing the focus,
budget and organizational energy that scaling any single initiative requires.
The organizations consistently outperforming their peers are running fewer AI
programs, not more and taking each one further.
Four reasons pilots don’t make it to production consistently
come up:
The wrong use case:
In the rush to demonstrate AI ambition, organizations pick
what’s technically impressive over what’s strategically valuable. A pilot that
wows in a demo but doesn’t move a business metric will never find a sponsor to
take it further.
No success metric defined upfront: Without a pre-agreed
definition of success, you can’t declare it. Without a clear success story,
momentum evaporates. The project drifts into the backlog.
The business didn’t co-own it: Technology-led projects
handed over to the business at the end almost never stick. If the business doesn’t
help design it, they won’t champion its adoption.
The operating model was an afterthought:
Scaling AI means redesigning workflows, retraining people,
and establishing monitoring loops. Treating this as a footnote is how you end
up with a model in production that nobody uses.
So what do the 6% do differently?
They start with the business outcome, not the technology.
Every use case they pursue begins with the question: what specific,
measurable business result are we trying to improve? The AI is the means.
The business result is the brief.
They build foundations before use cases. Data
infrastructure, governance, security, model monitoring these investments are
made once and repay across every subsequent project. Skipping them for speed
creates technical debt that comes due faster and more expensively than anyone
anticipates.
They make business units co-owners. Not just stakeholders.
Not just recipients. Co-owners, with skin in the game, who helped design the
solution and have a personal interest in making it work. This changes
everything about adoption and scale.
They design for iteration, not for perfection. The best AI
programs launch at minimum viable scale, measure religiously, and improve
continuously. The worst ones spend 18 months trying to get the pilot exactly
right before anyone’s allowed to use it.
And returning to the theme of Post 3 they have executive sponsors who actually
understand what the program needs. Not sponsors who approved the budget and
moved on. Sponsors who show up, remove blockers and make the business case
internally when AI needs defending.
If you’re reading this from inside pilot purgatory right
now, here’s what I’d say plainly: the answer isn’t more pilots.
It’s ruthless prioritization two or three use cases, fully
owned, taken all the way through to real business impact. Then build from
there.
The organizations delivering real results from AI aren’t
necessarily smarter or better resourced. They just stopped treating the pilot
as the destination.
That wraps up The CTO’s AI Playbook. Five challenges. Five
posts. All of them solvable but none of them easy, and none of them purely
technical. If this series has sparked a thought, a debate, or a conversation
you needed to have with your leadership team, then it has done its job. I’d
genuinely love to hear which challenge resonates most with where your organization
is right now.
No comments:
Post a Comment