Solutions

Industries

Markets

Partnerships

Resources

Get in touch

By Sam Devakumar, Head of Data & Machine Learning

GenAI Projects in 2025: The Hype Was Real and So Were the Failures

GenAI Projects in 2025: The Hype Was Real and So Were the Failures

Anyone who claims they haven’t failed with GenAI is either being economical with the truth or hasn’t shipped anything that matters. Failure, in and of itself, isn’t the problem. In fact, in emerging technology cycles, it’s expected. But the scale and pattern of failure we’ve seen across GenAI projects in 2025 deserves a closer look. Not for post-mortem curiosity, but for its potential to provide course correction. 

Let’s start with the numbers. Gartner analysts predicted that roughly 30% of GenAI projects would be abandoned after proof-of-concept by 2025. By early 2026, that estimate had climbed to at least 50% according to Gartner. Some broader estimates by RAND suggest that over 80% of AI projects fail and that’s roughly double the failure rate of traditional IT initiatives. 

Of course, ‘failure’ can be subjective here.  A project might never reach production, get blocked indefinitely by compliance, launch but fail to gain adoption, or simply produce no measurable business impact. Sometimes it works until it doesn’t, with quality degrading over time. 

As someone who is responsible for both data and machine learning in a production environment, I can say all those outcomes count. And they’re not cheap.  

A typical GenAI MVP in a European tech scale-up might run for three to six months with a small cross-functional team: product, engineering, data, and domain expertise. Even with conservative assumptions, you’re looking at €150k to €400k per initiative. That’s before you factor in opportunity cost and delayed roadmaps. 

So yes, failure is expensive. But it’s also, in many cases, predictable. 

Are GenAI Projects Actually Harder? 

There’s a sense of déjà vu in many of these post-mortems. Over the years, I’ve seen similar questions asked about data migrations, analytics platforms, and machine learning models. The root causes haven’t fundamentally changed. What has changed is the penalty for getting things wrong. 

GenAI introduces higher uncertainty, more complex evaluation challenges, stricter governance requirements, and a tendency for outputs to degrade over time. It doesn’t create entirely new failure modes, but it does amplify existing ones. 

If your data quality is weak, GenAI will expose it faster. If your product thinking is fuzzy, GenAI will make it obvious. If your engineering practices cut corners, GenAI will punish that too. 

Based on my own experience, I’ve come up with some situations that you can try to avoid if you want to deliver a successful GenAI project. So below are my:

Seven Reasons Why GenAI Projects Fail

1. No Clear Problem 

In 2025, ‘use AI’ became the strategy. That’s usually where things start to go wrong. When a project begins with a solution rather than a problem, you get vague goals, impressive demos, and very little real-world value. ‘Build an agent that answers anything’ sounds ambitious, but it lacks the constraints that make a product useful.  

The projects that succeed are painfully specific. They define a user, a workflow, a decision point, and a measurable outcome. Everything else doesn’t matter, in the end. 

2. Data Isn’t Production-Grade 

Most GenAI prototypes are built on whatever data is easiest to access. That’s fine until you try to scale. Production systems require data that is complete, correct, up-to-date, and governed. They require ownership, lineage, and access controls. Without that, you’re effectively automating inconsistency. One of the most common anti-patterns I’ve seen is the ‘ingest everything’ approach. This includes dumping internal knowledge into a model and hoping it figures things out. It rarely does. 

Good teams curate. They define authoritative sources, assign ownership, and treat data as a product, and not as a byproduct. 

3. Productionisation Is Underestimated 

There’s a massive gap between a working demo and a working product. Running a model locally is trivial. Embedding it into a real workflow, one with proper logging, permissions, fallback mechanisms, and cost controls, is not. Many teams defer integration, assuming it’s a final step. In reality, it’s the hardest part.  

If you don’t design for failure modes, latency, and user context from the beginning, you end up with something that looks good in isolation but fails in practice. 

4. Evaluation Is Missing, or Misleading 

You need to have a satisfactory answer to "How do you evaluate the quality of the outputs and how do you ensure it doesn't degrade over time?"  

Evaluate GenAI outputs using a mix of human scoring on predefined quality rubrics, and LLM-as-judge methods to assess accuracy, relevance, tone, and groundedness. For production use, combine these with repeatable test sets and ongoing monitoring to catch quality drift over time. 

5. No Ownership, Weak Product Integration 

A model without ownership is a feature without a future. Too often, AI teams build something and hand it off, assuming adoption will follow. It doesn’t. If the feature isn’t embedded into existing workflows, if users aren’t trained, and if no one is accountable for outcomes, usage will stall. 

Successful teams treat GenAI features like any other product capability. They define ownership across product, engineering, and business functions. They track usage. They drive adoption. They iterate based on feedback. 

6. Risk, Privacy, and Compliance Are Afterthoughts 

Prototyping in ‘innovation mode’ is fast. Retrofitting compliance is slow. GenAI systems often touch sensitive data, operate in regulated environments, and introduce new risks around explainability and auditability. If those considerations aren’t built in from day one, they will surface later and usually as blockers. I’ve seen projects delayed for months because basic questions couldn’t be answered: What data is used? Where is it stored? Who can access outputs? How long is it retained? These are non-negotiable. 

7. ROI Doesn’t Materialise 

Even when everything works, the business case can still fail. Small efficiency gains don’t always translate into meaningful impact. Adoption may be lower than expected. Costs that are both technical and operational can exceed projections. One of the most common traps is overestimating usage. A tool that saves 30 seconds per task sounds valuable until you realise it adds review overhead, requires training, and might introduce new risks. 

Technical success does not guarantee economic success. 

Here’s My Practical Checklist 

Over time, we’ve developed a simple checklist to sanity-check GenAI initiatives before committing significant resources: 

First, is the problem meaningful enough to justify solving and is the ROI realistic? 

Second, does the solution actually require GenAI, or could simpler approaches deliver most of the value? 

Third, is ownership clearly defined across product, engineering, data, and risk? 

Fourth, is the data ready. Not just available, but governed and maintained? 

Fifth, is there an agreed evaluation framework?  

Sixth, is the scope narrow enough to ship quickly and iterate? 

And finally, is there a feedback loop in place to monitor quality and adapt over time? 

Closing Thoughts 

GenAI projects don’t usually fail because of lack of ambition. If anything, ambition is one of the culprits. Failure occurs because teams skip the fundamentals: clear problem definition, disciplined data practices, thoughtful product design, robust governance, and realistic ROI expectations.  
 
The organizations that are succeeding with GenAI aren’t necessarily the ones building the most sophisticated models. They’re the ones treating GenAI like a product and subjecting it to the same rigor, constraints, and accountability as any other part of the business. In a year defined by hype, that was the real differentiator. 

Anyone who claims they haven’t failed with GenAI is either being economical with the truth or hasn’t shipped anything that matters. Failure, in and of itself, isn’t the problem. In fact, in emerging technology cycles, it’s expected. But the scale and pattern of failure we’ve seen across GenAI projects in 2025 deserves a closer look. Not for post-mortem curiosity, but for its potential to provide course correction. 

Let’s start with the numbers. Gartner analysts predicted that roughly 30% of GenAI projects would be abandoned after proof-of-concept by 2025. By early 2026, that estimate had climbed to at least 50% according to Gartner. Some broader estimates by RAND suggest that over 80% of AI projects fail and that’s roughly double the failure rate of traditional IT initiatives. 

Of course, ‘failure’ can be subjective here.  A project might never reach production, get blocked indefinitely by compliance, launch but fail to gain adoption, or simply produce no measurable business impact. Sometimes it works until it doesn’t, with quality degrading over time. 

As someone who is responsible for both data and machine learning in a production environment, I can say all those outcomes count. And they’re not cheap.  

A typical GenAI MVP in a European tech scale-up might run for three to six months with a small cross-functional team: product, engineering, data, and domain expertise. Even with conservative assumptions, you’re looking at €150k to €400k per initiative. That’s before you factor in opportunity cost and delayed roadmaps. 

So yes, failure is expensive. But it’s also, in many cases, predictable. 

Are GenAI Projects Actually Harder? 

There’s a sense of déjà vu in many of these post-mortems. Over the years, I’ve seen similar questions asked about data migrations, analytics platforms, and machine learning models. The root causes haven’t fundamentally changed. What has changed is the penalty for getting things wrong. 

GenAI introduces higher uncertainty, more complex evaluation challenges, stricter governance requirements, and a tendency for outputs to degrade over time. It doesn’t create entirely new failure modes, but it does amplify existing ones. 

If your data quality is weak, GenAI will expose it faster. If your product thinking is fuzzy, GenAI will make it obvious. If your engineering practices cut corners, GenAI will punish that too. 

Based on my own experience, I’ve come up with some situations that you can try to avoid if you want to deliver a successful GenAI project. So below are my:

Seven Reasons Why GenAI Projects Fail

1. No Clear Problem 

In 2025, ‘use AI’ became the strategy. That’s usually where things start to go wrong. When a project begins with a solution rather than a problem, you get vague goals, impressive demos, and very little real-world value. ‘Build an agent that answers anything’ sounds ambitious, but it lacks the constraints that make a product useful.  

The projects that succeed are painfully specific. They define a user, a workflow, a decision point, and a measurable outcome. Everything else doesn’t matter, in the end. 

2. Data Isn’t Production-Grade 

Most GenAI prototypes are built on whatever data is easiest to access. That’s fine until you try to scale. Production systems require data that is complete, correct, up-to-date, and governed. They require ownership, lineage, and access controls. Without that, you’re effectively automating inconsistency. One of the most common anti-patterns I’ve seen is the ‘ingest everything’ approach. This includes dumping internal knowledge into a model and hoping it figures things out. It rarely does. 

Good teams curate. They define authoritative sources, assign ownership, and treat data as a product, and not as a byproduct. 

3. Productionisation Is Underestimated 

There’s a massive gap between a working demo and a working product. Running a model locally is trivial. Embedding it into a real workflow, one with proper logging, permissions, fallback mechanisms, and cost controls, is not. Many teams defer integration, assuming it’s a final step. In reality, it’s the hardest part.  

If you don’t design for failure modes, latency, and user context from the beginning, you end up with something that looks good in isolation but fails in practice. 

4. Evaluation Is Missing, or Misleading 

You need to have a satisfactory answer to "How do you evaluate the quality of the outputs and how do you ensure it doesn't degrade over time?"  

Evaluate GenAI outputs using a mix of human scoring on predefined quality rubrics, and LLM-as-judge methods to assess accuracy, relevance, tone, and groundedness. For production use, combine these with repeatable test sets and ongoing monitoring to catch quality drift over time. 

5. No Ownership, Weak Product Integration 

A model without ownership is a feature without a future. Too often, AI teams build something and hand it off, assuming adoption will follow. It doesn’t. If the feature isn’t embedded into existing workflows, if users aren’t trained, and if no one is accountable for outcomes, usage will stall. 

Successful teams treat GenAI features like any other product capability. They define ownership across product, engineering, and business functions. They track usage. They drive adoption. They iterate based on feedback. 

6. Risk, Privacy, and Compliance Are Afterthoughts 

Prototyping in ‘innovation mode’ is fast. Retrofitting compliance is slow. GenAI systems often touch sensitive data, operate in regulated environments, and introduce new risks around explainability and auditability. If those considerations aren’t built in from day one, they will surface later and usually as blockers. I’ve seen projects delayed for months because basic questions couldn’t be answered: What data is used? Where is it stored? Who can access outputs? How long is it retained? These are non-negotiable. 

7. ROI Doesn’t Materialise 

Even when everything works, the business case can still fail. Small efficiency gains don’t always translate into meaningful impact. Adoption may be lower than expected. Costs that are both technical and operational can exceed projections. One of the most common traps is overestimating usage. A tool that saves 30 seconds per task sounds valuable until you realise it adds review overhead, requires training, and might introduce new risks. 

Technical success does not guarantee economic success. 

Here’s My Practical Checklist 

Over time, we’ve developed a simple checklist to sanity-check GenAI initiatives before committing significant resources: 

First, is the problem meaningful enough to justify solving and is the ROI realistic? 

Second, does the solution actually require GenAI, or could simpler approaches deliver most of the value? 

Third, is ownership clearly defined across product, engineering, data, and risk? 

Fourth, is the data ready. Not just available, but governed and maintained? 

Fifth, is there an agreed evaluation framework?  

Sixth, is the scope narrow enough to ship quickly and iterate? 

And finally, is there a feedback loop in place to monitor quality and adapt over time? 

Closing Thoughts 

GenAI projects don’t usually fail because of lack of ambition. If anything, ambition is one of the culprits. Failure occurs because teams skip the fundamentals: clear problem definition, disciplined data practices, thoughtful product design, robust governance, and realistic ROI expectations.  
 
The organizations that are succeeding with GenAI aren’t necessarily the ones building the most sophisticated models. They’re the ones treating GenAI like a product and subjecting it to the same rigor, constraints, and accountability as any other part of the business. In a year defined by hype, that was the real differentiator. 

Fourthline has been certified by EY CertifyPoint to ISO/IEC27001:2022 with certification number 2021-039.

Copyright © 2026 - Fourthline B.V. - All rights reserved.

Fourthline has been certified by EY CertifyPoint to ISO/IEC27001:2022 with certification number 2021-039.

Copyright © 2026 - Fourthline B.V. - All rights reserved.