Book a Discovery Call

What Is a Data Readiness Assessment — and Why AI Fails Without One

Buying an AI tool before checking your data is like ordering a production line before measuring the floor. It might fit. More likely you'll discover, expensively, that it doesn't. A data readiness assessment is that measurement — the diagnosis that comes before the prescription. Here's what it is, what it produces, and why skipping it is the single most common way AI projects fail.

A data readiness assessment is a structured evaluation of whether your data is connected, clean, governed, accessible, and structured well enough for BI and AI to run on. It produces three things: a map of the data you have, a ranked list of the gaps, and a costed roadmap to close them — before you spend on tooling.

It's the first step of Discovery & Assessment, and the cheapest insurance you can buy against a failed pilot.

Why it has to come first

The numbers make the case. More than 80% of AI projects never reach production (RAND, 2024), and Gartner expects 60% of those lacking AI-ready data to be abandoned through 2026 (Gartner). Almost every one of those failures traces to the same root cause: the data wasn't ready, and nobody checked before building.

An assessment flips that. Instead of discovering mid-project that the sensor data was never logged or that ERP and MES don't reconcile, you find out first — when it's cheap to fix and easy to resequence. The cost of an assessment is trivial next to the cost of a six-figure pilot that dies on unready data.

What the assessment covers

A real assessment walks the floor and the systems together, because the gaps live between them. It evaluates:

  • Data inventory — what data exists and where: PLCs, SCADA, MES, ERP, QMS, and IoT sensors — including the high-value data that's trapped on a machine or not captured at all.
  • Data quality — completeness, consistency, and accuracy, scored honestly against what AI and BI actually require.
  • Connectivity and silos — where systems don't talk or don't reconcile, and which numbers consequently disagree.
  • Governance and access — who owns the data, how it's defined, and who can reach it.
  • Readiness scoring — all of the above rolled into your stage on the Data Maturity Model, so you know exactly where you stand.

(The hands-on version of the inventory step: How to audit your factory floor data.)

What you actually get out of it

The value is in the deliverables — concrete artifacts you can act on and put a budget against:

  1. A map of your data estate — what you have, where it lives, and where it's trapped. Most leadership teams have never seen this clearly.
  2. Your stage on the maturity model — an honest read on where you are, from Disconnected to Autonomous.
  3. A prioritized gap list — the gaps ranked by impact on margin, so you fix the things that matter first instead of boiling the ocean.
  4. A costed, phased roadmap — what to do, in what order, at what cost — sequenced so each step builds the ground the next one needs.

That roadmap is what turns "we should do something with AI" into a concrete, de-risked plan.

How it differs from just buying a tool

A vendor tool assumes your data is ready. The assessment checks. That's the whole difference — diagnosis before prescription. Plenty of manufacturers skip straight to the tool because it feels like progress, then spend the next six months discovering everything the assessment would have told them upfront. The tool isn't wrong; it's just premature on data that can't yet support it. Confirm readiness first, and the tool you eventually choose actually works.

Where it leads

The assessment is step one of a sequence you can't shortcut. Findings flow into data engineering (connect and clean the sources into one foundation), then analytics (live BI on top), then AI — each stage of the maturity model built in order. The assessment is what tells you where in that sequence to start, and what the first dollar should buy. (See Where to start with manufacturing AI: a maturity-based roadmap.)

Composite Case

A real-world example

(Brief composite illustration — not a specific named client.)

A manufacturer had budget approved for a significant AI rollout and was ready to sign with a vendor. A short readiness assessment first revealed that only a fraction of the data the project needed was actually usable — the rest was siloed, inconsistent, or uncaptured. Rather than burn the budget on a pilot destined to stall, they redirected it: fix the foundation first, then deploy. The AI eventually shipped and delivered — precisely because the assessment caught the gap before the spend, not after.

FAQs

Frequently asked questions

Typically a couple of weeks of floor walks, system reviews, and data-quality scoring — fast relative to the cost of the failed pilot it prevents.
Almost always. Against an 80%+ AI failure rate driven by unready data, an assessment is cheap insurance — it either confirms you're ready or saves you from a far more expensive mistake.
An assessment is still worth running — especially if the project has stalled. It diagnoses why, which is usually the data, and tells you what to fix to get it back on track.

Next steps

3-min assessment

Data Readiness Scorecard

Gauge where your data stands before building anything on top of it.

Take the Scorecard
Service

Discovery & Assessment

We map your data gaps, score your AI readiness, and hand you a prioritised next-step roadmap.

See how it works
Talk to us

Book a Discovery Call

See exactly how we'd approach this for your operation. No pitch decks.

Book a Discovery Call

Sources

  • RAND Corporation (2024) — >80% of AI projects fail to reach production (~2× the non-AI rate).
  • Gartner — 60% of AI projects lacking AI-ready data forecast to be abandoned through 2026.
  • IDC (2022) — >80% of manufacturing data is "dark" / unused.
  • RAND Corporation (2024) — >80% of AI projects fail to reach production (~2× the non-AI rate).
  • Gartner — 60% of AI projects lacking AI-ready data forecast to be abandoned through 2026.
  • IDC (2022) — >80% of manufacturing data is "dark" / unused.