A factory floor data audit inventories the data your systems produce, scores its quality, finds the silos and gaps, and ranks what to fix first. It's the practical groundwork for data readiness — and the cheapest way to avoid spending on tooling your data can't yet support.
Done well, it turns "we should do something with our data" into a concrete, ordered plan.
Why audit first
Skipping the audit is how AI pilots die. More than 80% of AI projects fail to reach production (RAND, 2024), almost always because nobody checked the data first. And the raw material is usually there — IDC has estimated over 80% of manufacturing data is "dark," captured but unused (IDC, 2022). An audit surfaces exactly what you have and what's missing, so you fix the right things in the right order instead of guessing.
The step-by-step audit
Step 1 — Walk the floor and list every source
Inventory every system that produces data: PLCs and machine controllers, SCADA, MES, ERP, QMS, IoT sensors — plus the spreadsheets and paper logs people actually rely on. For each, note what it holds, what format it's in, and how you'd get data out.
Step 2 — Trace where the data goes (or doesn't)
For each source, follow the data. Is it captured centrally, logged only on the machine, exported by hand, or not captured at all? This is where you find the trapped and lost data — the sensor stream nobody stores, the report that exists only in someone's inbox.
Step 3 — Score the quality honestly
Rate each source on completeness, consistency (are units and part numbers standardized?), accuracy, and timeliness. Be ruthless — optimistic scoring here just defers the pain to a failed project later.
Step 4 — Map the silos and gaps
Identify where systems don't reconcile (the reason two reports show two numbers), where high-value data isn't captured at all, and where definitions disagree across systems or sites. (Deeper: Data silos in manufacturing.)
Step 5 — Rank by impact
Not every gap matters equally. Rank them by cost — which gaps drive downtime, force late decisions, or block the analytics and AI you want. Fix the highest-impact ones first instead of trying to fix everything.
Step 6 — Turn it into a roadmap
Convert the ranked gaps into a sequenced plan, and place yourself on the Data Maturity Model. The audit tells you where you are; the roadmap tells you what to do next, in order.
Red flags to watch for
During the walk, these tell you the foundation is the bottleneck:
- Two reports showing different numbers for the same KPI.
- OEE or other metrics built by hand in spreadsheets.
- Equipment data generated but logged nowhere central.
- Critical know-how living in one veteran's head.
- Every data question routing through IT and arriving late.
If several show up, you've confirmed the diagnosis. (More: 7 signs your data isn't ready for AI.)
DIY vs a formal assessment
You can start this audit yourself — and you should, even informally, before any tooling decision. A formal Discovery & Assessment goes further: rigorous quality scoring, a deeper silo analysis, and a costed roadmap. Think of the DIY audit as the first pass that tells you whether you need the full one. (What the formal version delivers: What is a data readiness assessment.)
Either way, the findings feed the same next step — connecting and cleaning your sources into one foundation, the work of data engineering.
A real-world example
(Brief composite illustration — not a specific named client.)
A manufacturer ran a floor data audit expecting to confirm they were "mostly there." Step 2 said otherwise: the vibration and cycle data from their most critical machines wasn't being logged anywhere central — it was generated and discarded in real time. They'd been about to buy a predictive-maintenance tool that would have had nothing to learn from. The audit caught it before the spend, and reordered the plan: capture and connect the data first, then add the tool.
Frequently asked questions
Next steps
Data Readiness Scorecard
Gauge where your data stands before building anything on top of it.
Take the ScorecardDiscovery & Assessment
We map your data gaps, score your AI readiness, and hand you a prioritised next-step roadmap.
See how it worksBook a Discovery Call
See exactly how we'd approach this for your operation. No pitch decks.
Book a Discovery CallSources
- RAND Corporation (2024) — >80% of AI projects fail to reach production, most often because data readiness was never checked.
- IDC (2022) — >80% of manufacturing data is "dark" / unused.
- RAND Corporation (2024) — >80% of AI projects fail to reach production, most often because data readiness was never checked.
- IDC (2022) — >80% of manufacturing data is "dark" / unused.