Book a Discovery Call

How to Integrate ERP, MES, and Shop-Floor Data

The three-numbers-that-don't-agree problem has one root cause: systems that were never built to talk to each other. Your ERP knows what you planned, your MES knows what you ran, and your SCADA and PLCs know what the machines actually did — and none of them compares notes. Integration is the work of making them. Here's how to do it, step by step, and the traps to avoid.

**Integrating ERP, MES, and shop-floor data means connecting these systems into one foundation where their data is reconciled to a common model — so a part, a shift, and a "good unit" mean the same thing everywhere. It's built around your existing systems, not by ripping them out.**

This is the core of data engineering, and the leap from a disconnected floor to a connected one.

Why it's hard: the OT/IT divide

Integration is harder in manufacturing than in most industries because you're bridging two different worlds. Operational technology (OT) — your PLCs, SCADA, and MES — speaks industrial protocols, runs in real time, and often uses proprietary or vendor-specific formats. Information technology (IT) — your ERP and QMS — speaks databases and APIs, and often works in batches. The two rarely share common keys, common units, or a common clock.

That gap is the leading technical reason manufacturing AI stalls: RAND's analysis put manufacturing AI failure around 76%, with OT/IT integration and data quality named as the top causes (RAND, 2025). Bridging OT and IT cleanly is exactly what separates a foundation that works from a pile of disconnected systems.

What each system holds

Integration is really reconciliation, so it helps to know what you're reconciling:

  • ERP — orders, inventory, scheduling, cost: what was planned.
  • MES — work orders, sequence, output, rate: what's actually being produced.
  • SCADA / PLCs — temperatures, pressures, cycles, faults: the real-time process.
  • QMS — inspections, scrap, rework: quality.

Each holds one piece of the truth. (For a deeper breakdown of the gaps between them, see ERP, MES, SCADA: what data each system holds.)

How to integrate it, step by step

Step 1 — Map what you have

Inventory every source: which systems, what data they hold, what format and protocol they use, and where the gaps are. You can't connect what you haven't mapped. This is the audit at the heart of data readiness.

Step 2 — Define a common model

Decide, once, what the shared entities and definitions are: a part number is the same part everywhere, a "shift" has one meaning, a "good unit" is counted the same way on the floor and in the back office. This common model is the backbone of reconciliation — without it, you're connecting systems that still disagree.

Step 3 — Build the pipelines

Move data out of each system automatically. On the OT side, a standard like OPC-UA bridges PLCs and SCADA into a common stream; on the IT side, APIs and connectors pull from ERP and MES. Use streaming for real-time floor data and batch (ETL/ELT) where it fits. These data pipelines replace the brittle weekly export. (Primer: Data pipelines for manufacturing.)

Step 4 — Land it in one foundation

Pipe everything into a single governed data warehouse or lakehouse, where it's cleaned, reconciled to the common model, and structured for use. This is the single source of truth the whole operation reads from.

Step 5 — Govern it

Apply data governance: one agreed definition per metric, role-based access, and an audit trail. This is what keeps the integrated foundation trustworthy — and it's non-negotiable in regulated sectors.

Step 6 — Keep it live and healthy

Integration isn't one-and-done. Pipelines need monitoring, new sources get added, and definitions evolve. Tending the foundation is the work of continuous optimization — neglect it and it decays back toward dark data.

Common pitfalls

The mistakes that sink integration projects:

  • Trying to rip and replace. You don't need to scrap a working ERP or MES. Good integration builds around existing investments.
  • Dumping data into a lake with no common model. Connection without reconciliation just creates a bigger, faster mess — not a foundation.
  • Relying on manual exports. A spreadsheet pulled by hand is stale the moment it's saved. Integration has to be automated to stay live.
  • Skipping governance. Without one definition and clear access, the "integrated" data drifts back into conflicting numbers.
  • Underestimating OT. Floor protocols and proprietary formats are genuinely harder than business-system APIs. Plan for them.

The payoff

Integration is the leap from Disconnected to Connected — Stage 1 to Stage 2 on the Data Maturity Model. It's the most important jump in the model, because it's what finally makes one number trustworthy and gives BI and AI something solid to run on. Poor data quality alone costs organizations an average of $12.9 million a year (Gartner); integration is how you stop paying that tax.

Composite Case

A real-world example

(Brief composite illustration — not a specific named client.)

A manufacturer with a heavy logistics and supply-chain operation couldn't trust its OTIF. The ERP tracked what shipped, the MES tracked what was produced, and the two never reconciled — so on-time-in-full was a guess, and customer commitments were made on shaky ground. By defining a common model (one order, one part, one definition of "complete") and piping both systems into a single foundation, OTIF became a real, trustworthy number for the first time. Nothing was ripped out — the existing systems stayed, finally speaking the same language.

FAQs

Frequently asked questions

No. Integration connects the systems you already run; it's additive, not a rip-and-replace. Replacing working systems is rarely necessary and usually the most expensive path.
It depends on how many sources you have and how clean they are — which is why it starts with mapping. A readiness assessment sizes the work before you commit, so there are no surprises.
That's the harder half, and it's handled with standards like OPC-UA to bridge PLCs and SCADA into a common stream, alongside streaming pipelines. Plan for OT explicitly rather than treating the floor like another database.

Next steps

3-min assessment

Data Readiness Scorecard

Gauge where your data stands before building anything on top of it.

Take the Scorecard
Service

Data Engineering

We build the pipelines and data layer that make every system downstream reliable.

See how it works
Talk to us

Book a Discovery Call

See exactly how we'd approach this for your operation. No pitch decks.

Book a Discovery Call

Sources

  • RAND Corporation (2025) — manufacturing AI failure ~76%, with OT/IT integration and data quality named as leading causes.
  • Gartner, *Magic Quadrant for Data Quality Solutions* — poor data quality costs organizations an average of $12.9 million/year.
  • IDC (2022) — >80% of manufacturing data is "dark" / unused.
  • RAND Corporation (2025) — manufacturing AI failure ~76%, with OT/IT integration and data quality named as leading causes.
  • Gartner, *Magic Quadrant for Data Quality Solutions* — poor data quality costs organizations an average of $12.9 million/year.
  • IDC (2022) — >80% of manufacturing data is "dark" / unused.