
Every CDMO invests millions in state-of-the-art cleanrooms, automated bioreactors, and top-tier LIMS. Their internal data capture is virtually flawless and then, at the very end of this hyper-modern, highly regulated process, they deliver the stability report to the sponsor via an email attachment named: v2_FINAL_revised_financereview2.xlsx.
Here is how data integrity goes to die in a spreadsheet handoff: The LIMS generates a report. The project manager exports it to Excel. QA opens it to add compliance checks. Finance opens it to verify billing milestones. The sponsor finally receives it to ingest into their own systems.
Somewhere between multiple people, departments, and rounds of edits, a cell gets dragged accidentally, a column gets overwritten, and before anyone notices, the spreadsheet has drifted from its original source. Nobody is being careless; it's an inevitable consequence of moving files across teams and workflows.
Nothing immediately looks wrong. The file opens normally, the numbers seem fine, and the report gets shared and approved. The problem arises much later during an audit, when someone asks where a number came from and nobody can trace which version of the spreadsheet actually produced it.
Audit reconstruction costs days, not hours: when a number can’t be traced back properly, teams end up spending days digging through old emails, file versions, and git history trying to piece together what happened.
Excel is excellent analytical tool, but it was never designed to serve as a reliable system to transfer critical data between organizations.
This challenge is becoming increasingly important as CDMOs scale operations and manage larger volumes of regulated data. While many organizations have invested heavily in automation, the absence of robust data traceability, version control, and audit-ready workflows can introduce hidden risks. We explored this challenge in greater depth in our article, Automation Without Traceability: The Silent Risk in CDMO Workflows, where we discuss why automation alone is not enough without systems that preserve data integrity throughout the lifecycle.
Whenever this issue comes up, the first reaction is usually, “We should just stop using spreadsheets.” But that’s not realistic. Scientists and analysts trust Excel for good reason: because it’s fast, flexible, and genuinely useful for working with data. The problem isn’t Excel itself - it’s using spreadsheets as the primary way to move data between teams and organizations.
Now imagine a structured, automated data layer sitting between the CDMO’s internal systems and the sponsor. This fundamentally changes the handoff:
Sponsors already know that many CDMO data handoffs still involve manual steps, spreadsheets, and a certain amount of uncertainty. That has quietly become part of the industry norm.
But when a CDMO can instantly explain what changed between reports, with clear version history, timestamps, and traceable source records - the relationship changes. Conversations become faster, audits become easier, and confidence in the data increases significantly.
The problem is not that teams are using Excel incorrectly- it’s that spreadsheets were never designed to handle regulated, traceable data handoffs between organizations. Until the structured data layer replaces the file as the unit of exchange, spreadsheet hell remains the default. Structural problem needs a structural solution.
It’s time to retire files like v2_FINAL_revised_financereview2.xlsx for good. Let Excel remain the analytical tool, and let systems become the actual medium of exchange.
Watch our on-demand webinar, Competing Smarter with the Right AI and Data Infrastructure for CDMOs, to explore the strategies and technologies shaping the next generation of CDMO operations.