The Document Bottleneck Holding Up AI Transformation

Companies have thrown serious money at AI, but for finance and compliance teams, the most critical information is still locked away in documents that AI systems cannot read or make sense of. This perspective is attributed to the Adobe Team.

There's a massive investment in AI going on right now, and it's flowing into all sorts of organisations. Yet in boardrooms, finance departments, and compliance teams, a single issue is emerging time and time again. It's not a lack of data, a shortage of talent or a scarcity of budget. It's a document.

For finance teams, a not insignificant amount of working time is spent poring over dense, unformatted documents: contracts, board packs, ESG reports and all the rest. And when organisations ask whether AI can extract the important stuff from those documents with any reliability, the answer is usually, we're not really sure.

This is a bit of a chasm in enterprise AI adoption, and one that hasn't had the attention it deserves.

Research from IDC says that around 80% of enterprise data is unformatted, and an awful lot of that is stuck in documents. Meanwhile, McKinsey reckons that knowledge workers spend an average of 2.5 hours a day on their feet, trawling through information, that's a consistent finding that's turned up in study after study. And it's been over 30 years since Adobe invented the PDF format, which is still the standard for business documents the world over.

 

Where AI Has Served a Purpose; and Where It Hasn't

AI programmes have delivered some real gains in areas where the data is clearly labelled and structured: customer service automation, supply chain forecasting, financial modelling. These areas are where AI has genuinely made a difference.

But many business decisions need more than just structured data. They need documents: a risk committee working through a supplier contract, a board getting its quarterly report, a compliance team producing an ESG disclosure, a legal team sorting out liability across a portfolio of agreements.

These processes are document-heavy, time-consuming and mainly done by hand. AI systems that have done well in other parts of the business often hit a brick wall when documents come into the picture.

In lots of organisations, the important information isn't stored in a nice neat database; it's stuck in a document. And for the most part, AI hasn't been designed to deal with it.

Why the Problem of Documents Has Been So Tough to Fix

The technical challenge is real. Documents are a nightmare: free-form writing, tables, footnotes, formatting conventions that just don't translate well into machine-readable data. AI models that are built for structured data struggle with this kind of content.

And then there's the problem of scanned PDFs. While modern PDFs are generally readable by machines, a lot of older contracts, filings and board materials were digitised as image files rather than as text, so their content is invisible to most automated processing systems, including AI.

Regulatory developments are making things worse. The Corporate Sustainability Reporting Directive, ISSB standards and growing scrutiny of ESG disclosures mean organisations are having to produce, interrogate and stand up behind more and more critical documentation than ever before. Manual workflows that were good enough in a less demanding regulatory environment are starting to buckle under the strain.

For finance and compliance leaders, the implications are on the ground. A clause that was overlooked in a supplier contract, a mistake in an ESG report or a delay in getting board materials together can all have real-world consequences for an organisation, and as compliance requirements get tougher, the risk is going up all the time.

A Capable AI is Emerging

AI-powered document tools have been around for a while now, from early search and OCR applications through to today's more advanced natural language processing systems. What's changed in recent years is that these tools have got a lot more capable, and are now ready for enterprise use.

Adobe's Acrobat AI Assistant is one recent example of the more advanced AI tools that are now available. Built on the PDF format that Adobe introduced all the way back in 1993 (and which is still the standard the world over) it lets professionals query complex documents in plain English, comparing versions of a governance document, condensing a board pack into an executive summary or interrogating a regulatory filing all in a fraction of the time it used to take.

For teams that have big volumes of documentation to deal with, capabilities like automated summarisation and cross-language translation help to address the operational bottlenecks that have historically taken up so much time.

The Case for Acting Now

Where AI can help with document review and extraction, compliance teams can redirect capacity from processing to analysis and decision-making, and where ESG reports can be automatically checked for internal consistency, governance risk can be managed more proactively. And where board materials can be interrogated quickly, decision-making at the executive level gets a lot better.

Companies that have not yet applied AI to their document workflows are missing out on a productivity and risk-reduction opportunity that's real and growing all the time, in an environment where regulatory and competitive pressure to act is getting stronger by the day.

Getting AI transformation to the point where it can reach all the information in an organisation, not just the bits that are neatly structured and labelled, is a game-changer. And the organisations that will get the most out of AI over the coming years won't just be the ones with the most models deployed. They'll be the ones that've thought carefully about where their information actually lives, and made sure their AI programmes are up to the job. For a lot of businesses, a lot of that information is locked up in documents. The tools to get to it are out there. So the big question isn't whether to go after it now, it's how fast can you?

By Adam Luciano, Vice President of Product Management, MariaDB plc.
By Derek Slager, Co-founder and Co-CEO, Amperity.
By Ismail Amla, Senior Vice President Kyndryl Consult at Kyndryl.
By Ciaran Flanagan, Vice President & Global Head of Data Center Solutions & Services, Siemens.
By Will Holmes, Business Manager (Public Sector), Panasonic TOUGHBOOK
Written by Cyrille Badeau, Vice President, International Markets, Securonix
Casper Klynge, Vice President, Head of Government Affairs and Public Policy in EMEA at Zscaler
By Katie Barnett, Director of Cyber Security, Toro Solutions