October 05, 2025 · Nicholas Arbuckle

Are File Formats Holding AI Back?

And how LLMs are quietly redefining the future of “formats” themselves.

In the early 2000s, formats started to emerge as a a digital standard of sorts for how work was to be executed in a digital era. Formats such as .docx, .pptx, .xlsx e.t.c became familiar across every organization built processes.

But as Large Language Models (LLMs) evolve, something interesting is happening: the format itself is becoming fluid.

Take Anthropic’s recent update, where Claude can create, edit, and export PowerPoint decks, Word documents, and PDFs through natural language. Whats interesting however, is what happens under the hood, where these systems don’t “type” in PowerPoint; they are actually using HTML, CSS, and rendering layers like Playwright and PptxGenJS to simulate PowerPoint and then output .pptx as a final step. In short, the “format” is now an output layer, not a working constraint.

This means each slide is actually a rendered image embedded in a .pptx container, not a fully editable PowerPoint file. You can open it in PowerPoint, but you can’t easily change the text or move elements, because they’re flattened into pixels.

It’s a brilliant shortcut, but also a subtle limitation.

AI is trading interactivity for fidelity.

As one Reddit user put it:

“Going forward, what’s even the point of formats like PPT or DOCX? Might as well generate slides with HTML, CSS & JS — formats LLMs can manipulate directly for the desired outcome.”

And thats what inspired this article today. They’re right that the implications go far beyond just convenience. The implication is that human driven formats as we know them today, aren’t interoperable with LLM’s, which begs the question, are computer systems as we know them today ready for, and interoperable with, the LLM’s of tomorrow & how will this transformation look?

The Format Collapse: From Static Files to Living Contexts

Traditional formats like .pptx or .docx are rigid. They define structure before intent, a slide deck expects a hierarchy, a Word doc expects linear prose, a spreadsheet expects tables.

LLMs invert that completely. They start with intent, then generate structure dynamically.

In this new world:

“Documents” become context streams, regenerated on demand, not saved once and forgotten.
“Presentations” become interactive canvases, rendered differently each time they’re invoked.
“Reports” become living prompts, pulling real-time data instead of static summaries.

The HTML-based workflow Anthropic uses is an early sign of this potential shift: it’s easier for an AI to reason in web-native, semantic structures than to wrestle with brittle proprietary formats.

In other words, the “native language” of AI is the web open, composable, and machine-readable, not the legacy XML of office software.

The Trust Layer: Privacy Meets Portability

This is where the actual challenge for format iteration becomes clear: as LLMs begin to touch, transform, and reformat every layer of digital content, from emails to spreadsheets to medical reports — who controls the data flow?

Big tech’s “data Stockholm syndrome” business model has always been collect first, secure later. But the world is waking up.

A 2025 Pew Research survey found that 61% of Americans want more control over how AI uses their personal data. And the Edelman Trust Barometer (2025) showed that trust in tech has fallen to its lowest point in over a decade.

We’ve spent decades stuck in what I call “data Stockholm syndrome”, where users are over-trusting platforms that trade free tools for unrestricted access to our data. Now, as AI shows how powerful that data really is, people are realizing just how much they’ve been giving away.

BlueNexus: Code-Level Privacy for a Post-Format World

At BlueNexus, we believe the next generation of AI infrastructure won’t just decide how data is formatted, but how it’s governed.

Our architecture enforces privacy, sovereignty, and compliance through code, not policy. That means:

Data is encrypted and processed only inside Trusted Execution Environments (TEEs), no plaintext exposure, ever.
Users control access cryptographically, not contractually — privacy by default, not by design document.
Developers can connect apps via a single Model Context Protocol (MCP) layer that handles memory, triggers, and file conversion securely across any format, be that .docx, .pdf, .json, or what comes next.

When privacy is built into the substrate, formats become composable, interoperable, and reversible, the way they should have always been.

The Future: From Documents to “Dynamic Knowledge”

Imagine a future where:

You never “open” a Word doc; your AI agent just retrieves the relevant section from your contextual memory.
You never “export” a slide deck; your presentation auto-renders for each audience from a live knowledge graph.
You never “attach” a file; your data sovereignty layer shares only what’s necessary, verifiably, and with your consent.

This is the world we’re moving toward, one where formats become functions, and privacy becomes programmable.

At BlueNexus, we’re building the infrastructure to power that shift, where AI isn’t just aware of your data, but where your data finally belongs to you.

Formats may fade, but trust doesn’t. The future of AI isn’t about file types, it’s about data sovereignty.

As AI blurs the line between documents, data, and dialogue — do you think file formats will survive the next decade, or will we move toward a world of living, fluid knowledge? Share your thoughts below.