Key Takeaways
- OpenAI and Dell announced a May 18 deal to bring Codex closer to company systems.
- Codex is an AI agent that can help with software work, code and internal documents.
- For CFOs, the risk is simple: who can the agent reach, what can it change and what does it cost?
- The first finance question is not "does it work?" It is "can we control it?"
What did OpenAI and Dell actually announce?
OpenAI and Dell are moving Codex closer to company systems. That makes the AI agent part of the finance control conversation.
OpenAI announced on May 18 that it is working with Dell Technologies to bring Codex into hybrid and on-premises setups. "On-premises" means the tool can run closer to a company's own servers and private systems instead of only in a public cloud.
OpenAI said the goal is to put Codex closer to the company context it needs. That includes code, documents, business systems, work notes and team workflows.
That matters because Codex is not just a chatbot. It is an AI agent that can help with software work. If it is close to company systems, it may also be close to private data, internal tools and the work that keeps the business running.
Dell framed the news as part of its AI Factory push. In plain English: Dell wants to help companies run AI tools on their own technology stack. That may help with control. It also raises new questions about cost, access and review.
Why does on-prem Codex matter to CFOs?
On-prem Codex matters because an AI agent near company systems can affect cost, security and who controls the work.
A CFO does not need to manage every coding tool. But finance should care when an AI agent can touch company documents, business systems or private work notes. At that point, the spend is not just a software subscription. It is part of the company's operating system.
This is different from buying a small SaaS tool. An on-prem setup can include hardware, support, usage tracking, security review and staff time. Those costs do not always show up in the first sales pitch.
The point is not to slow down AI. The point is to count the whole cost. Codex may help teams move faster. But if another team has to review every output, fix mistakes and watch the logs, the savings are smaller than they look.
Here is the simple version. A $250,000 pilot for one tech team is one decision. A $1.2 million rollout that touches product code, customer files and internal systems is a company decision. The CFO should be in that room.
| Decision Area | Regular Cloud AI Tool | Codex Near Company Systems | CFO Question |
|---|---|---|---|
| Cost | Seat or usage fee | Hardware, support and usage costs | What is the cost per accepted output? |
| Data access | Limited vendor workspace | Internal systems and documents | Who approves each system connection? |
| Evidence | Prompt logs if enabled | Internal audit trail required | Can we reconstruct what the agent changed? |
| Ownership | Department buyer | IT, security, finance and engineering teams | Who signs off before expansion? |
How is this different from buying another SaaS AI tool?
A cloud AI tool is usually a vendor choice. An AI agent inside company systems is a work process choice.
That difference matters. If a team buys an AI coding tool in the browser, finance can review the price, contract and basic security terms. If Codex runs close to internal systems, the review needs more detail.
Start with basic questions. Which code files can Codex see? Which ticket systems can it open? Can it reach finance systems? Can it write changes or only suggest them? Who checks the work before it goes live? How long do logs stay saved?
Those may sound like IT questions. They are also finance questions because they affect cost, risk and proof. A poorly controlled internal agent can touch the wrong file, create bad work or add review time that wipes out the savings.
The finance risk is quiet
The first failure may not look like a security breach. It may look like one team claiming AI savings while another team does all the checking. CFOs should count the full workflow cost, not just the time one team says it saved.
What controls should finance require before Codex touches internal systems?
Finance should require access limits, review steps, usage logs and one clear owner before Codex connects to company systems.
Start with a list of systems. Codex should not connect to everything just because it can. Each connection needs a reason and an owner.
Code files are one type of access. Internal documents are another. Systems with customer, finance or employee data need a higher bar.
Then define what Codex can do. Reading a file is different from changing a file. Drafting a code change is different from sending that change live. The review rule should match the risk.
Finance should also ask for a simple report before expansion. How many tasks did Codex start? How many finished? How many were accepted on first review? How many were sent back? That is the same lesson from Nexairi's recent AI agent benchmark coverage: a good demo does not prove real work got done.
A first report does not need to be fancy. It can be a weekly table from the engineering lead. Show the task name, the system Codex touched, the reviewer, the result and the rework time. If that table is hard to fill out, the process is not ready to scale.
What should CFOs measure before expanding enterprise AI agents?
CFOs should measure accepted work, review time, error rate, system cost and control failures before scaling Codex.
The cleanest pilot is one narrow workflow. Pick one code cleanup task, one reporting task or one internal document cleanup task. Run Codex on that known backlog. Track how many outputs pass human review without rework.
Do not measure activity. Measure accepted work. An agent that opens 50 code changes and creates 40 review problems is not productive. An agent that completes 12 routine fixes with clean logs and low review time may be worth expanding.
The CFO also needs a stop rule. If errors stay high after the pilot, fix the workflow before adding more systems. The fastest way to waste AI budget is to connect agents to messy processes and then blame the model for reflecting the mess.
The stop rule should be written before launch. For example: if more than 20% of Codex outputs need major rework after 30 days, the pilot pauses. That gives teams room to learn without letting a weak process spread.
OpenAI and Dell are making it easier to put AI agents near sensitive systems. The winners will treat that like infrastructure: budget it, measure it and control it before it becomes normal.
Sources
Related Articles on Nexairi
Free Assessment
Is your firm ready for AI?
A 5-minute governance check for CPA firms using ChatGPT, Copilot or AI accounting software. Get your score and your top gaps — free.
Staff Writer
Curated insights from the NEXAIRI editorial desk, tracking the shifts shaping how we live and work.


