Why AI Is Becoming Too Expensive to Build: Inside...

The New Reality: Compute Costs Dominate AI Budgets

Across industries, AI compute costs are starting to eclipse every other line item in production budgets. A striking example is “Hell Grind,” an AI-generated film premiering at Cannes. The project reportedly cost USD 500,000 (approx. RM2,300,000) to make, with a staggering USD 400,000 (approx. RM1,840,000) spent purely on compute. In other words, about 80% of the total spend went to GPU infrastructure expenses rather than cast, crew, or sets. Every character, set piece, and explosion was generated by AI models that had to be prompted, re-run, and refined thousands of times. This flips the traditional assumption that AI will automatically make creative work cheaper. Instead, it highlights a new cost structure: the more complex and polished the AI output, the more GPU cycles—and cash—are burned, reshaping how studios think about their AI implementation budget.

Inside an AI Film: When GPUs Eat the Production

“Hell Grind” shows how operational AI costs can quietly take over a project. Higgsfield AI reportedly relied on multiple specialized cloud providers optimised for AI workloads, rather than conventional hosting. Each video clip needed prompts averaging 3,000 words, and the team generated more than 16,000 clips for just the opening sequence, discarding most of them. Every failed attempt still incurred GPU compute costs, driving the infrastructure bill to around four-fifths of the total production spend. Traditional mid-budget films with similar visual ambition might spend heavily on locations, stunts, and VFX teams; here, the main expense was simply renting enough compute to iterate until the AI output looked acceptable. This illustrates a core tension in AI economics: quality requires scale, and scale demands vast, continuous GPU infrastructure expenses that can rival or exceed entire conventional production departments.

When AI Tools Outspend Human Engineers

In software development, AI coding assistants were expected to lower costs. Instead, they are exposing how quickly an AI implementation budget can be blown. Microsoft recently cancelled most direct licenses for Anthropic’s Claude Code after the tool became “a little too popular” internally, rapidly exhausting its allocated token budget. Engineers were told to migrate to GitHub Copilot CLI instead, a move that coincides with the end of Microsoft’s fiscal year and reflects mounting concern over operational AI costs. Uber provides an even sharper warning: after giving about 5,000 engineers access to Claude Code and rival tool Cursor, the company burned through its entire 2026 AI coding-tool budget in just four months. Monthly per-engineer costs reportedly ranged from USD 500 (approx. RM2,300) to USD 2,000 (approx. RM9,200), forcing leadership to reconsider whether the GPU-backed tools justify reductions in engineering headcount.

Why AI Is Becoming Too Expensive to Build: Inside the Compute Cost Crisis

The Agentic AI Paradox: Falling Prices, Rising Bills

Part of the crisis comes from how modern AI systems work. Companies pay for usage in tokens—the units of text models read and generate—while infrastructure providers recover massive GPU compute costs behind the scenes. Token prices have been falling, but total invoices are rising because companies are shifting toward agentic AI systems that autonomously plan, call tools, and iterate. These systems consume far more tokens per task than simple prompt–response chatbots. As a result, total token usage outpaces unit price declines, sending operational AI costs higher even as individual calls get cheaper on paper. Nvidia executive Bryan Catanzaro has warned that AI compute costs now significantly exceed employee payroll in many scenarios, undermining the idea that AI is a straightforward replacement for human labor. The paradox: smarter, more autonomous AI often costs more to run, not less.

From Hype to Discipline: AI ROI Faces a Reckoning

The emerging pattern is clear: without strict controls, AI compute costs can outstrip the value delivered. Microsoft’s move to standardize on its own tooling, Uber’s scramble after burning its annual budget in months, and internal cultures of “tokenmaxxing” show how ungoverned enthusiasm leads to runaway spend. Some firms are even finding that AI tools are more expensive than hiring additional employees for equivalent work, challenging the narrative that AI automatically improves margins. For enterprises, the next phase of AI adoption will hinge on treating GPU infrastructure expenses like any other scarce resource: metered, budgeted, and tied to measurable outcomes. That means monitoring usage, capping access, and demanding clear productivity or product wins. Until AI economics align with sustainable ROI, the biggest barrier to implementation will not be model quality—it will be the compute cost crisis.