MilikMilik

How Open Standards Aim to Rein In Runaway AI Token Costs

How Open Standards Aim to Rein In Runaway AI Token Costs
Interest|High-Quality Software

Defining the AI Cost Crisis and the Token Problem

The AI cost crisis is a growing business problem where token‑based pricing, opaque billing, and soaring usage turn AI from an experimental perk into a major, hard‑to‑control infrastructure expense. As large enterprises embed models into coding, research, and support workflows, token consumption rises fast: each prompt, retry, and background task adds to AI token costs. Companies that once encouraged broad experimentation now face budgets that can double or triple as agent-heavy workflows spin up parallel subagents and multi-step reasoning in the background. One unnamed company reportedly spent USD 500 million (approx. RM2.3 billion) in a single month on AI tools after failing to cap licenses, turning premium access into a finance-controlled resource. This shift is forcing procurement teams to ration usage, prioritize cheaper defaults, and scrutinize whether premium models deliver measurable returns rather than mere activity.

Linux Foundation’s Tokenomics Foundation Steps In

Into this turmoil comes the Tokenomics Foundation, a new Linux Foundation initiative created to bring order to AI token economics. Backed by Google, Microsoft, IBM, JPMorgan Chase, Oracle, KPMG, and Salesforce, the group aims to write open standards for how tokens are measured, billed, and reported across the stack. A token is the basic unit of text an AI model processes, and it now sits at the center of infrastructure cost management: it is what the model consumes, what the data center meters, and what the enterprise pays for. Yet token behavior is far less predictable than previous technology expenses, including cloud. The foundation will publish benchmarks and best practices that make AI billing transparency easier for finance, procurement, and engineering teams who currently face fragmented metrics from each hyperscaler, model provider, and hardware vendor.

How Open Standards Aim to Rein In Runaway AI Token Costs

Why Enterprises Are Rationing Access and Seeking Cheaper Models

Rising AI token costs are already reshaping everyday decisions inside large organizations. Procurement and finance leaders now ask which tasks merit premium models, which teams must switch to cheaper tools, and which projects need new budget approval. Agentic workflows worsen the problem: multi-step reasoning, retrieval, code checks, and automated retries can multiply the hidden calls behind one visible request, so invoices spike even as per-call prices fall. According to data cited by The New Stack, average monthly token spend has increased 13-fold since January 2025 among tracked customers. Some firms now treat premium AI as a scarce resource, not a standard entitlement. Amazon’s removal of an internal AI usage leaderboard, after staff chased token counts instead of outcomes, shows how “token maxxing” is giving way to hard spending limits and more disciplined return-on-investment checks.

Open Standards AI and the Push for Billing Transparency

The Tokenomics Foundation’s core promise is open standards for AI token accounting that make costs comparable and explainable. Today, every provider exposes different usage data, mixes metrics, and packages token prices in ways that prevent apples-to-apples comparison. That blinds both buyers and sellers and makes infrastructure cost management reactive. By aligning with the FinOps Foundation, which already built shared practices for cloud bills, the new body wants AI billing transparency to reach the same level of maturity. Shared taxonomies, reference metrics, and standard reports would let finance teams see not only total spend but which applications, workflows, and teams generate it. With GitHub’s move from flat-rate Copilot subscriptions to token-based pricing and the backlash that followed, a neutral framework for explaining and forecasting token spend is turning from a nice-to-have into a business necessity.

From Experiment to Critical Infrastructure Cost Line

As token usage accelerates, AI is shifting from experimental software to a permanent infrastructure line item that demands serious governance. Finance teams now treat AI alongside cloud, SaaS, and data platforms, requiring proof that every additional burst of token activity improves speed, quality, or margin. Data from Goldman Sachs shows how urgent this is, projecting token usage to grow 24-fold between 2026 and 2030, reaching 120 quadrillion tokens per month. Without shared standards, each enterprise must build its own fragmented dashboards and policies to keep up. The Tokenomics Foundation offers a path toward consistent measurement, clearer budgeting, and less guesswork when comparing premium and cheaper AI options. If it succeeds, companies may no longer need blunt rationing alone; they will have a common framework to tune access, choose models, and manage risk with far more precision.

Milik earns a commission when you shop through our links, at no extra cost to you. Editorial content is independently selected by our team.

You May Also Like

Comments
Say something...
No comments yet. Be the first to share your thoughts!