GitHub Copilot pricing and the new AI Credits model

From Premium Requests to Token-Based Billing: What Changed

GitHub Copilot’s new token-based billing is a pricing model where users receive monthly AI Credits that are consumed per token of AI usage, tying costs directly to the size of prompts, responses, and chosen models instead of fixed request counts. Previously, Copilot combined a flat subscription with “premium requests” that gave access to heavier compute tasks without exposing their real cost. GitHub absorbed much of these escalating inference costs as developers shifted from quick inline suggestions to long-running, agentic sessions across entire repositories. Now each plan includes a bundle of GitHub AI Credits, and every token of input, output, or cached data draws from that pool at published model rates. Code completions and next-edit suggestions remain unmetered for paid plans, but advanced chat and agent-style workflows are now firmly on the meter, making GitHub Copilot pricing more tightly linked to actual usage.

GitHub Copilot’s New Token-Based Billing: What It Means for Your Costs

How the New AI Credits System and Flex Allotments Work

Under the new AI Credits system, each paid Copilot plan includes a monthly credit bundle, with one credit equal to one cent of metered AI usage. The Pro plan at USD 10 (approx. RM46) per month includes 1,500 credits, worth USD 15 (approx. RM69) in AI usage, while Pro+ at USD 39 (approx. RM179) includes 7,000 credits. A new Copilot Max tier costs USD 100 (approx. RM460) per month and comes with 20,000 credits, equivalent to USD 200 (approx. RM920) in metered usage. According to Microsoft VP Joe Binder, the base credits are permanent, while an added “flex allotment” is designed to change over time as model pricing and AI infrastructure costs evolve. Business and enterprise seats keep per-user pricing at USD 19 (approx. RM87) and USD 39 (approx. RM179), with credits matched to those prices but no flex allotment.

Metered Usage: Why Credits Can Disappear in Days

The move to usage-based billing exposes how quickly different models and habits consume tokens. Under the old premium request system, a short question and a multi-hour refactor felt identical from a cost perspective. Now every token counts, and the gap between models is clear. One million output tokens from a lightweight model such as GPT-5.4 nano costs about USD 1.25 (approx. RM6) in credits, while the same output from a frontier-class GPT-5.5 costs about USD 30 (approx. RM138). A “build a Minesweeper game” prompt via Claude Haiku 4.5 has been shown to burn around 94 credits on its own. For developers who leave defaults on, a few exploratory sessions with large context windows can consume what once looked like months of headroom. Many are discovering that spontaneous experimentation now has a visible price tag under the AI Credits system.

Enterprise Budgets, Shared Pools, and Copilot Cost Management

For organizations, GitHub has tied Copilot cost management to shared credit pools and new budget controls. Business and enterprise plans retain per-seat pricing, but credits are now aggregated at the organization level instead of locked to individuals. Power users can draw more from the pool when they run sustained agentic work, while lighter users consume far less, smoothing out team-wide usage. GitHub has introduced layered budget tools: universal user-level budgets, per-user overrides, cost center budgets, and an enterprise-wide cap. Together, these settings define how quickly the shared pool can be spent and what happens when it runs out. Through August, GitHub is offering promotional credits of USD 30 (approx. RM138) per Business user and USD 70 (approx. RM322) per Enterprise user to ease the transition. Code review now consumes both AI Credits and GitHub Actions minutes, adding another dimension for finance and engineering leaders to track.

Practical Strategies for Living with Usage-Based Billing

The shift to token-based billing nudges developers to rethink daily Copilot use. Since costs vary by model and context size, a straightforward approach is to reserve expensive frontier models and long-running agent sessions for clearly defined tasks, while keeping routine queries on lighter models. Watching the credit meter during the first month can reveal which workflows are unexpectedly costly and where small habit changes, like trimming prompt size, make a difference. For heavy individual users who keep hitting limits, the Copilot Max plan with USD 200 (approx. RM920) in monthly credits may be easier than constantly throttling usage. In teams, admins can tune default budgets and cost center caps so high-value projects get enough headroom without risking runaway spend. Usage-based billing is meant to align GitHub Copilot pricing with real consumption; now the challenge is aligning coding patterns with those same signals.