NOW LET US – AI RAG SaaS Studio TP.HCM
NOW LET US
Digital Product Studio
Back to news
DEV-TOOLS...4 min read

Codex pricing to align with API token usage, instead of per-message

Share
NOW LET US Article – Codex pricing to align with API token usage, instead of per-message

OpenAI is transitioning Codex pricing from a per-message model to a token-based structure to align with API usage. This change affects ChatGPT Business and new Enterprise plans immediately, with others to follow.

Overview

This article outlines the current credit rates for Codex, under the flexible pricing structure for Plus, Pro, Business, and Enterprise/Edu plans.

Note: As of April 2, 2026, we’ve updated Codex pricing to align with API token usage, instead of per-message pricing. This change is applicable to new and existing ChatGPT Business and new ChatGPT Enterprise plans - please refer to the new rate card in the section below for details.

**Customers on existing Plus, Pro and Enterprise/Edu plans **should continue to use the legacy rate card. We’ll migrate you to the new rates in the upcoming weeks.

All customers should **carefully review both rate cards **to understand the pricing changes, and how they apply to your plan.

Codex rate card - token based pricing

This rate card applies to the following customer plans:

New and existing ChatGPT Business customers****New Enterprise customers

Codex usage is priced based on **API token usage, **calculated as credits per million input tokens, cached input tokens and output tokens. Learn more about tokens here.

This format replaces average per-message estimates with a direct mapping between token usage and credits. It is most useful when you want a clearer view of how input, cached input, and output affect credit consumption.

Under this model, actual credit usage depends on the mix of input, cached input, and output tokens in each task. The table below displays **credits per 1M tokens for each token type. **

Model | Input Tokens | Cached input tokens | Output Tokens | |---|---|---|---| | GPT-5.4 | 62.50 credits | 6.250 credits | 375 credits | | GPT-5.4-Mini | 18.75 credits | 1.875 credits | 113 credits | | GPT-5.3-Codex | 43.75 credits | 4.375 credits | 350 credits | | GPT-5.2-Codex | 43.75 credits | 4.375 credits | 350 credits | | GPT-5.2 | 43.75 credits | 4.375 credits | 350 credits | | GPT-5.1-Codex-Max | 31.25 credits | 3.125 credits | 250 credits | | GPT-5.1-Codex-mini | 6.25 credits | 0.625 credits | 50 credits |

Note:

Fast mode consumes 2x as many credits.

Code review uses GPT-5.3-Codex.

GPT-5.3-Codex-Spark may be available in Codex as a research preview - credit rates for this model are not final.

Read about

Codex usage rate limits.

On average, Codex costs ~$100-$200/developer per month, though there is a large variance depending on model used, number of instances users are running, automations and usage of fast mode. Read more about best practices in maximizing your rate limits and managing token consumption.

You can monitor your workspace's token usage in Codex settings > Usage panel.

Legacy Rate Card

This rate card applies to the following plans:

New and existing Plus/Pro customersExisting Enterprise/Edu customersNew & existing ChatGPT Edu, Teacher and Healthcare customers

*Customers on the plans referenced above should continue using the legacy rate card until we migrate you to the new token-based pricing for Codex. Also read about new Codex seats for ChatGPT Business and ChatGPT Enterprise. *

Existing Plus/Pro and Enterprise/Edu customers should **continue to use the legacy rate card **displayed below, until we migrate you to the new rates in the future.

Plus/Pro and Edu users should monitor this rate card and our release notes pages for information on when the new rates apply.

Specifics of the migration, including timelines, will be provided to Enterprise admins and owners by email - contact your OpenAI sales representative if you have questions about the migration.

The legacy rate card expresses Codex usage as approximate average credits per message or pull request. These averages are useful for rough planning, but actual credit usage can vary based on task size, model choice, and reasoning requirements.

Unit | GPT-5.4 | GPT-5.3-Codex | GPT-5.1-Codex-mini | | |---|---|---|---|---| | Local Tasks | 1 message | ~7 credits | ~5 credits | ~1 credit | | Cloud Tasks | 1 message | ~34 credits | ~25 credits | Not available | | Code Review | 1 pull request | ~34 credits | ~25 credits | Not available |

These averages also apply to legacy GPT-5.2, GPT-5.2-Codex, GPT-5.1, GPT-5.1-Codex-Max, GPT-5, GPT-5-Codex, and GPT-5-Codex-Mini.

FAQ

Why are there two Codex rate cards?

We’ve modified our pricing from credits per message, to credits per token type consumed. OpenAI supports both the legacy rate card and the updated token-based rate card. The applicable version depends on workspace migration status.

Which rate card should I use?

New and existing ChatGPT Business customers, and new ChatGPT Enterprise customers should use the token-based pricing rate card. Customers on all other plans should use the legacy rate card. We’ll continue to update this page over time as we migrate your plan to the new rates.

What changed in the updated token-based rate card?

The legacy rate card shows approximate average credits per message or pull request. The updated token-based rate card shows credits by token type and converts API-priced usage into credits.

Why is the rate card being changed?

Credits remain the core pricing unit that customers purchase and consume. The updated token-based format makes credit usage easier to map to actual model activity, aligns Codex pricing more closely with token-based metering, and gives clearer visibility into how input, cached input, and output contribute to total usage.

How does this affect my pricing?

The impact depends on your workload mix. Some users may see higher credit consumption, while others may see lower credit consumption, depending on how much input, cached input, and output their tasks use. Output-heavy tasks and fast mode generally consume more credits than lighter tasks.

© 2026 Now Let Us. All rights reserved.

Source: Hacker News

Advertisement
Ad slot ready: 5887729102

More in this category

NOW LET US Related – Leaving Mozilla

dev-tools

Leaving Mozilla

A poignant and candid reflection from a 15-year Mozilla veteran upon their departure. The author highlights the leadership's missteps in trying to emulate tech giants and urges Mozilla to return to its core values: community and uniqueness.

NOW LET US Related – Shepherd's Dog: A Game by the Most Dangerous AI Model

dev-tools

Shepherd's Dog: A Game by the Most Dangerous AI Model

A developer tested Anthropic's latest, supposedly 'too dangerous' AI model by asking it to build a long-held game idea in a single shot. The model succeeded, generating a complete 2,319-line game after a 45-minute reasoning session.

NOW LET US Related – Open source AI must win

dev-tools

Open source AI must win

If artificial intelligence becomes a utility rented only from a few closed institutions, humanity loses its operational freedom. Open-source AI is a vital infrastructure for the future of our digital society.

NOW LET US Related – Statement on US government directive to suspend access to Fable 5 and Mythos 5

dev-tools

Statement on US government directive to suspend access to Fable 5 and Mythos 5

The US government has issued an export control directive forcing Anthropic to suspend all access to its Fable 5 and Mythos 5 models due to national security concerns, a move the AI safety startup strongly disputes.

NOW LET US Related – Electric motors with no rare earths

dev-tools

Electric motors with no rare earths

Renault Group is pioneering the development of electrically excited synchronous motors (EESM) that eliminate the need for rare earth magnets, reducing dependency on global monopolies while driving efficiency and sustainability.

NOW LET US Related – Swift at Apple: Migrating the TrueType hinting interpreter

dev-tools

Swift at Apple: Migrating the TrueType hinting interpreter

Apple has rewritten its TrueType hinting interpreter from C to memory-safe Swift for its Fall 2025 OS releases, improving security and boosting performance by an average of 13%.

EXPLORE TOPICS

Discover All Categories

Deep dive into the specific technology sectors that matter most to you.