OpenAI and Broadcom Unveil Jalapeño, a Custom AI Chip 50% Cheaper Than GPUs

OpenAI and Broadcom unveiled Jalapeño, an inference accelerator costing half as much to run as typical AI GPUs, developed in a record nine months using OpenAI's own models.

For years, every ChatGPT response came at a steep hardware cost — OpenAI's bills to Nvidia for server time ran into the billions. Today the company moved to change that equation. Together with Broadcom, it unveiled Jalapeño: its first purpose-built AI chip.

Jalapeño is an inference accelerator, meaning it handles serving responses to users rather than training models from scratch. That distinction matters for the bottom line. Early testing puts its operating cost at roughly half that of typical AI GPUs, with performance-per-watt metrics that outpace current state-of-the-art alternatives.

The development timeline stands out. From initial design to manufacturing tape-out took nine months — reportedly the fastest advanced ASIC development cycle ever recorded in high-performance semiconductors. OpenAI accelerated the work by using its own models to handle parts of the engineering process, a genuinely recursive move that compressed what traditionally takes years into under a year.

First silicon samples are now in OpenAI's hands for testing, with customer-serving deployment targeted for late 2026 and full-scale rollout in the first half of 2028.

For Nvidia, this is a clear warning sign. OpenAI was among its biggest Blackwell customers. Microsoft, Google, and Amazon built their own chips years ago, and now the company behind the world's most-used AI assistant is doing the same. Jalapeño probably won't replace Nvidia entirely in OpenAI's data centers — but it gives the company real leverage at the negotiating table. That's the kind of leverage it has never had before.

Questions and answers

Frequently asked questions about this article

What is the Jalapeño chip and who developed it?

Jalapeño is OpenAI's first custom AI chip, co-developed with Broadcom. It's an inference accelerator designed to serve user queries faster and more cheaply, rather than to train language models.

How much cheaper is Jalapeño compared to regular GPUs?

Early testing shows Jalapeño costs roughly 50% less to operate than typical AI GPUs. The chip also delivers better performance per watt than current leading alternatives.

When will OpenAI start using Jalapeño in production?

First chip samples are currently being tested. OpenAI plans to begin serving real user queries on Jalapeño by late 2026, with full-scale deployment in the first half of 2028.

What does this mean for Nvidia?

OpenAI was among Nvidia's biggest Blackwell customers. A proprietary chip shifts OpenAI's negotiating leverage at procurement time and reduces its dependence on a single compute supplier.

Read also

News

Polymarket Staged $1.9M in Fake Bets. Now Senators Want a CFTC Probe

A WSJ investigation found Polymarket paid creators to stage fake bets on replica sites. Bipartisan senators Curtis and Schiff are now demanding a CFTC investigation.

8 hours ago

3 min

News

Alibaba stole 28.8M Claude responses to train Qwen, Anthropic tells Senate

From April to June 2026, Alibaba used 25,000 fake accounts to run 28.8 million exchanges against Claude — harvesting its reasoning to train Qwen. Anthropic calls it the largest distillation attack ever filed.

11 hours ago

5 min

News

Polymarket Lost $3M After Third-Party Vendor Was Hacked

Hackers compromised a third-party vendor to inject malicious code into Polymarket's frontend, draining roughly $3M in PUSD from user wallets. The platform's second security breach in a month.

14 hours ago

3 min