By: Joshua Buchalter, Sean O'Loughlin, Lannie Trieu, Krish Sankar, John Blackledge, Derrick Wood
May 30, 2025 - 5 minutes
Overview:
- Profitable generative artificial intelligence (GenAI) applications are demanding huge investment in artificial intelligence (AI) processing, making it the most attractive growth vertical in the semiconductor ecosystem.
- The poster child for hardware-based AI acceleration has been the graphics processing unit (GPU). However, hyperscalers have developed and launched application-specific integrated circuits (ASICs) tailor-made for AI.
- We reframe the technical and economic decision for hyperscalers' compute deployments to address AI at scale from "GPU vs. ASIC" to a "build vs. buy" analysis under conditions of risk.
- Our framework shows that for AI accelerators, whether custom or merchant, performance is king.
The TD Cowen Insight
The modern datacenter GPU is an application-specific chip designed for AI processing. Ramping investment in compute should be framed as a 'build vs. buy' decision under conditions of risk. In this report we describe our custom silicon cost build-up and return on investment (ROI) framework. We show that relative performance has a direct impact on the internal rate of return (IRR) of a custom project and whether it will scale to production volumes.
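To make the framework concrete, the sketch below compares IRRs for a "buy" (merchant) deployment and a "build" (custom) project. Every figure in it (unit count, prices, non-recurring engineering (NRE) cost, per-unit revenue, useful life) is a hypothetical assumption for illustration, not an estimate from the report; the point is only that inference revenue scales with relative performance, so the custom project's IRR degrades as its performance penalty grows.

```python
# Hypothetical "build vs. buy" IRR comparison in the spirit of the framework
# described above. All figures (prices, NRE, revenue, useful life) are
# illustrative assumptions, not TD Cowen estimates.

def npv(rate, cashflows):
    """Net present value; cashflows[0] occurs at t=0, then annually."""
    return sum(cf / (1 + rate) ** t for t, cf in enumerate(cashflows))

def irr(cashflows, lo=-0.99, hi=10.0, tol=1e-6):
    """Internal rate of return via bisection (assumes one sign change)."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if npv(lo, cashflows) * npv(mid, cashflows) <= 0:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2

UNITS = 100_000         # accelerators deployed (assumption)
REV_PER_UNIT = 15_000   # annual inference revenue per merchant-class unit (assumption)
YEARS = 4               # useful life in years (assumption)

# "Buy": no development cost, ~$30k per merchant GPU, full performance.
buy = [-UNITS * 30_000] + [UNITS * REV_PER_UNIT] * YEARS

# "Build": ~$500M of NRE plus ~$12k per custom ASIC, with revenue scaled
# by the custom part's performance relative to the merchant alternative.
for perf in (1.0, 0.8, 0.6):
    build = [-(500e6 + UNITS * 12_000)] + [UNITS * REV_PER_UNIT * perf] * YEARS
    print(f"relative perf {perf:.0%}: buy IRR {irr(buy):.0%}, build IRR {irr(build):.0%}")
```

In this toy setup the custom project's lower per-unit cost gives it a higher IRR at full performance, but that advantage collapses as relative performance falls toward roughly 60%, at which point the two options converge; performance determines whether a custom project clears the merchant alternative.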
Generative AI Workloads Driving the Need for Hardware-Based Accelerators
The datacenter is undergoing a fundamental paradigm shift toward accelerated computing in support of GenAI workloads. Rather than the software-based, central processing unit (CPU)-heavy infrastructure that defined the first decade of cloud, the next datacenter decade (and beyond) will feature hardware-based accelerators for networking, storage, and, most importantly, AI.
Historically, semis investors have been conditioned to understand the push-pull between general-purpose and application-specific silicon as a trade-off between flexibility and efficiency (evaluated on either performance or performance-per-watt). General-purpose processors (GPUs) supposedly suffer a flexibility penalty, while workload-focused ASICs are "better" because they have been designed for one specific workload: GenAI.
We argue that the "GPU vs. ASIC" debate is better framed as "merchant vs. custom," because modern datacenter GPUs are ASICs. At a silicon level, a datacenter GPU does very little graphics processing; it is a processor designed to accelerate AI workloads. The "general-purpose vs. application-specific" framing is unhelpful (and even counterproductive), as "general-purpose" GPUs continue to outperform their application-specific competition. In fact, both are ASICs, and the playing field is level. Under a "merchant vs. custom" framing, GenAI hardware (specifically for large language model (LLM) inference) can instead be analyzed as a "build vs. buy" decision, comparing relative rates of return across the two options.
Four Key Takeaways From Our Work:
- Merchant and custom accelerators will coexist, even within the same hyperscaler's footprint: the risks of a failed custom project are too significant to ignore, and the rewards of a successful one are too rich to pass up.
- Competition as the "#2" in merchant is harsh: you must compete not only with the leading merchant player for capital expenditure (CapEx) dollars and information technology (IT) mindshare, but also with custom silicon on applicable workloads…and those custom parts can be materially less performant yet still competitive.
- Similarly, custom silicon is becoming more competitive: each accelerator generation must be evaluated independently for its ability to generate revenue, and the "performance penalty" hurdle is easier to clear the lower the ASIC vendor's margins (see the worked example after this list).
- Perhaps unsurprisingly, for GenAI acceleration within and across merchant and custom compute, performance is king.
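On the margin point above, here is a minimal sketch of the hurdle arithmetic. It assumes, hypothetically, that the custom part carries roughly the same underlying silicon cost as the merchant GPU and differs only in the margin stacked on top; none of the prices or margins below come from the report.

```python
# Illustrative hurdle arithmetic: how lower ASIC-vendor margins lower the
# relative-performance bar a custom part must clear. All prices and margins
# are assumptions; equal silicon cost is also assumed for simplicity.

merchant_price = 30_000                                # merchant GPU price (assumption)
merchant_margin = 0.75                                 # merchant gross margin (assumption)
silicon_cost = merchant_price * (1 - merchant_margin)  # ~$7.5k of underlying cost

for asic_margin in (0.50, 0.30, 0.10):
    custom_price = silicon_cost / (1 - asic_margin)    # what the ASIC partner charges
    # Break-even on capex per unit of performance: the custom part wins
    # whenever its relative performance exceeds its relative price.
    hurdle = custom_price / merchant_price
    print(f"ASIC margin {asic_margin:.0%}: break-even at {hurdle:.0%} of merchant performance")
```

Under these toy assumptions, a design partner taking a 30% margin implies the custom part breaks even on capex-per-performance at roughly a third of the merchant part's performance, which is why lower ASIC-vendor margins make the hurdle "easier to clear."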
This leads us to recognize that there is likely no stable equilibrium share of custom silicon in the broader accelerator market. There will be an upper limit to custom, as few have the scale required to compete and the programmability of merchant platforms will inherently make their scope wider. But below that upper limit, we should expect volatile share from generation to generation as performance will ultimately determine which custom projects ever see the light of day…or rather, ever see the cold, fluorescent inside of a datacenter.
What Is Proprietary?
We develop (and make available to clients) a bottom-up GenAI unit economics model and ROI framework incorporating a hardware bill-of-materials build-up and inference revenue generation. We incorporate these unit economics into a project-level ROI analysis and compare relative IRRs for "build" (custom) vs. "buy" (merchant) strategies for GenAI infrastructure.
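As a flavor of what such a unit-economics build-up looks like, here is a deliberately simplified sketch: it derives annual inference revenue per accelerator from throughput, utilization, and price per token, then computes a simple payback. Every parameter value is a hypothetical placeholder, not a figure from our model.

```python
# Simplified per-accelerator inference unit economics, in the spirit of the
# bottom-up model described above. Every value is a hypothetical placeholder.

tokens_per_sec = 2_000    # sustained output tokens/sec per accelerator (assumption)
utilization = 0.50        # share of time serving paid traffic (assumption)
price_per_mtok = 1.00     # US$ per million output tokens (assumption)

seconds_per_year = 365 * 24 * 3600
annual_revenue = tokens_per_sec * utilization * seconds_per_year * price_per_mtok / 1e6

hardware_cost = 30_000    # accelerator share of the bill-of-materials (assumption)
annual_opex = 4_000       # power, cooling, hosting per accelerator (assumption)

print(f"Annual inference revenue per accelerator: ${annual_revenue:,.0f}")
print(f"Simple payback: {hardware_cost / (annual_revenue - annual_opex):.1f} years")
```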
Financial and Industry Model Implications
We characterize each successive generation of GenAI custom accelerators as a binary outcome rather than a continuous growth function: each generation will be evaluated fresh against the revenue generation potential of the alternative (merchant) platform and the associated IRR. Sunk-cost logic dictates that a non-performant custom chip should be abandoned rather than ramped, with significant financial implications for the ASIC vendor in lost revenue.
We also expand our Datacenter Silicon model to explicitly include the two additional non-tensor processing unit (TPU), LLM-focused custom accelerator projects we know about, as well as non-LLM-focused accelerators. While we acknowledge our own low levels of visibility, we arrive at an approximately US$334 billion accelerator market by 2030 estimated (2030E). This would comprise approximately 15% custom silicon (up from approximately 10% today), with custom growing at an approximately 30% compound annual growth rate (CAGR) to approximately US$50 billion in 2030E from approximately US$11 billion today. We model the merchant market as likely to grow at a more modest approximately 18% CAGR off a much higher base (still almost doubling to nearly US$300 billion over that time).
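A quick sanity check on the custom-silicon growth math, assuming (hypothetically, since the base year is not stated above) a 2024 base for "today":

```python
# Sanity check of the implied custom-accelerator CAGR from the figures above.
# The 2024 base year (six years to 2030E) is an assumption.
custom_today, custom_2030e, years = 11e9, 50e9, 6
cagr = (custom_2030e / custom_today) ** (1 / years) - 1
print(f"Implied custom CAGR: {cagr:.1%}")   # ~28.7%, consistent with ~30%
```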
What to Watch:
- Incremental partnership announcements in both merchant and custom have the potential to quickly change sentiment;
- Company-specific events;
- The trajectory of pricing in GenAI inference (through the lens of revenue per second) will be key to monitor as an input to the ROI framework, with significant implications for both merchant and custom (as well as overall AI infrastructure spending). Contrary to common sentiment, we believe inference pricing is likely to be more durable than assumed as model capabilities improve and hyperscalers better monetize those capabilities (even if the price per token declines).
Subscribing clients can read the full report, A Rose By Any Other Name: Reframing GPU vs. ASIC - Ahead of the Curve, on the TD One Portal.