Enterprises are crossing a new watershed in their adoption of artificial intelligence. The market's focus is no longer solely on whether to invest in AI, but on how to deploy the right semiconductors and infrastructure for each business operation to maximize cost-effectiveness. With the rapid growth of agentic AI workloads and rising inference costs, the core challenge for large enterprises is no longer reflexively buying the highest-performance hardware, but selecting computing resources suited to their objectives; in other words, making deliberate choices.
Against this backdrop, the partnership between AMD and Red Hat has drawn renewed attention. John Hampton, AMD's Vice President of Global Enterprise Technology Sales, said at the Red Hat Summit 2026 in Boston that enterprises are looking for more flexible AI infrastructure within hybrid environments. He noted that many customers recently rushed to build large-scale GPU clusters to meet AI demand, only to face far greater cost pressure in actual operation than they had anticipated.
AI inference costs are rising sharply... Enterprises are re-evaluating their reliance on large GPUs alone.
According to Hampton, many companies focused on purchasing large quantities of high-performance GPUs to avoid falling behind in the early stages of the AI race. The problem is that as services scaled up, the cost of each AI query accumulated, rapidly increasing budget pressure. This phenomenon is known in the industry as "token economics," meaning that as the use of generative AI increases, token processing costs also rise, directly impacting a company's profitability.
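The "token economics" dynamic the article describes can be illustrated with a back-of-the-envelope calculation. The numbers below are hypothetical assumptions for illustration only, not figures from AMD, Red Hat, or the article:

```python
# Illustrative estimate of how per-query token costs accumulate at scale.
# All prices and volumes are hypothetical, chosen only to show the trend.

def monthly_token_cost(queries_per_day, tokens_per_query, price_per_million_tokens):
    """Estimated monthly inference spend, assuming a flat per-token price."""
    tokens_per_month = queries_per_day * tokens_per_query * 30  # ~30-day month
    return tokens_per_month / 1_000_000 * price_per_million_tokens

# A service growing from 10k to 1M queries per day, ~2,000 tokens per query,
# at an assumed $5 per million tokens: cost scales linearly with usage.
for qpd in (10_000, 100_000, 1_000_000):
    cost = monthly_token_cost(qpd, tokens_per_query=2_000, price_per_million_tokens=5.0)
    print(f"{qpd:>9,} queries/day -> ${cost:,.0f}/month")
```

The point is that costs grow in lockstep with adoption: a hundredfold increase in queries means a hundredfold increase in token spend unless the per-token cost itself is brought down, for example by moving lighter workloads onto cheaper hardware.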
He stated, "Enterprises initially bought large GPU clusters for AI, but the costs are now proving hard to bear. AI applications are growing, but the speed at which costs are rising is a serious concern." The upshot is that the core of enterprise AI strategy is shifting from "securing the highest-performance equipment" to "optimizing deployments for specific tasks."
AMD and Red Hat: Providing "Full Spectrum" Solutions from CPUs to GPUs
To address this trend, AMD has launched a "full spectrum" product portfolio encompassing CPUs, cost-effective GPUs, and high-performance accelerators. Its strategy is to combine this hardware with Red Hat's open-source software-based stack, enabling enterprises to flexibly operate AI tasks in hybrid cloud environments without relying on specific vendors.
Taking the AMD Instinct MI350P as an example, it is described as a PCIe-based GPU that can be relatively easily integrated into existing servers. It features an air-cooled design for cost-effectiveness. Red Hat AI, as an enterprise-grade platform, supports the deployment and expansion of AI agents on this type of hardware. Furthermore, leveraging AMD EPYC CPUs and Red Hat virtualization tools enables server consolidation, thereby helping to reduce data center footprint and power consumption.
The key is "open architecture"... advancing AI budget control and infrastructure modernization in parallel.
The core message conveyed was "openness" and "selectivity." AMD, along with Red Hat, emphasized that compared to a closed ecosystem, enterprises should use open architectures to select the most suitable resources from CPUs, low-power GPUs, and high-performance accelerators for different AI workloads. Not all inference tasks need to be deployed on expensive equipment.
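The workload-matching idea above can be sketched in a few lines. The tier names, workload categories, and mapping below are hypothetical illustrations of the principle, not an AMD or Red Hat API:

```python
# Minimal sketch of routing AI workloads to hardware tiers, following the
# "right resource for the task" idea. The categories and assignments are
# made-up examples; real placement would weigh latency, model size, and cost.

WORKLOAD_TIERS = {
    "batch-embedding":   "CPU",                        # latency-insensitive
    "light-inference":   "cost-effective GPU",         # small models
    "agent-inference":   "cost-effective GPU",         # many cheap calls
    "frontier-training": "high-performance accelerator",
}

def place_workload(kind: str) -> str:
    """Return the hardware tier for a workload, defaulting to the cheapest."""
    return WORKLOAD_TIERS.get(kind, "CPU")

print(place_workload("light-inference"))    # -> cost-effective GPU
print(place_workload("frontier-training"))  # -> high-performance accelerator
```

Even a crude policy like this captures the article's argument: only the workloads that genuinely need top-end accelerators are placed on them, while everything else runs on cheaper silicon.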
The benefits of this approach extend beyond cost reduction. It lets businesses fully leverage existing infrastructure without slowing AI adoption, and frees up budget and power that can be reinvested in new AI projects. In practice, this means AI infrastructure can be modernized and budgets controlled at the same time.
Hampton predicts that the future evaluation standard for the AI market will likely no longer be "what was bought," but rather "how it was deployed." As the competition among companies to develop AI officially enters the operational phase, some analysts believe that the key to future success will not lie in performance demonstrations, but in the ability to skillfully balance total cost of ownership and actual results.
TP AI Notes: This article is a summary generated with TokenPost.ai's language model. Key information in the main text may be omitted or may differ from the facts.