ARMONK, N.Y. — October 7, 2025 — Leads & Copy — IBM (NYSE: IBM) has announced the upcoming general availability of the IBM Spyre Accelerator, an AI accelerator designed for low-latency inferencing to support generative and agentic AI use cases, while also ensuring the security and resilience of core workloads. The Spyre will be generally available on October 28 for IBM z17 and LinuxONE 5 systems, and in early December for Power11 servers.
IBM recognized the need for mainframes and servers to run AI models without compromising on throughput. The accelerator is designed to maintain the security and resilience of core data, transactions, and applications, while supporting generative and agentic AI. It allows clients to keep mission-critical data on-prem to mitigate risk and address operational and energy efficiency.
The IBM Spyre Accelerator, a commercial system-on-a-chip with 32 individual accelerator cores and 25.6 billion transistors produced using 5nm node technology, reflects the strength of IBM’s research-to-product pipeline, combining breakthrough innovation from the IBM Research AI Hardware Center with enterprise-grade development from IBM Infrastructure. Each Spyre is mounted on a 75-watt PCIe card, allowing for clustering up to 48 cards in an IBM Z or LinuxONE system, or 16 cards in an IBM Power system, to scale AI capabilities.
According to Barry Baker, COO, IBM Infrastructure & GM, IBM Systems, the Spyre Accelerator extends the capabilities of IBM’s systems to support multi-model AI, including generative and agentic AI, enabling clients to scale their AI-enabled mission-critical workloads with security, resilience, and efficiency, while unlocking the value of their enterprise data.
Mukesh Khare, GM of IBM Semiconductors and VP of Hybrid Cloud, IBM, said that the first chip from the IBM Research AI Hardware Center has entered commercialization, designed to deliver improved performance and productivity to IBM’s mainframe and server clients.
The Spyre Accelerators offer fast, secured processing with on-prem AI acceleration, allowing businesses to leverage AI at scale while keeping data on IBM Z, LinuxONE and Power systems. Coupled with the Telum II processor for IBM Z and LinuxONE, Spyre offers enhanced security, low latency, and high transaction rate processing power. On IBM Power-based servers, Spyre customers can leverage a catalog of AI services, enabling end-to-end AI for enterprise workflows, combined with an on-chip accelerator (MMA), also accelerates data conversion for generative AI to deliver high throughput for deep process integrations.
Willa Hahn, willa.hahn@ibm.com
Chase Skinner, Chase.Skinner@ibm.com
Source: IBM