Energy Efficiency Boost in the AI-Infused POWER10 Processor
Brian Thompto, Dq Nguyen, et al.
ISCA 2021
Computer systems strive for higher performance, improved energy efficiency, reliability, fault tolerance, and sustainability. Dynamically optimizing guardbands can help achieve all of these goals with minimal design and chip area costs, leveraging on-chip sensors and targeted investments in test and firmware. Many chips use fixed voltage guardbands at each supported frequency to safeguard correct operation in the field from all threatening sources of variation, including VDD power-supply droops from sudden workload changes, temperature excursions, and device aging. Previously, advances in robust error recovery and power-supply droop mitigation techniques have been used independently to reduce required guardbands and save power. In this work, we describe an IBM z17 system that dynamically optimizes guardbands by synchronously leveraging robust droop mitigation and robust error recovery in tandem to deliver significant system power savings.
Brian Thompto, Dq Nguyen, et al.
ISCA 2021
Pong-Fei Lu, Keith A. Jenkins, et al.
Microelectronics Reliability
Brian Vanderpool, Phillip J. Restle, et al.
IEEE JSSC
Dieter F. Wendel, Ron Kalla, et al.
IEEE Journal of Solid-State Circuits