Bridging the Gap: User-Centric Energy Monitoring for Policy-Driven Application Optimization in HPC Data Centers

Nov 1, 2025·
Woong Shin
Woong Shin
,
Karl W. Schulz
,
Arthur Francisco Lorenzon
,
Matthias Maiterth
,
Bruno Villasenor Alvarez
,
Jordà Polo
,
Aditya Kashi
,
Hao Lu
,
Nicholson Koukpaizan
,
Antigoni Georgiadou
,
Matthew Norman
,
Wael Elwasif
,
Michael Matheson
,
Feiyi Wang
,
Nicholas Frontiere
,
Sarp Oral
,
Thomas Beck
,
Bronson Messer
· 0 min read
Abstract
Application energy optimization in HPC data centers face two critical gaps. Systematic methodologies that connect data center policies to application decisions and accessible monitoring tools that enable data-driven optimization. We address both gaps through two complementary pillars. First, we present a methodology based on extended weighted Energy Delay Product (EDP) to translate data center operational priorities and integrate energy considerations into the energy optimization workflow which starts from continuous monitoring through targeted optimization. Second, we present a user-space monitoring tool, Omnistat, that enables this methodology by providing developers with direct access to actionable energy telemetry. Through deployment on the Frontier supercomputer and case studies exploring performance-energy trade-offs, we show how these pillars help energy as an integral optimization target for developers as active participants in data center efficiency.
Type
Publication
Sustainable Supercomputing Workshop, held in conjunction with SC25