03954nam 2200589 a 450 991043756510332120200520144314.01-283-62229-797866139347411-4471-4492-910.1007/978-1-4471-4492-2(CKB)2670000000246071(EBL)1030425(OCoLC)810077656(SSID)ssj0000737228(PQKBManifestationID)11439992(PQKBTitleCode)TC0000737228(PQKBWorkID)10782579(PQKB)11237611(DE-He213)978-1-4471-4492-2(MiAaPQ)EBC1030425(PPN)168293587(EXLCZ)99267000000024607120120723d2013 uy 0engur|n|---|||||txtccrEnergy-efficient high performance computing measurement and tuning /James H. Laros III ... [et al.]1st ed. 2013.New York Springer20131 online resource (72 p.)SpringerBriefs in computer science,2191-5768Description based upon print version of record.1-4471-4491-0 Includes bibliographical references and index.Introduction -- Platforms -- Measuring Power -- Applications -- Reducing Power During Idle Cycles -- Tuning CPU Power During Application Run-Time -- Network Bandwidth Tuning During Application Run-Time -- Energy Delay Product -- Conclusions.Recognition of the importance of power and energy in the field of high performance computing (HPC) has never been greater. Research has been conducted in a number of areas related to power and energy, but little existing research has focused on large-scale HPC. Part of the reason is the lack of measurement capability currently available on small or large platforms. Typically, research is conducted using coarse methods of measurement such as inserting a power meter between the power source and the platform, or fine grained measurements using custom instrumented boards (with obvious limitations in scale). To analyze real scientific computing applications at large scale, an in situ measurement capability is necessary that scales to the size of the platform. In response to this challenge, the unique power measurement capabilities of the Cray XT architecture were exploited to gain an understanding of power and energy use and the effects of tuning both CPU and network bandwidth. Modifications were made at the operating system level to deterministically halt cores when idle. Additionally, capabilities were added to alter operating P-state. At the application level, an understanding of the power requirements of a range of important DOE/NNSA production scientific computing applications running at large scale (thousands of nodes) is gained by simultaneously collecting current and voltage measurements on the hosting nodes. The effects of both CPU and network bandwidth tuning are examined and energy savings opportunities of up to 39% with little or no impact on run-time performance is demonstrated. Capturing scale effects was key. This research provides strong evidence that next generation large-scale platforms should not only approach CPU frequency scaling differently, as we will demonstrate, but could also benefit from the capability to tune other platform components, such as the network, to achieve more energy efficient performance.SpringerBriefs in Computer Science,2191-5768High performance computingHigh performance computing.004.3004/.3Laros James H1752586MiAaPQMiAaPQMiAaPQBOOK9910437565103321Energy-efficient high performance computing4187913UNINA