LEADER 05365nam  2200649   450 
001 9910812710103321
005 20200520144314.0
010   $a0-12-802199-3
035   $a(CKB)3710000000272525
035   $a(EBL)1832712
035   $a(SSID)ssj0001411286
035   $a(PQKBManifestationID)11889091
035   $a(PQKBTitleCode)TC0001411286
035   $a(PQKBWorkID)11400595
035   $a(PQKB)11618937
035   $a(Au-PeEL)EBL1832712
035   $a(CaPaEBR)ebr10989032
035   $a(CaONFJC)MIL785047
035   $a(OCoLC)894277468
035   $a(CaSebORM)9780128021187
035   $a(MiAaPQ)EBC1832712
035   $a(EXLCZ)993710000000272525
100   $a20141201h20152015  uy             0
101 0 $aeng
135   $aur|n|---|||||
181   $ctxt
182   $cc
183   $acr
200 10$aHigh performance parallelism pearls $emulticore and many-core programming approaches /$fJames Reinders, Jim Jeffers
205   $aFirst edition.
210  1$aWaltham, Massachusetts :$cMorgan Kaufmann,$d2015.
210  4$d©2015
215   $a1 online resource (549 p.)
300   $aIncludes indexes.
311   $a0-12-802118-7 
320   $aIncludes bibliographical references and indexes.
327   $aFront Cover; High Performance Parallelism Pearls: Multicore and Many-core Programming Approaches; Copyright; Contents; Contributors; Acknowledgments; Foreword; Humongous computing needs: Science years in the making; Open standards; Keen on many-core architecture; Xeon Phi is born: Many cores, excellent vector ISA; Learn highly scalable parallel programming; Future demands grow: Programming models matter; Preface; Inspired by 61 cores: A new era in programming; Chapter 1: Introduction; Learning from successful experiences; Code modernization; Modernize with concurrent algorithms
327   $aModernize with vectorization and data locality; Understanding power usage; ISPC and OpenCL anyone?; Intel Xeon Phi coprocessor specific; Many-core, neo-heterogeneous; No "Xeon Phi" in the title, neo-heterogeneous programming; The future of many-core; Downloads; Chapter 2: From "Correct" to "Correct & Efficient": A Hydro2D Case Study with Godunov's Scheme; Scientific computing on contemporary computers; Modern computing environments; CEA's Hydro2D; A numerical method for shock hydrodynamics; Euler's equation; Godunov's method; Where it fits; Features of modern architectures
327   $aPerformance-oriented architecture; Programming tools and runtimes; Our computing environments; Paths to performance; Running Hydro2D; Hydro2D's structure; Computation scheme; Data structures; Measuring performance; Optimizations; Memory usage; Thread-level parallelism; Arithmetic efficiency and instruction-level parallelism; Data-level parallelism; Summary; The coprocessor vs the processor; A rising tide lifts all boats; Performance strategies; Chapter 3: Better Concurrency and SIMD on HBM; The application: HIROMB-BOOS-Model; Key usage: DMI; HBM execution profile
327   $aOverview for the optimization of HBM; Data structures: Locality done right; Thread parallelism in HBM; Data parallelism: SIMD vectorization; Trivial obstacles; Premature abstraction is the root of all evil; Results; Profiling details; Scaling on processor vs. coprocessor; Contiguous attribute; Summary; References; Chapter 4: Optimizing for Reacting Navier-Stokes Equations; Getting started; Version 1.0: Baseline; Version 2.0: ThreadBox; Version 3.0: Stack memory; Version 4.0: Blocking; Version 5.0: Vectorization; Intel Xeon Phi coprocessor results; Summary
327   $aChapter 5: Plesiochronous Phasing Barriers; What can be done to improve the code?; What more can be done to improve the code?; Hyper-Thread Phalanx; What is nonoptimal about this strategy?; Coding the Hyper-Thread Phalanx; How to determine thread binding to core and HT within core?; The Hyper-Thread Phalanx hand-partitioning technique; A lesson learned; Back to work; Data alignment; Use aligned data when possible; Redundancy can be good for you; The plesiochronous phasing barrier; Let us do something to recover this wasted time; A few "left to the reader" possibilities
327   $aXeon host performance improvements similar to Xeon Phi
330   $aHigh Performance Parallelism Pearls shows how to leverage parallelism on processors and coprocessors with the same programming - illustrating the most effective ways to better tap the computational potential of systems with Intel Xeon Phi coprocessors and Intel Xeon processors or other multicore processors. The book includes examples of successful programming efforts, drawn from across industries and domains such as chemistry, engineering, and environmental science. Each chapter in this edited work includes detailed explanations of the programming techniques used, while showing high performan
606   $aParallel programming (Computer science)$xData processing
606   $aCoprocessors
615  0$aParallel programming (Computer science)$xData processing.
615  0$aCoprocessors.
676   $a005.275
700   $aReinders$b James$0851755
702   $aJeffers$b Jim
801  0$bMiAaPQ
801  1$bMiAaPQ
801  2$bMiAaPQ
906   $aBOOK
912   $a9910812710103321
996   $aHigh performance parallelism pearls$94114148
997   $aUNINA