top

  Info

  • Utilizzare la checkbox di selezione a fianco di ciascun documento per attivare le funzionalità di stampa, invio email, download nei formati disponibili del (i) record.

  Info

  • Utilizzare questo link per rimuovere la selezione effettuata.
Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More [[electronic resource] /] / edited by Mitsuhisa Sato, Toshihiro Hanawa, Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski
Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More [[electronic resource] /] / edited by Mitsuhisa Sato, Toshihiro Hanawa, Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski
Edizione [1st ed. 2010.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2010
Descrizione fisica 1 online resource (187 p. 121 illus.)
Disciplina 005.275
Collana Programming and Software Engineering
Soggetto topico Computer communication systems
Architecture, Computer
Microprocessors
Algorithms
Software engineering
Computers
Computer Communication Networks
Computer System Implementation
Processor Architectures
Algorithm Analysis and Problem Complexity
Software Engineering
Computation by Abstract Devices
ISBN 1-280-38670-3
9786613564627
3-642-13217-0
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Sixth International Workshop on OpenMP IWOMP 2010 -- Enabling Low-Overhead Hybrid MPI/OpenMP Parallelism with MPC -- A ROSE-Based OpenMP 3.0 Research Compiler Supporting Multiple Runtime Libraries -- Binding Nested OpenMP Programs on Hierarchical Memory Architectures -- A Proposal for User-Defined Reductions in OpenMP -- An Extension to Improve OpenMP Tasking Control -- Towards an Error Model for OpenMP -- How OpenMP Applications Get More Benefit from Many-Core Era -- Topology-Aware OpenMP Process Scheduling -- How to Reconcile Event-Based Performance Analysis with Tasking in OpenMP -- Fuzzy Application Parallelization Using OpenMP -- Hybrid Parallel Programming on SMP Clusters Using XPFortran and OpenMP -- A Case for Including Transactions in OpenMP -- OMPCUDA : OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler.
Record Nr. UNISA-996465838303316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2010
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
Evolving OpenMP in an Age of Extreme Parallelism [[electronic resource] ] : 5th International Workshop on OpenMP, IWOMP 2009, Dresden, Germany, June 3-5, 2009 Proceedings / / edited by Matthias S. Müller, Bronis R. de Supinski, Barbara Chapman
Evolving OpenMP in an Age of Extreme Parallelism [[electronic resource] ] : 5th International Workshop on OpenMP, IWOMP 2009, Dresden, Germany, June 3-5, 2009 Proceedings / / edited by Matthias S. Müller, Bronis R. de Supinski, Barbara Chapman
Edizione [1st ed. 2009.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2009
Descrizione fisica 1 online resource (X, 183 p.)
Disciplina 005.11
Collana Programming and Software Engineering
Soggetto topico Computer programming
Computers
Computer software—Reusability
Computer system failures
Microprocessors
Architecture, Computer
Programming Techniques
Theory of Computation
Performance and Reliability
System Performance and Evaluation
Processor Architectures
Computer System Implementation
ISBN 3-642-02303-7
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Fifth International Workshop on OpenMP IWOMP 2009 -- Parallel Simulation of Bevel Gear Cutting Processes with OpenMP Tasks -- Evaluation of Multicore Processors for Embedded Systems by Parallel Benchmark Program Using OpenMP -- Extending Automatic Parallelization to Optimize High-Level Abstractions for Multicore -- Scalability Evaluation of Barrier Algorithms for OpenMP -- Use of Cluster OpenMP with the Gaussian Quantum Chemistry Code: A Preliminary Performance Analysis -- Evaluating OpenMP 3.0 Run Time Systems on Unbalanced Task Graphs -- Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective -- Scalability of Gaussian 03 on SGI Altix: The Importance of Data Locality on CC-NUMA Architecture -- Providing Observability for OpenMP 3.0 Applications -- A Microbenchmark Suite for Mixed-Mode OpenMP/MPI -- Performance Profiling for OpenMP Tasks -- Tile Reduction: The First Step towards Tile Aware Parallelization in OpenMP -- A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures -- Identifying Inter-task Communication in Shared Memory Programming Models.
Record Nr. UNISA-996466270103316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2009
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP in a Heterogeneous World [[electronic resource] ] : 8th International Workshop on OpenMP, IWOMP 2012, Rome, Italy, June 11-13, 2012. Proceedings / / edited by Barbara Chapman, Federico Massaioli, Matthias S. Müller, Marco Rorro
OpenMP in a Heterogeneous World [[electronic resource] ] : 8th International Workshop on OpenMP, IWOMP 2012, Rome, Italy, June 11-13, 2012. Proceedings / / edited by Barbara Chapman, Federico Massaioli, Matthias S. Müller, Marco Rorro
Edizione [1st ed. 2012.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2012
Descrizione fisica 1 online resource (XI, 280 p. 86 illus.)
Disciplina 004.1
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Algorithms
Computer system failures
Programming languages (Electronic computers)
Computers
Computer programming
Processor Architectures
Algorithm Analysis and Problem Complexity
System Performance and Evaluation
Programming Languages, Compilers, Interpreters
Computation by Abstract Devices
Programming Techniques
ISBN 3-642-30961-5
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNISA-996465280903316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2012
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP in the Era of Low Power Devices and Accelerators [[electronic resource] ] : 9th International Workshop on OpenMP, IWOMP 2013, Canberra, Australia, September 16-18, 2013, Proceedings / / edited by Alistair P. Rendell, Barbara M. Chapman, Matthias S. Müller
OpenMP in the Era of Low Power Devices and Accelerators [[electronic resource] ] : 9th International Workshop on OpenMP, IWOMP 2013, Canberra, Australia, September 16-18, 2013, Proceedings / / edited by Alistair P. Rendell, Barbara M. Chapman, Matthias S. Müller
Edizione [1st ed. 2013.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2013
Descrizione fisica 1 online resource (X, 201 p. 80 illus.)
Disciplina 004.35
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Algorithms
Computer system failures
Programming languages (Electronic computers)
Computers
Computer programming
Processor Architectures
Algorithm Analysis and Problem Complexity
System Performance and Evaluation
Programming Languages, Compilers, Interpreters
Computation by Abstract Devices
Programming Techniques
ISBN 3-642-40698-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Proposed extensions to OpenMP -- Applications -- Accelerators -- Scheduling -- Tools.
Record Nr. UNISA-996466039203316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2013
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP in the Era of Low Power Devices and Accelerators : 9th International Workshop on OpenMP, IWOMP 2013, Canberra, Australia, September 16-18, 2013, Proceedings / / edited by Alistair P. Rendell, Barbara M. Chapman, Matthias S. Müller
OpenMP in the Era of Low Power Devices and Accelerators : 9th International Workshop on OpenMP, IWOMP 2013, Canberra, Australia, September 16-18, 2013, Proceedings / / edited by Alistair P. Rendell, Barbara M. Chapman, Matthias S. Müller
Edizione [1st ed. 2013.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2013
Descrizione fisica 1 online resource (X, 201 p. 80 illus.)
Disciplina 004.35
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Algorithms
Computer system failures
Programming languages (Electronic computers)
Computers
Computer programming
Processor Architectures
Algorithm Analysis and Problem Complexity
System Performance and Evaluation
Programming Languages, Compilers, Interpreters
Computation by Abstract Devices
Programming Techniques
ISBN 3-642-40698-X
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Proposed extensions to OpenMP -- Applications -- Accelerators -- Scheduling -- Tools.
Record Nr. UNINA-9910483298203321
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2013
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
OpenMP in the Petascale Era [[electronic resource] ] : 7th International Workshop on OpenMP, IWOMP 2011, Chicago, Il, USA, June 13-15, 2011, Proceedings / / edited by Barbara M. Chapman, William D. Gropp, Kalyan Kumaran, Matthias S. Müller
OpenMP in the Petascale Era [[electronic resource] ] : 7th International Workshop on OpenMP, IWOMP 2011, Chicago, Il, USA, June 13-15, 2011, Proceedings / / edited by Barbara M. Chapman, William D. Gropp, Kalyan Kumaran, Matthias S. Müller
Edizione [1st ed. 2011.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2011
Descrizione fisica 1 online resource (X, 179 p. 75 illus., 33 illus. in color.)
Disciplina 004/.35
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Algorithms
Computer communication systems
Software engineering
Computers
Computer programming
Processor Architectures
Algorithm Analysis and Problem Complexity
Computer Communication Networks
Software Engineering
Computation by Abstract Devices
Programming Techniques
ISBN 3-642-21487-8
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Record Nr. UNISA-996465631303316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2011
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP Shared Memory Parallel Programming [[electronic resource] ] : International Workshop, IWOMP 2005 and IWOMP 2006, Eugene, OR, USA, June 1-4, 2005, and Reims, France, June 12-15, 2006, Proceedings / / edited by Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski, Allen D. Malony, Michael Voss
OpenMP Shared Memory Parallel Programming [[electronic resource] ] : International Workshop, IWOMP 2005 and IWOMP 2006, Eugene, OR, USA, June 1-4, 2005, and Reims, France, June 12-15, 2006, Proceedings / / edited by Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski, Allen D. Malony, Michael Voss
Edizione [1st ed. 2008.]
Pubbl/distr/stampa Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2008
Descrizione fisica 1 online resource (XII, 448 p.)
Disciplina 005.2/75
Collana Theoretical Computer Science and General Issues
Soggetto topico Computer science
Computer programming
Software engineering
Algorithms
Computer science—Mathematics
Computer simulation
Theory of Computation
Programming Techniques
Software Engineering
Mathematics of Computing
Computer Modelling
ISBN 3-540-68555-3
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Performance Tools -- Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG -- ompP: A Profiling Tool for OpenMP -- On the Interaction of Tiling and Automatic Parallelization -- Static Nonconcurrency Analysis of OpenMP Programs -- CCRG OpenMP Compiler: Experiments and Improvements -- Compiler Technology -- Implementing an OpenMP Execution Environment on InfiniBand Clusters -- An Introduction to Balder — An OpenMP Run-time Library for Clusters of SMPs -- Run-Time Environment -- Experiences with the OpenMP Parallelization of DROPS, a Navier-Stokes Solver Written in C++ -- A Parallel Structured Ecological Model for High End Shared Memory Computers -- Multi-cluster, Mixed-Mode Computational Modeling of Human Head Conductivity -- Application I -- An Evaluation of OpenMP on Current and Emerging Multithreaded/Multicore Processors -- SPEC OpenMP Benchmarks on Four Generations of NEC SX Parallel Vector Systems -- Performance Evaluation of Parallel Sparse Matrix–Vector Products on SGI Altix3700 -- The OpenMP Language and Its Evaluation -- The OpenMP Memory Model -- Evaluating OpenMP on Chip MultiThreading Platforms -- Experiences Parallelizing a Web Server with OpenMP -- Advanced Performance Tuning -- Automatic Granularity Selection and OpenMP Directive Generation Via Extended Machine Descriptors in the PROMIS Parallelizing Compiler -- Nested Parallelization of the Flow Solver TFS Using the ParaWise Parallelization Environment -- Performance Characteristics of OpenMP Language Constructs on a Many-core-on-a-chip Architecture -- Improving Performance of OpenMP for SMP Clusters Through Overlapped Page Migrations -- Aspects of Code Development -- Adding New Dimensions to Performance Analysis Through User-Defined Objects -- Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications -- Supporting Nested OpenMP Parallelism in the TAU Performance System -- Parallelization of a Hierarchical Data Clustering Algorithm Using OpenMP -- OpenMP and C++ -- Common Mistakes in OpenMP and How to Avoid Them -- Formal Specification of the OpenMP Memory Model -- Applications II -- Performance and Programmability Comparison Between OpenMP and MPI Implementations of a Molecular Modeling Application -- OpenMP Implementation of SPICE3 Circuit Simulator -- Automatic Generation of Parallel Code for Hessian Computations -- Geographical Locality and Dynamic Data Migration for OpenMP Implementations of Adaptive PDE Solvers -- Proposed Extensions to OpenMP -- A Comparison of Task Pool Variants in OpenMP and a Proposal for a Solution to the Busy Waiting Problem -- A Proposal for OpenMP for Java -- A Proposal for Error Handling in OpenMP -- Extending the OpenMP Standard for Thread Mapping and Grouping.
Record Nr. UNISA-996465581203316
Berlin, Heidelberg : , : Springer Berlin Heidelberg : , : Imprint : Springer, , 2008
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP: Heterogenous Execution and Data Movements [[electronic resource] ] : 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings / / edited by Christian Terboven, Bronis R. de Supinski, Pablo Reble, Barbara M. Chapman, Matthias S. Müller
OpenMP: Heterogenous Execution and Data Movements [[electronic resource] ] : 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings / / edited by Christian Terboven, Bronis R. de Supinski, Pablo Reble, Barbara M. Chapman, Matthias S. Müller
Edizione [1st ed. 2015.]
Pubbl/distr/stampa Cham : , : Springer International Publishing : , : Imprint : Springer, , 2015
Descrizione fisica 1 online resource (XI, 274 p. 146 illus. in color.)
Disciplina 004
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Programming languages (Electronic computers)
Computer system failures
Computer hardware
Algorithms
Software engineering
Processor Architectures
Programming Languages, Compilers, Interpreters
System Performance and Evaluation
Computer Hardware
Algorithm Analysis and Problem Complexity
Software Engineering
ISBN 3-319-24595-3
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Intro -- Preface -- Organization -- Contents -- Applications -- PAGANtec: OpenMP Parallel Error Correction for Next-Generation Sequencing Data -- 1 Introduction -- 2 Related Work -- 2.1 k-mer Graph and Error Correction -- 2.2 Parallelization Options -- 3 PAGANtec Architecture -- 3.1 Graph Structure -- 3.2 Correction Strategies -- 3.3 Correcting Errors -- 4 Parallelization -- 4.1 Performance Analysis -- 5 Conclusion -- References -- Composing Low-Overhead Scheduling Strategies for Improving Performance of Scientific Applications -- 1 Introduction -- 2 Scheduling Strategies -- 3 Techniques for Composing Scheduling Strategies -- 3.1 uSched -- 3.2 slackSched -- 3.3 vSched -- 3.4 ComboSched -- 4 Code Transformation -- 5 Results -- 6 Related Work -- 7 Conclusions -- References -- Exploiting Fine- and Coarse-Grained Parallelism Using a Directive Based Approach -- 1 Introduction -- 2 Related Work -- 3 Background: OpenMP Accelerator Model -- 4 An Offloading Model for a Cluster -- 4.1 Definitions -- 4.2 Execution Model -- 4.3 Memory Model -- 5 Implementation -- 5.1 Runtime Support -- 6 Preliminary Results -- 7 Discussion -- 8 Conclusions -- References -- Accelerator Applications -- Experiences of Using the OpenMP Accelerator Model to Port DOE Stencil Applications -- 1 Introduction -- 2 OpenMP 4.0's Accelerator Support -- 3 Applications -- 4 Porting to GPUs -- 4.1 Baseline Performance on CPU and GPU -- 4.2 Increasing Parallelism -- 4.3 Loop Scheduling -- 4.4 Exploiting Memory Hierarchy -- 4.5 Reducing Memory Movement Between Host and Device -- 4.6 Manual Tuning for GPU Performance -- 4.7 Productivity -- 5 Related Work -- 6 Discussion and Future Work -- References -- Evaluating the Impact of OpenMP 4.0 Extensions on Relevant Parallel Workloads -- 1 Introduction and Motivation -- 2 Application Parallelization -- 2.1 Facesim -- 2.2 Fluidanimate.
2.3 Streamcluster -- 3 Evaluation -- 3.1 Performance Evaluation -- 3.2 Programmability -- 4 Related Work -- 5 Conclusions -- References -- First Experiences Porting a Parallel Application to a Hybrid Supercomputer with OpenMP 4.0 Device Constructs -- 1 Introduction -- 2 OpenMP Device Constructs -- 2.1 Data Regions -- 3 A High Level View of the Porting Method -- 3.1 Fusing Local Data Regions -- 4 Porting NekBone -- 5 Conclusions -- References -- Tools -- Lessons Learned from Implementing OMPD: A Debugging Interface for OpenMP -- 1 Introduction -- 2 Prior Work -- 3 The OpenMP Debugging Interface -- 3.1 OMPT: A Runtime Interface for OpenMP Tools -- 3.2 Why Distinguish OMPD from OMPT? -- 3.3 The OMPD Architecture -- 4 Use Cases of OMPD -- 4.1 OpenMP-Aware Stack Trace -- 4.2 Stepping in and Out of a Parallel Region -- 5 OMPD Callback Interface -- 5.1 Functions for Operating System Interaction -- 5.2 Resolving Structures for Target Architecture -- 5.3 Access Application Memory -- 5.4 Debugger's Context Argument -- 6 OMPD API Function Specifications -- 6.1 Providing Information on Compatible Runtime Library -- 6.2 API Specification for Breakpoints -- 6.3 Missing Function to Identify Master -- 7 Future Challenges -- 7.1 Context Pointer for Accelerators -- 7.2 Addressing Accelerator Threads -- 7.3 Return Codes -- 8 Conclusions -- References -- False Sharing Detection in OpenMP Applications Using OMPT API -- 1 Introduction -- 2 Motivation -- 3 Related Work -- 4 OMPT- Application Programming Interface for Tools -- 5 Our Approach -- 5.1 OMPT for Capturing Unique Patterns -- 5.2 Hardware Performance Information -- 5.3 Binary Classifier for False Sharing Detection -- 5.4 Feature Selection -- 6 Experimentation and Results -- 6.1 Training Phase -- 6.2 Validation of the Approach -- 7 Conclusion and Future Work -- References.
Exception Handling with OpenMP in Object-Oriented Languages -- 1 Introduction -- 2 Related Work -- 3 Problem Overview -- 3.1 Current Situation -- 3.2 Problem Definition -- 4 Cancellations -- 5 Exception Handling -- 5.1 Overview of Categorization -- 5.2 Local Exception Handling -- 5.3 Global Exception Handling -- 6 Implementation -- 6.1 Adaptable Synchronization Barrier -- 6.2 Dynamic Work Redistribution -- 6.3 Exception from Synchronization Regions -- 6.4 Global Exception Throwing -- 7 Evaluation -- 7.1 Usability -- 7.2 Performance -- 8 Conclusion -- References -- Extensions -- On the Algorithmic Aspects of Using OpenMP Synchronization Mechanisms II: User-Guided Speculative Locks -- 1 Introduction -- 2 Related Work -- 3 User-Guided Locking API with TSX -- 3.1 Intel Transactional Synchronization Extensions -- 3.2 Using the User-Guided Locking API -- 4 Applying Intel TSX to the Test Code -- 4.1 A Brief Review of the Algorithm -- 4.2 The Role of TSX -- 5 Experimental Results -- 5.1 Convergence -- 5.2 Transactional Memory Statistics -- 5.3 Performance Measurement -- 6 Conclusions and Future Work -- References -- Using Transactional Memory to Avoid Blocking in OpenMP Synchronization Directives -- 1 Introduction -- 2 Avoiding Blocking in OpenMP -- 2.1 Critical Sections -- 2.2 Barrier/Taskwait -- 3 Evaluation -- 3.1 Experimental Setup -- 3.2 Results -- 4 Limitations and Related Work -- 5 Conclusion -- References -- A Case Study of OpenMP Applied to Map/Reduce-Style Computations -- 1 Introduction -- 2 Related Work -- 3 Map-Reduce Programming Model -- 3.1 Phoenix++ Implementation -- 3.2 OpenMP Facilities for Map/Reduce-Style Computations -- 4 OpenMP Implementations -- 4.1 Histogram -- 4.2 Linear Regression -- 4.3 K-Means Clustering -- 4.4 Word Count -- 4.5 String Match -- 4.6 Matrix Multiply -- 4.7 Principal Component Analysis -- 5 Evaluation -- 5.1 Analysis.
5.2 Coding Style Comparison -- 5.3 Implications to OpenMP -- 6 Conclusion -- References -- Compiler and Runtime -- Enabling Region Merging Optimizations in OpenMP -- 1 Introduction -- 2 Region Merging and Control -- 2.1 Region Merging Validity in OpenMP -- 2.2 Syntax Extensions to Support Merging -- 3 Results and Evaluation -- 3.1 Back to Back Regions -- 3.2 Parallel Regions with Intervening Serial Regions -- 3.3 Lulesh -- 4 Related Work -- 5 Conclusion -- References -- Towards Task-Parallel Reductions in OpenMP -- 1 Introduction -- 2 Related Work -- 3 Discussion -- 3.1 Updates of a Reduction Variable Outside a Reduction Context -- 3.2 Over-Specifying the Reduction Identifier -- 3.3 Supporting Untied Tasks -- 3.4 Supporting Nested Taskgroups -- 3.5 Cancellation, Dependencies and Merged Tasks -- 4 Syntax Additions -- 5 Evaluation -- 5.1 System Environment -- 5.2 Benchmark Descriptions -- 5.3 Performance Results on Intel Xeon Processors -- 5.4 Performance Results on Intel Xeon Phi Coprocessors -- 6 Conclusions and Future Work -- References -- OpenMP 4.0 Device Support in the OMPi Compiler -- 1 Introduction -- 2 Background -- 3 Compiler Transformations -- 3.1 Target Data -- 3.2 Target -- 3.3 Declare Target -- 4 Runtime Support -- 4.1 Data Environment Handling -- 5 The Epiphany Accelerator as a Device -- 5.1 Runtime Organization -- 5.2 Experiments -- 6 Discussion and Current Status -- References -- Energy -- Application-Level Energy Awareness for OpenMP -- 1 Introduction -- 2 Motivation -- 3 OpenMPE -- 4 Compilation and Runtime System -- 5 Evaluation -- 6 Related Work -- 7 Conclusion and Future Work -- References -- Evaluating the Energy Consumption of OpenMP Applications on Haswell Processors -- 1 Introduction -- 2 Related Works -- 3 Basic Characteristics -- 3.1 Energy-Saving Features of Haswell -- 3.2 Load-Dependent Behavior -- 4 Optimization Steps.
4.1 Wait Strategies -- 4.2 Iterative Clock Adjustment -- 4.3 Evaluation -- 5 Conclusion -- References -- Parallelization Methods for Hierarchical SMP Systems -- 1 Introduction -- 2 The Test Code -- 3 SIMD Building Blocks -- 4 Nested Threading -- 5 Code Variants -- 5.1 Baseline -- 5.2 Hand Decomposed -- 5.3 Nested Parallelism -- 5.4 Hand Nested -- 5.5 Crew and Teams -- 5.6 SBB -- 6 Performance Experiments -- 7 Conclusions and Future Work -- References -- Supporting Indirect Data Mapping in OpenMP 敳敲癥搠䁤 㴀 ⨀䁬整䁴潫敮 ⴀ㘀瀀 -- 1 Introduction -- 2 The OpenMP 4.0 Data Environment -- 2.1 Mapping Syntax -- 2.2 Presence -- 3 Map Refinements -- 3.1 Data only Array Sections -- 3.2 Type-Based Implicit Mappings -- 4 Clause Grouping and Binding -- 5 Conclusion -- References -- Author Index.
Record Nr. UNISA-996466296603316
Cham : , : Springer International Publishing : , : Imprint : Springer, , 2015
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui
OpenMP: Heterogenous Execution and Data Movements : 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings / / edited by Christian Terboven, Bronis R. de Supinski, Pablo Reble, Barbara M. Chapman, Matthias S. Müller
OpenMP: Heterogenous Execution and Data Movements : 11th International Workshop on OpenMP, IWOMP 2015, Aachen, Germany, October 1-2, 2015, Proceedings / / edited by Christian Terboven, Bronis R. de Supinski, Pablo Reble, Barbara M. Chapman, Matthias S. Müller
Edizione [1st ed. 2015.]
Pubbl/distr/stampa Cham : , : Springer International Publishing : , : Imprint : Springer, , 2015
Descrizione fisica 1 online resource (XI, 274 p. 146 illus. in color.)
Disciplina 004
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Programming languages (Electronic computers)
Computer system failures
Computer hardware
Algorithms
Software engineering
Processor Architectures
Programming Languages, Compilers, Interpreters
System Performance and Evaluation
Computer Hardware
Algorithm Analysis and Problem Complexity
Software Engineering
ISBN 3-319-24595-3
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Intro -- Preface -- Organization -- Contents -- Applications -- PAGANtec: OpenMP Parallel Error Correction for Next-Generation Sequencing Data -- 1 Introduction -- 2 Related Work -- 2.1 k-mer Graph and Error Correction -- 2.2 Parallelization Options -- 3 PAGANtec Architecture -- 3.1 Graph Structure -- 3.2 Correction Strategies -- 3.3 Correcting Errors -- 4 Parallelization -- 4.1 Performance Analysis -- 5 Conclusion -- References -- Composing Low-Overhead Scheduling Strategies for Improving Performance of Scientific Applications -- 1 Introduction -- 2 Scheduling Strategies -- 3 Techniques for Composing Scheduling Strategies -- 3.1 uSched -- 3.2 slackSched -- 3.3 vSched -- 3.4 ComboSched -- 4 Code Transformation -- 5 Results -- 6 Related Work -- 7 Conclusions -- References -- Exploiting Fine- and Coarse-Grained Parallelism Using a Directive Based Approach -- 1 Introduction -- 2 Related Work -- 3 Background: OpenMP Accelerator Model -- 4 An Offloading Model for a Cluster -- 4.1 Definitions -- 4.2 Execution Model -- 4.3 Memory Model -- 5 Implementation -- 5.1 Runtime Support -- 6 Preliminary Results -- 7 Discussion -- 8 Conclusions -- References -- Accelerator Applications -- Experiences of Using the OpenMP Accelerator Model to Port DOE Stencil Applications -- 1 Introduction -- 2 OpenMP 4.0's Accelerator Support -- 3 Applications -- 4 Porting to GPUs -- 4.1 Baseline Performance on CPU and GPU -- 4.2 Increasing Parallelism -- 4.3 Loop Scheduling -- 4.4 Exploiting Memory Hierarchy -- 4.5 Reducing Memory Movement Between Host and Device -- 4.6 Manual Tuning for GPU Performance -- 4.7 Productivity -- 5 Related Work -- 6 Discussion and Future Work -- References -- Evaluating the Impact of OpenMP 4.0 Extensions on Relevant Parallel Workloads -- 1 Introduction and Motivation -- 2 Application Parallelization -- 2.1 Facesim -- 2.2 Fluidanimate.
2.3 Streamcluster -- 3 Evaluation -- 3.1 Performance Evaluation -- 3.2 Programmability -- 4 Related Work -- 5 Conclusions -- References -- First Experiences Porting a Parallel Application to a Hybrid Supercomputer with OpenMP 4.0 Device Constructs -- 1 Introduction -- 2 OpenMP Device Constructs -- 2.1 Data Regions -- 3 A High Level View of the Porting Method -- 3.1 Fusing Local Data Regions -- 4 Porting NekBone -- 5 Conclusions -- References -- Tools -- Lessons Learned from Implementing OMPD: A Debugging Interface for OpenMP -- 1 Introduction -- 2 Prior Work -- 3 The OpenMP Debugging Interface -- 3.1 OMPT: A Runtime Interface for OpenMP Tools -- 3.2 Why Distinguish OMPD from OMPT? -- 3.3 The OMPD Architecture -- 4 Use Cases of OMPD -- 4.1 OpenMP-Aware Stack Trace -- 4.2 Stepping in and Out of a Parallel Region -- 5 OMPD Callback Interface -- 5.1 Functions for Operating System Interaction -- 5.2 Resolving Structures for Target Architecture -- 5.3 Access Application Memory -- 5.4 Debugger's Context Argument -- 6 OMPD API Function Specifications -- 6.1 Providing Information on Compatible Runtime Library -- 6.2 API Specification for Breakpoints -- 6.3 Missing Function to Identify Master -- 7 Future Challenges -- 7.1 Context Pointer for Accelerators -- 7.2 Addressing Accelerator Threads -- 7.3 Return Codes -- 8 Conclusions -- References -- False Sharing Detection in OpenMP Applications Using OMPT API -- 1 Introduction -- 2 Motivation -- 3 Related Work -- 4 OMPT- Application Programming Interface for Tools -- 5 Our Approach -- 5.1 OMPT for Capturing Unique Patterns -- 5.2 Hardware Performance Information -- 5.3 Binary Classifier for False Sharing Detection -- 5.4 Feature Selection -- 6 Experimentation and Results -- 6.1 Training Phase -- 6.2 Validation of the Approach -- 7 Conclusion and Future Work -- References.
Exception Handling with OpenMP in Object-Oriented Languages -- 1 Introduction -- 2 Related Work -- 3 Problem Overview -- 3.1 Current Situation -- 3.2 Problem Definition -- 4 Cancellations -- 5 Exception Handling -- 5.1 Overview of Categorization -- 5.2 Local Exception Handling -- 5.3 Global Exception Handling -- 6 Implementation -- 6.1 Adaptable Synchronization Barrier -- 6.2 Dynamic Work Redistribution -- 6.3 Exception from Synchronization Regions -- 6.4 Global Exception Throwing -- 7 Evaluation -- 7.1 Usability -- 7.2 Performance -- 8 Conclusion -- References -- Extensions -- On the Algorithmic Aspects of Using OpenMP Synchronization Mechanisms II: User-Guided Speculative Locks -- 1 Introduction -- 2 Related Work -- 3 User-Guided Locking API with TSX -- 3.1 Intel Transactional Synchronization Extensions -- 3.2 Using the User-Guided Locking API -- 4 Applying Intel TSX to the Test Code -- 4.1 A Brief Review of the Algorithm -- 4.2 The Role of TSX -- 5 Experimental Results -- 5.1 Convergence -- 5.2 Transactional Memory Statistics -- 5.3 Performance Measurement -- 6 Conclusions and Future Work -- References -- Using Transactional Memory to Avoid Blocking in OpenMP Synchronization Directives -- 1 Introduction -- 2 Avoiding Blocking in OpenMP -- 2.1 Critical Sections -- 2.2 Barrier/Taskwait -- 3 Evaluation -- 3.1 Experimental Setup -- 3.2 Results -- 4 Limitations and Related Work -- 5 Conclusion -- References -- A Case Study of OpenMP Applied to Map/Reduce-Style Computations -- 1 Introduction -- 2 Related Work -- 3 Map-Reduce Programming Model -- 3.1 Phoenix++ Implementation -- 3.2 OpenMP Facilities for Map/Reduce-Style Computations -- 4 OpenMP Implementations -- 4.1 Histogram -- 4.2 Linear Regression -- 4.3 K-Means Clustering -- 4.4 Word Count -- 4.5 String Match -- 4.6 Matrix Multiply -- 4.7 Principal Component Analysis -- 5 Evaluation -- 5.1 Analysis.
5.2 Coding Style Comparison -- 5.3 Implications to OpenMP -- 6 Conclusion -- References -- Compiler and Runtime -- Enabling Region Merging Optimizations in OpenMP -- 1 Introduction -- 2 Region Merging and Control -- 2.1 Region Merging Validity in OpenMP -- 2.2 Syntax Extensions to Support Merging -- 3 Results and Evaluation -- 3.1 Back to Back Regions -- 3.2 Parallel Regions with Intervening Serial Regions -- 3.3 Lulesh -- 4 Related Work -- 5 Conclusion -- References -- Towards Task-Parallel Reductions in OpenMP -- 1 Introduction -- 2 Related Work -- 3 Discussion -- 3.1 Updates of a Reduction Variable Outside a Reduction Context -- 3.2 Over-Specifying the Reduction Identifier -- 3.3 Supporting Untied Tasks -- 3.4 Supporting Nested Taskgroups -- 3.5 Cancellation, Dependencies and Merged Tasks -- 4 Syntax Additions -- 5 Evaluation -- 5.1 System Environment -- 5.2 Benchmark Descriptions -- 5.3 Performance Results on Intel Xeon Processors -- 5.4 Performance Results on Intel Xeon Phi Coprocessors -- 6 Conclusions and Future Work -- References -- OpenMP 4.0 Device Support in the OMPi Compiler -- 1 Introduction -- 2 Background -- 3 Compiler Transformations -- 3.1 Target Data -- 3.2 Target -- 3.3 Declare Target -- 4 Runtime Support -- 4.1 Data Environment Handling -- 5 The Epiphany Accelerator as a Device -- 5.1 Runtime Organization -- 5.2 Experiments -- 6 Discussion and Current Status -- References -- Energy -- Application-Level Energy Awareness for OpenMP -- 1 Introduction -- 2 Motivation -- 3 OpenMPE -- 4 Compilation and Runtime System -- 5 Evaluation -- 6 Related Work -- 7 Conclusion and Future Work -- References -- Evaluating the Energy Consumption of OpenMP Applications on Haswell Processors -- 1 Introduction -- 2 Related Works -- 3 Basic Characteristics -- 3.1 Energy-Saving Features of Haswell -- 3.2 Load-Dependent Behavior -- 4 Optimization Steps.
4.1 Wait Strategies -- 4.2 Iterative Clock Adjustment -- 4.3 Evaluation -- 5 Conclusion -- References -- Parallelization Methods for Hierarchical SMP Systems -- 1 Introduction -- 2 The Test Code -- 3 SIMD Building Blocks -- 4 Nested Threading -- 5 Code Variants -- 5.1 Baseline -- 5.2 Hand Decomposed -- 5.3 Nested Parallelism -- 5.4 Hand Nested -- 5.5 Crew and Teams -- 5.6 SBB -- 6 Performance Experiments -- 7 Conclusions and Future Work -- References -- Supporting Indirect Data Mapping in OpenMP 敳敲癥搠䁤 㴀 ⨀䁬整䁴潫敮 ⴀ㘀瀀 -- 1 Introduction -- 2 The OpenMP 4.0 Data Environment -- 2.1 Mapping Syntax -- 2.2 Presence -- 3 Map Refinements -- 3.1 Data only Array Sections -- 3.2 Type-Based Implicit Mappings -- 4 Clause Grouping and Binding -- 5 Conclusion -- References -- Author Index.
Record Nr. UNINA-9910485022103321
Cham : , : Springer International Publishing : , : Imprint : Springer, , 2015
Materiale a stampa
Lo trovi qui: Univ. Federico II
Opac: Controlla la disponibilità qui
Scaling OpenMP for Exascale Performance and Portability [[electronic resource] ] : 13th International Workshop on OpenMP, IWOMP 2017, Stony Brook, NY, USA, September 20–22, 2017, Proceedings / / edited by Bronis R. de Supinski, Stephen L. Olivier, Christian Terboven, Barbara M. Chapman, Matthias S. Müller
Scaling OpenMP for Exascale Performance and Portability [[electronic resource] ] : 13th International Workshop on OpenMP, IWOMP 2017, Stony Brook, NY, USA, September 20–22, 2017, Proceedings / / edited by Bronis R. de Supinski, Stephen L. Olivier, Christian Terboven, Barbara M. Chapman, Matthias S. Müller
Edizione [1st ed. 2017.]
Pubbl/distr/stampa Cham : , : Springer International Publishing : , : Imprint : Springer, , 2017
Descrizione fisica 1 online resource (XI, 350 p. 116 illus.)
Disciplina 001.6404
Collana Programming and Software Engineering
Soggetto topico Microprocessors
Programming languages (Electronic computers)
Special purpose computers
Logic design
Architecture, Computer
Computer programming
Processor Architectures
Programming Languages, Compilers, Interpreters
Special Purpose and Application-Based Systems
Logic Design
Computer System Implementation
Programming Techniques
ISBN 3-319-65578-7
Formato Materiale a stampa
Livello bibliografico Monografia
Lingua di pubblicazione eng
Nota di contenuto Advanced Implementations and Extensions -- OpenMP Application Studies -- Analyzing and Extending Tasking -- OpenMP 4 Application Evaluation -- Extended Parallelism Models: Performance Analysis and Tools -- Advanced Data Management with OpenMP.
Record Nr. UNISA-996466001903316
Cham : , : Springer International Publishing : , : Imprint : Springer, , 2017
Materiale a stampa
Lo trovi qui: Univ. di Salerno
Opac: Controlla la disponibilità qui