Beyond Loop Level Parallelism in OpenMP: Accelerators, Tasking and More [electronic resource] / edited by Mitsuhisa Sato, Toshihiro Hanawa, Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski |
Edition | [1st ed. 2010.] |
Publication | Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2010 |
Physical description | 1 online resource (187 p. 121 illus.) |
Classification | 005.275 |
Series | Programming and Software Engineering |
Topical subject | Computer communication systems ; Architecture, Computer ; Microprocessors ; Algorithms ; Software engineering ; Computers ; Computer Communication Networks ; Computer System Implementation ; Processor Architectures ; Algorithm Analysis and Problem Complexity ; Software Engineering ; Computation by Abstract Devices |
ISBN | 1-280-38670-3 ; 9786613564627 ; 3-642-13217-0 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Sixth International Workshop on OpenMP IWOMP 2010 -- Enabling Low-Overhead Hybrid MPI/OpenMP Parallelism with MPC -- A ROSE-Based OpenMP 3.0 Research Compiler Supporting Multiple Runtime Libraries -- Binding Nested OpenMP Programs on Hierarchical Memory Architectures -- A Proposal for User-Defined Reductions in OpenMP -- An Extension to Improve OpenMP Tasking Control -- Towards an Error Model for OpenMP -- How OpenMP Applications Get More Benefit from Many-Core Era -- Topology-Aware OpenMP Process Scheduling -- How to Reconcile Event-Based Performance Analysis with Tasking in OpenMP -- Fuzzy Application Parallelization Using OpenMP -- Hybrid Parallel Programming on SMP Clusters Using XPFortran and OpenMP -- A Case for Including Transactions in OpenMP -- OMPCUDA : OpenMP Execution Framework for CUDA Based on Omni OpenMP Compiler. |
Record Nr. | UNISA-996465838303316 |
Available at: Univ. di Salerno | ||
|
Evolving OpenMP in an Age of Extreme Parallelism [electronic resource] : 5th International Workshop on OpenMP, IWOMP 2009, Dresden, Germany, June 3-5, 2009 Proceedings / edited by Matthias S. Müller, Bronis R. de Supinski, Barbara Chapman |
Edition | [1st ed. 2009.] |
Publication | Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2009 |
Physical description | 1 online resource (X, 183 p.) |
Classification | 005.11 |
Series | Programming and Software Engineering |
Topical subject | Computer programming ; Computers ; Computer software—Reusability ; Computer system failures ; Microprocessors ; Architecture, Computer ; Programming Techniques ; Theory of Computation ; Performance and Reliability ; System Performance and Evaluation ; Processor Architectures ; Computer System Implementation |
ISBN | 3-642-02303-7 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Fifth International Workshop on OpenMP IWOMP 2009 -- Parallel Simulation of Bevel Gear Cutting Processes with OpenMP Tasks -- Evaluation of Multicore Processors for Embedded Systems by Parallel Benchmark Program Using OpenMP -- Extending Automatic Parallelization to Optimize High-Level Abstractions for Multicore -- Scalability Evaluation of Barrier Algorithms for OpenMP -- Use of Cluster OpenMP with the Gaussian Quantum Chemistry Code: A Preliminary Performance Analysis -- Evaluating OpenMP 3.0 Run Time Systems on Unbalanced Task Graphs -- Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective -- Scalability of Gaussian 03 on SGI Altix: The Importance of Data Locality on CC-NUMA Architecture -- Providing Observability for OpenMP 3.0 Applications -- A Microbenchmark Suite for Mixed-Mode OpenMP/MPI -- Performance Profiling for OpenMP Tasks -- Tile Reduction: The First Step towards Tile Aware Parallelization in OpenMP -- A Proposal to Extend the OpenMP Tasking Model for Heterogeneous Architectures -- Identifying Inter-task Communication in Shared Memory Programming Models. |
Record Nr. | UNISA-996466270103316 |
Available at: Univ. di Salerno | ||
|
Languages and Compilers for Parallel Computing [electronic resource] : 33rd International Workshop, LCPC 2020, Virtual Event, October 14-16, 2020, Revised Selected Papers / edited by Barbara Chapman, José Moreira |
Edition | [1st ed. 2022.] |
Publication | Cham : Springer International Publishing : Imprint: Springer, 2022 |
Physical description | 1 online resource (233 pages) |
Classification | 004.35 |
Series | Theoretical Computer Science and General Issues |
Topical subject | Programming languages (Electronic computers) ; Computer networks ; Computer programming ; Computer systems ; Programming Language ; Computer Communication Networks ; Programming Techniques ; Computer System Implementation |
ISBN | 3-030-95953-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Code and Data Transformations -- An Affine Scheduling Framework for Integrating Data Layout and Loop Transformations -- Guiding Code Optimizations with Deep Learning-Based Code Matching -- Expanding Opportunities for Array Privatization in Sparse Computations -- OpenMP and Fortran -- Concurrent Execution of Deferred OpenMP Target Tasks with Hidden Helper Threads -- Using Hardware Transactional Memory to Implement Speculative Privatization in OpenMP -- Improving Fortran Performance Portability -- Domain Specific Compilation -- COMET: A Domain-Specific Compilation of High-Performance Computational Chemistry -- G-Code Re-compilation and Optimization for Faster 3D Printing -- Li Machine Language and Quantum Computing -- Optimized Code Generation for Deep Neural Networks -- Thermal-Aware Compilation of Spiking Neural Networks to Neuromorphic Hardware -- A Quantum-Inspired Model For Bit-Serial SIMD-Parallel Computation -- Performance Analysis -- Enhancing the Top-Down Microarchitectural Analysis Method Using Purchasing Power Parity Theory -- Code Generation -- Cain: Automatic Code Generation for Simultaneous Convolutional Kernels on Focal-plane Sensor-processors -- Reordering Under the ECMAScript Memory Consistency Model -- Verification of Vectorization of Signal Transforms. |
Record Nr. | UNISA-996464538003316 |
Available at: Univ. di Salerno | ||
|
Languages and Compilers for Parallel Computing : 33rd International Workshop, LCPC 2020, Virtual Event, October 14-16, 2020, Revised Selected Papers / edited by Barbara Chapman, José Moreira |
Edition | [1st ed. 2022.] |
Publication | Cham : Springer International Publishing : Imprint: Springer, 2022 |
Physical description | 1 online resource (233 pages) |
Classification | 004.35 ; 005.275 |
Series | Theoretical Computer Science and General Issues |
Topical subject | Programming languages (Electronic computers) ; Computer networks ; Computer programming ; Computer systems ; Programming Language ; Computer Communication Networks ; Programming Techniques ; Computer System Implementation ; Programació en paral·lel (Informàtica) ; Processament en paral·lel (Ordinadors) ; Compiladors (Programes d'ordinador) |
Genre / form subject | Congressos ; Llibres electrònics |
ISBN | 3-030-95953-8 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Code and Data Transformations -- An Affine Scheduling Framework for Integrating Data Layout and Loop Transformations -- Guiding Code Optimizations with Deep Learning-Based Code Matching -- Expanding Opportunities for Array Privatization in Sparse Computations -- OpenMP and Fortran -- Concurrent Execution of Deferred OpenMP Target Tasks with Hidden Helper Threads -- Using Hardware Transactional Memory to Implement Speculative Privatization in OpenMP -- Improving Fortran Performance Portability -- Domain Specific Compilation -- COMET: A Domain-Specific Compilation of High-Performance Computational Chemistry -- G-Code Re-compilation and Optimization for Faster 3D Printing -- Li Machine Language and Quantum Computing -- Optimized Code Generation for Deep Neural Networks -- Thermal-Aware Compilation of Spiking Neural Networks to Neuromorphic Hardware -- A Quantum-Inspired Model For Bit-Serial SIMD-Parallel Computation -- Performance Analysis -- Enhancing the Top-Down Microarchitectural Analysis Method Using Purchasing Power Parity Theory -- Code Generation -- Cain: Automatic Code Generation for Simultaneous Convolutional Kernels on Focal-plane Sensor-processors -- Reordering Under the ECMAScript Memory Consistency Model -- Verification of Vectorization of Signal Transforms. |
Record Nr. | UNINA-9910544850703321 |
Available at: Univ. Federico II | ||
|
Network and Parallel Computing [electronic resource] : 13th IFIP WG 10.3 International Conference, NPC 2016, Xi'an, China, October 28-29, 2016, Proceedings / edited by Guang R. Gao, Depei Qian, Xinbo Gao, Barbara Chapman, Wenguang Chen |
Edition | [1st ed. 2016.] |
Publication | Cham : Springer International Publishing : Imprint: Springer, 2016 |
Physical description | 1 online resource (X, 213 p. 94 illus.) |
Classification | 004.6 |
Series | Theoretical Computer Science and General Issues |
Topical subject | Computer engineering ; Computer networks ; Software engineering ; Algorithms ; Application software ; Computer Engineering and Networks ; Software Engineering ; Computer and Information Systems Applications |
ISBN | 3-319-47099-X |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Intro -- Preface -- Organization -- Contents -- Memory: Non-Volatile, Solid State Drives, Hybrid Systems -- VIOS: A Variation-Aware I/O Scheduler for Flash-Based Storage Systems -- 1 Introduction -- 2 Background and Related Work -- 2.1 SSD Organization -- 2.2 Process Variation of Flash Memory -- 2.3 I/O Scheduler for Flash-Based SSDs -- 3 Details of VIOS -- 3.1 Block Management -- 3.2 Global Chip-State Vector -- 3.3 Conflict Optimized Scheduling Mechanism -- 4 Experimental Results -- 4.1 Performance Analysis of VIOS -- 4.2 Sensitivity Analysis of VIOS -- 5 Conclusion -- References -- Exploiting Cross-Layer Hotness Identification to Improve Flash Memory System Performance -- 1 Introduction -- 2 Background and Related Work -- 2.1 Tradeoff Between Flash Cell Wearing and Read Latency -- 2.2 Related Work -- 3 Exploiting Cross-Layer Hotness Identification to Improve Read and Endurance Performance (HIRE) -- 3.1 Cross-Layer Study for Hotness Identifier -- 3.2 Voltage Controller in HIRE -- 3.3 Overhead Analysis -- 4 Performance Evaluation -- 4.1 Methodology -- 4.2 Experiment Results -- 5 Conclusion -- References -- Efficient Management for Hybrid Memory in Managed Language Runtime -- 1 Introduction -- 2 Managed Runtime Background -- 3 Related Work -- 4 Hybrid Memory Management Scheme -- 4.1 Overview -- 4.2 HMprof Offline Performance Instrumentation -- 4.3 Hot Object Marking -- 5 Experimental Methodology -- 6 Evaluation Results -- 6.1 Heap Partition Placement -- 6.2 Hot Object Allocation -- 7 Conclusion -- References -- Resilience and Reliability -- Application-Based Coarse-Grained Incremental Checkpointing Based on Non-volatile Memory -- Abstract -- 1 Introduction -- 2 Background and Motivation -- 2.1 Non-volatile Memory -- 2.2 Incremental Checkpointing with Non-volatile Memory -- 2.3 The Problem with Current Checkpoint -- 3 Design and Implementation.
3.1 Contiguous Memory Regions to be Visited -- 3.2 Application-Based Coarse-Grained Checkpoint: Loose Monitoring Granularity for "Hot" Applications -- 3.3 Huge Page Support -- 4 Experiments -- 4.1 Experimental Setup and Benchmarks -- 4.2 Performance Metrics and Corresponding Results -- 5 Related Work -- 6 Conclusion -- Acknowledgments -- References -- DASM: A Dynamic Adaptive Forward Assembly Area Method to Accelerate Restore Speed for Deduplication-Based Backup Systems -- 1 Introduction -- 2 Background and Motivation -- 2.1 Data Deduplication -- 2.2 Fragmentation Problem and Restoration Speed -- 2.3 Forward Assembly Area -- 2.4 Our Motivation -- 3 Design of DASM -- 3.1 Dynamic Orthotics -- 3.2 Near-Optimal Cache Policy -- 4 Performance Evaluation -- 4.1 Evaluation Methodology -- 4.2 Experimental Results -- 4.3 Analysis -- 5 Conclusions -- References -- Scheduling and Load-Balancing -- A Statistics Based Prediction Method for Rendering Application -- 1 Introduction -- 2 Related Works -- 3 Proposed Strategy -- 3.1 System Design -- 3.2 Parsing of Parameters -- 3.3 Strategy Description -- 4 Experimental Results -- 5 Conclusion and Future Work -- References -- IBB: Improved K-Resource Aware Backfill Balanced Scheduling for HTCondor -- Abstract -- 1 Introduction -- 2 Related Work -- 3 Improved Backfill Balance Scheduling -- 3.1 Thought of IBB -- 3.2 Apply Backfilling to HTCondor -- 4 Experiments and Evaluations -- 4.1 Simulation Experiments -- 4.2 Experiments Conducted on HTCondor -- 4.3 Discussion -- 5 Conclusion and Future Work -- Acknowledgments -- References -- Multipath Load Balancing in SDN/OSPF Hybrid Network -- Abstract -- 1 Introduction -- 2 Problem Formulation -- 2.1 Hybrid Network Scenario -- 2.2 Optimization Problem Definition -- 3 Load Balancing in Hybrid SDN Network -- 3.1 Disjoint Multipath Calculation -- 3.2 LRU: A New FPTAS Algorithm. 
4 Experiments and Evaluation -- 5 Conclusion -- Acknowledgments -- References -- Heterogeneous Systems -- A Study of Overflow Vulnerabilities on GPUs -- 1 Introduction -- 2 Background on CUDA Architecture -- 3 Empirical Evaluation of GPU Vulnerabilities -- 3.1 Experiment Setup -- 3.2 Stack Overflow -- 3.3 Heap Overflow -- 3.4 Other Vulnerabilities -- 4 Discussions and Countermeasures -- 5 Related Work -- 6 Conclusion -- References -- Streaming Applications on Heterogeneous Platforms -- 1 Introduction -- 2 Related Work -- 3 A Statistical View -- 3.1 Benchmarks and Datasets -- 3.2 Experimental Platforms -- 3.3 Measurement Methodology -- 3.4 Results and Analysis -- 4 Our Streaming Approach -- 4.1 Categorization -- 4.2 Code Streaming -- 5 Experimental Results -- 6 Conclusion -- References -- Data Processing and Big Data -- DSS: A Scalable and Efficient Stratified Sampling Algorithm for Large-Scale Datasets -- 1 Introduction -- 2 Related Work -- 3 Stratified Sampling Queries -- 3.1 A Single Stratified Sampling Query -- 3.2 Sequential Answering Process -- 4 Distributed Sampling Design -- 4.1 Sampling Representativeness in Distributed Environment -- 4.2 Distributed Algorithm Spark-SQE -- 4.3 Improved Distributed Algorithm DSS -- 5 Experimental Evaluation -- 5.1 Experimental Setup -- 5.2 Results -- 6 Conclusion -- References -- A Fast and Better Hybrid Recommender System Based on Spark -- 1 Introduction -- 1.1 Hybrid Recommender Systems -- 1.2 Work of Paper -- 2 Related Work -- 2.1 Recommender Systems -- 2.2 Weight Analysis -- 2.3 Spark -- 3 Design Overview -- 3.1 Objective Function -- 3.2 Weight Calculation -- 4 Implementation -- 4.1 Modules -- 4.2 Discussion -- 5 Performance -- 5.1 Evaluation Index -- 5.2 Experimental Setup -- 5.3 Performance Comparison -- 6 Conclusion and Future Work -- References. 
Discovering Trip Patterns from Incomplete Passenger Trajectories for Inter-zonal Bus Line Planning -- Abstract -- 1 Introduction -- 2 Related Work -- 3 Problem Definition -- 4 Mining Frequent Trip Patterns for Inter-zonal Bus Lines -- 4.1 Mining Commuters -- 4.2 Mining FBPT4BL Patterns -- 4.3 Time Complexity Analysis -- 5 Evaluation -- 5.1 Sensitivity Analysis of MinPts and ɛ -- 5.2 Effective and Efficient Analysis of MCFTP Approach -- 6 Conclusion -- Acknowledgments -- References -- FCM: A Fine-Grained Crowdsourcing Model Based on Ontology in Crowd-Sensing -- Abstract -- 1 Introduction -- 2 Characteristic Ontology Triple -- 3 Recommendation Method -- 3.1 Recommendation Index -- 3.2 Service Decision Tree -- 4 Experiment -- 5 Conclusion -- Acknowledgments -- References -- QIM: Quantifying Hyperparameter Importance for Deep Learning -- 1 Introduction -- 2 Background -- 2.1 Deep Learning (DL) -- 2.2 PB Design -- 3 QIM -- 3.1 Overview -- 3.2 Identifying the Value Range for Each Hyperparameter -- 3.3 QIM -- 4 Experimental Setup -- 5 Evaluation -- 5.1 Supervised Learning -- 5.2 Unsupervised Pre-training -- 5.3 Time Cost -- 6 Related Work -- 7 Conclusion -- References -- Algorithms and Computational Models -- Toward a Parallel Turing Machine Model -- 1 Introduction and Motivation -- 2 Existing Work on Parallel Turing Machine --- A Disappointing Status Report -- 2.1 Existing Parallel Turing Machine Proposals -- 2.2 The Disappointing Status on Parallel Turing Machine Studies -- 3 A Parallel Turing Machine Model --- Our Proposal -- 3.1 The Concept of Codelets and Codelet Graphs (CDGs) -- 3.2 The Parallel Turing Machine Model -- 3.3 An Example to Illustrate How the PTM Works -- 3.4 Determinacy Property of Our Proposed PTM -- 4 Related Work -- 4.1 Parallel Turing Machine -- 4.2 Memory Consistency Models -- 4.3 The Codelet Model. 
4.4 Work on Parallel Computation Models -- 5 Conclusion and Future Work -- References -- On Determination of Balance Ratio for Some Tree Structures -- 1 Introduction -- 2 The Basic Properties and Algorithms -- 3 The Improvement of Time Complexity -- 4 Concluding Remarks -- References -- Author Index. |
Record Nr. | UNISA-996465944903316 |
Available at: Univ. di Salerno | ||
|
Network and Parallel Computing : 13th IFIP WG 10.3 International Conference, NPC 2016, Xi'an, China, October 28-29, 2016, Proceedings / edited by Guang R. Gao, Depei Qian, Xinbo Gao, Barbara Chapman, Wenguang Chen |
Edition | [1st ed. 2016.] |
Publication | Cham : Springer International Publishing : Imprint: Springer, 2016 |
Physical description | 1 online resource (X, 213 p. 94 illus.) |
Classification | 004.6 |
Series | Theoretical Computer Science and General Issues |
Topical subject | Computer engineering ; Computer networks ; Software engineering ; Algorithms ; Application software ; Computer Engineering and Networks ; Software Engineering ; Computer and Information Systems Applications |
ISBN | 3-319-47099-X |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note |
Intro -- Preface -- Organization -- Contents -- Memory: Non-Volatile, Solid State Drives, Hybrid Systems -- VIOS: A Variation-Aware I/O Scheduler for Flash-Based Storage Systems -- 1 Introduction -- 2 Background and Related Work -- 2.1 SSD Organization -- 2.2 Process Variation of Flash Memory -- 2.3 I/O Scheduler for Flash-Based SSDs -- 3 Details of VIOS -- 3.1 Block Management -- 3.2 Global Chip-State Vector -- 3.3 Conflict Optimized Scheduling Mechanism -- 4 Experimental Results -- 4.1 Performance Analysis of VIOS -- 4.2 Sensitivity Analysis of VIOS -- 5 Conclusion -- References -- Exploiting Cross-Layer Hotness Identification to Improve Flash Memory System Performance -- 1 Introduction -- 2 Background and Related Work -- 2.1 Tradeoff Between Flash Cell Wearing and Read Latency -- 2.2 Related Work -- 3 Exploiting Cross-Layer Hotness Identification to Improve Read and Endurance Performance (HIRE) -- 3.1 Cross-Layer Study for Hotness Identifier -- 3.2 Voltage Controller in HIRE -- 3.3 Overhead Analysis -- 4 Performance Evaluation -- 4.1 Methodology -- 4.2 Experiment Results -- 5 Conclusion -- References -- Efficient Management for Hybrid Memory in Managed Language Runtime -- 1 Introduction -- 2 Managed Runtime Background -- 3 Related Work -- 4 Hybrid Memory Management Scheme -- 4.1 Overview -- 4.2 HMprof Offline Performance Instrumentation -- 4.3 Hot Object Marking -- 5 Experimental Methodology -- 6 Evaluation Results -- 6.1 Heap Partition Placement -- 6.2 Hot Object Allocation -- 7 Conclusion -- References -- Resilience and Reliability -- Application-Based Coarse-Grained Incremental Checkpointing Based on Non-volatile Memory -- Abstract -- 1 Introduction -- 2 Background and Motivation -- 2.1 Non-volatile Memory -- 2.2 Incremental Checkpointing with Non-volatile Memory -- 2.3 The Problem with Current Checkpoint -- 3 Design and Implementation.
3.1 Contiguous Memory Regions to be Visited -- 3.2 Application-Based Coarse-Grained Checkpoint: Loose Monitoring Granularity for "Hot" Applications -- 3.3 Huge Page Support -- 4 Experiments -- 4.1 Experimental Setup and Benchmarks -- 4.2 Performance Metrics and Corresponding Results -- 5 Related Work -- 6 Conclusion -- Acknowledgments -- References -- DASM: A Dynamic Adaptive Forward Assembly Area Method to Accelerate Restore Speed for Deduplication-Based Backup Systems -- 1 Introduction -- 2 Background and Motivation -- 2.1 Data Deduplication -- 2.2 Fragmentation Problem and Restoration Speed -- 2.3 Forward Assembly Area -- 2.4 Our Motivation -- 3 Design of DASM -- 3.1 Dynamic Orthotics -- 3.2 Near-Optimal Cache Policy -- 4 Performance Evaluation -- 4.1 Evaluation Methodology -- 4.2 Experimental Results -- 4.3 Analysis -- 5 Conclusions -- References -- Scheduling and Load-Balancing -- A Statistics Based Prediction Method for Rendering Application -- 1 Introduction -- 2 Related Works -- 3 Proposed Strategy -- 3.1 System Design -- 3.2 Parsing of Parameters -- 3.3 Strategy Description -- 4 Experimental Results -- 5 Conclusion and Future Work -- References -- IBB: Improved K-Resource Aware Backfill Balanced Scheduling for HTCondor -- Abstract -- 1 Introduction -- 2 Related Work -- 3 Improved Backfill Balance Scheduling -- 3.1 Thought of IBB -- 3.2 Apply Backfilling to HTCondor -- 4 Experiments and Evaluations -- 4.1 Simulation Experiments -- 4.2 Experiments Conducted on HTCondor -- 4.3 Discussion -- 5 Conclusion and Future Work -- Acknowledgments -- References -- Multipath Load Balancing in SDN/OSPF Hybrid Network -- Abstract -- 1 Introduction -- 2 Problem Formulation -- 2.1 Hybrid Network Scenario -- 2.2 Optimization Problem Definition -- 3 Load Balancing in Hybrid SDN Network -- 3.1 Disjoint Multipath Calculation -- 3.2 LRU: A New FPTAS Algorithm. 
4 Experiments and Evaluation -- 5 Conclusion -- Acknowledgments -- References -- Heterogeneous Systems -- A Study of Overflow Vulnerabilities on GPUs -- 1 Introduction -- 2 Background on CUDA Architecture -- 3 Empirical Evaluation of GPU Vulnerabilities -- 3.1 Experiment Setup -- 3.2 Stack Overflow -- 3.3 Heap Overflow -- 3.4 Other Vulnerabilities -- 4 Discussions and Countermeasures -- 5 Related Work -- 6 Conclusion -- References -- Streaming Applications on Heterogeneous Platforms -- 1 Introduction -- 2 Related Work -- 3 A Statistical View -- 3.1 Benchmarks and Datasets -- 3.2 Experimental Platforms -- 3.3 Measurement Methodology -- 3.4 Results and Analysis -- 4 Our Streaming Approach -- 4.1 Categorization -- 4.2 Code Streaming -- 5 Experimental Results -- 6 Conclusion -- References -- Data Processing and Big Data -- DSS: A Scalable and Efficient Stratified Sampling Algorithm for Large-Scale Datasets -- 1 Introduction -- 2 Related Work -- 3 Stratified Sampling Queries -- 3.1 A Single Stratified Sampling Query -- 3.2 Sequential Answering Process -- 4 Distributed Sampling Design -- 4.1 Sampling Representativeness in Distributed Environment -- 4.2 Distributed Algorithm Spark-SQE -- 4.3 Improved Distributed Algorithm DSS -- 5 Experimental Evaluation -- 5.1 Experimental Setup -- 5.2 Results -- 6 Conclusion -- References -- A Fast and Better Hybrid Recommender System Based on Spark -- 1 Introduction -- 1.1 Hybrid Recommender Systems -- 1.2 Work of Paper -- 2 Related Work -- 2.1 Recommender Systems -- 2.2 Weight Analysis -- 2.3 Spark -- 3 Design Overview -- 3.1 Objective Function -- 3.2 Weight Calculation -- 4 Implementation -- 4.1 Modules -- 4.2 Discussion -- 5 Performance -- 5.1 Evaluation Index -- 5.2 Experimental Setup -- 5.3 Performance Comparison -- 6 Conclusion and Future Work -- References. 
Discovering Trip Patterns from Incomplete Passenger Trajectories for Inter-zonal Bus Line Planning -- Abstract -- 1 Introduction -- 2 Related Work -- 3 Problem Definition -- 4 Mining Frequent Trip Patterns for Inter-zonal Bus Lines -- 4.1 Mining Commuters -- 4.2 Mining FBPT4BL Patterns -- 4.3 Time Complexity Analysis -- 5 Evaluation -- 5.1 Sensitivity Analysis of MinPts and ɛ -- 5.2 Effective and Efficient Analysis of MCFTP Approach -- 6 Conclusion -- Acknowledgments -- References -- FCM: A Fine-Grained Crowdsourcing Model Based on Ontology in Crowd-Sensing -- Abstract -- 1 Introduction -- 2 Characteristic Ontology Triple -- 3 Recommendation Method -- 3.1 Recommendation Index -- 3.2 Service Decision Tree -- 4 Experiment -- 5 Conclusion -- Acknowledgments -- References -- QIM: Quantifying Hyperparameter Importance for Deep Learning -- 1 Introduction -- 2 Background -- 2.1 Deep Learning (DL) -- 2.2 PB Design -- 3 QIM -- 3.1 Overview -- 3.2 Identifying the Value Range for Each Hyperparameter -- 3.3 QIM -- 4 Experimental Setup -- 5 Evaluation -- 5.1 Supervised Learning -- 5.2 Unsupervised Pre-training -- 5.3 Time Cost -- 6 Related Work -- 7 Conclusion -- References -- Algorithms and Computational Models -- Toward a Parallel Turing Machine Model -- 1 Introduction and Motivation -- 2 Existing Work on Parallel Turing Machine --- A Disappointing Status Report -- 2.1 Existing Parallel Turing Machine Proposals -- 2.2 The Disappointing Status on Parallel Turing Machine Studies -- 3 A Parallel Turing Machine Model --- Our Proposal -- 3.1 The Concept of Codelets and Codelet Graphs (CDGs) -- 3.2 The Parallel Turing Machine Model -- 3.3 An Example to Illustrate How the PTM Works -- 3.4 Determinacy Property of Our Proposed PTM -- 4 Related Work -- 4.1 Parallel Turing Machine -- 4.2 Memory Consistency Models -- 4.3 The Codelet Model. 
4.4 Work on Parallel Computation Models -- 5 Conclusion and Future Work -- References -- On Determination of Balance Ratio for Some Tree Structures -- 1 Introduction -- 2 The Basic Properties and Algorithms -- 3 The Improvement of Time Complexity -- 4 Concluding Remarks -- References -- Author Index. |
Record Nr. | UNINA-9910484010103321 |
Available at: Univ. Federico II | ||
|
OpenMP in a Heterogeneous World [electronic resource] : 8th International Workshop on OpenMP, IWOMP 2012, Rome, Italy, June 11-13, 2012. Proceedings / edited by Barbara Chapman, Federico Massaioli, Matthias S. Müller, Marco Rorro |
Edition | [1st ed. 2012.] |
Publication | Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2012 |
Physical description | 1 online resource (XI, 280 p. 86 illus.) |
Classification | 004.1 |
Series | Programming and Software Engineering |
Topical subject | Microprocessors ; Algorithms ; Computer system failures ; Programming languages (Electronic computers) ; Computers ; Computer programming ; Processor Architectures ; Algorithm Analysis and Problem Complexity ; System Performance and Evaluation ; Programming Languages, Compilers, Interpreters ; Computation by Abstract Devices ; Programming Techniques |
ISBN | 3-642-30961-5 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Record Nr. | UNISA-996465280903316 |
Available at: Univ. di Salerno | ||
|
OpenMP Shared Memory Parallel Programming [electronic resource] : International Workshop, IWOMP 2005 and IWOMP 2006, Eugene, OR, USA, June 1-4, 2005, and Reims, France, June 12-15, 2006, Proceedings / edited by Matthias S. Müller, Barbara Chapman, Bronis R. de Supinski, Allen D. Malony, Michael Voss |
Edition | [1st ed. 2008.] |
Publication | Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2008 |
Physical description | 1 online resource (XII, 448 p.) |
Classification | 005.2/75 |
Series | Theoretical Computer Science and General Issues |
Topical subject | Computer science ; Computer programming ; Software engineering ; Algorithms ; Computer science—Mathematics ; Computer simulation ; Theory of Computation ; Programming Techniques ; Software Engineering ; Mathematics of Computing ; Computer Modelling |
ISBN | 3-540-68555-3 |
Format | Printed material |
Bibliographic level | Monograph |
Language of publication | eng |
Contents note | Performance Tools -- Performance Analysis of Large-Scale OpenMP and Hybrid MPI/OpenMP Applications with Vampir NG -- ompP: A Profiling Tool for OpenMP -- On the Interaction of Tiling and Automatic Parallelization -- Static Nonconcurrency Analysis of OpenMP Programs -- CCRG OpenMP Compiler: Experiments and Improvements -- Compiler Technology -- Implementing an OpenMP Execution Environment on InfiniBand Clusters -- An Introduction to Balder — An OpenMP Run-time Library for Clusters of SMPs -- Run-Time Environment -- Experiences with the OpenMP Parallelization of DROPS, a Navier-Stokes Solver Written in C++ -- A Parallel Structured Ecological Model for High End Shared Memory Computers -- Multi-cluster, Mixed-Mode Computational Modeling of Human Head Conductivity -- Application I -- An Evaluation of OpenMP on Current and Emerging Multithreaded/Multicore Processors -- SPEC OpenMP Benchmarks on Four Generations of NEC SX Parallel Vector Systems -- Performance Evaluation of Parallel Sparse Matrix–Vector Products on SGI Altix3700 -- The OpenMP Language and Its Evaluation -- The OpenMP Memory Model -- Evaluating OpenMP on Chip MultiThreading Platforms -- Experiences Parallelizing a Web Server with OpenMP -- Advanced Performance Tuning -- Automatic Granularity Selection and OpenMP Directive Generation Via Extended Machine Descriptors in the PROMIS Parallelizing Compiler -- Nested Parallelization of the Flow Solver TFS Using the ParaWise Parallelization Environment -- Performance Characteristics of OpenMP Language Constructs on a Many-core-on-a-chip Architecture -- Improving Performance of OpenMP for SMP Clusters Through Overlapped Page Migrations -- Aspects of Code Development -- Adding New Dimensions to Performance Analysis Through User-Defined Objects -- Performance Instrumentation and Compiler Optimizations for MPI/OpenMP Applications -- Supporting Nested OpenMP Parallelism in the TAU Performance System -- Parallelization of a Hierarchical Data Clustering Algorithm Using OpenMP -- OpenMP and C++ -- Common Mistakes in OpenMP and How to Avoid Them -- Formal Specification of the OpenMP Memory Model -- Applications II -- Performance and Programmability Comparison Between OpenMP and MPI Implementations of a Molecular Modeling Application -- OpenMP Implementation of SPICE3 Circuit Simulator -- Automatic Generation of Parallel Code for Hessian Computations -- Geographical Locality and Dynamic Data Migration for OpenMP Implementations of Adaptive PDE Solvers -- Proposed Extensions to OpenMP -- A Comparison of Task Pool Variants in OpenMP and a Proposal for a Solution to the Busy Waiting Problem -- A Proposal for OpenMP for Java -- A Proposal for Error Handling in OpenMP -- Extending the OpenMP Standard for Thread Mapping and Grouping. |
Record Nr. | UNISA-996465581203316 |
Available at: Univ. di Salerno | ||
|
A Practical Programming Model for the Multi-Core Era [[electronic resource] ] : International Workshop on OpenMP, IWOMP 2007 Beijing, China, June 3-7, 2007, Proceedings / / edited by Barbara Chapman, Weimin Zheng, Guang R. Gao, Mitsuhisa Sato, Eduard Ayguadé, Dongsheng Wang |
Edition | [1st ed. 2008.] |
Publication/distribution | Berlin, Heidelberg : Springer Berlin Heidelberg : Imprint: Springer, 2008 |
Physical description | 1 online resource (XII, 207 p.) |
Discipline | 004.6 |
Series | Theoretical Computer Science and General Issues |
Topical subject |
Computer networks
Computer programming
Software engineering
Algorithms
Computer science—Mathematics
Computer simulation
Computer Communication Networks
Programming Techniques
Software Engineering
Mathematics of Computing
Computer Modelling |
ISBN | 3-540-69303-3 |
Format | Printed material |
Bibliographic level | Monograph |
Publication language | eng |
Contents note | A Proposal for Task Parallelism in OpenMP -- Support for Fine Grained Dependent Tasks in OpenMP -- Performance Evaluation of a Multi-zone Application in Different OpenMP Approaches -- Transactional Memory and OpenMP -- OpenMP on Multicore Architectures -- Supporting OpenMP on Cell -- CMP Cache Architecture and the OpenMP Performance -- Exploiting Loop-Level Parallelism for SIMD Arrays Using OpenMP -- OpenMP Extensions for Irregular Parallel Applications on Clusters -- Optimization Strategies Using Hybrid MPI+OpenMP Parallelization for Large-Scale Data Visualization on Earth Simulator -- An Investigation on Testing of Parallelized Code with OpenMP -- Loading OpenMP to Cell: An Effective Compiler Framework for Heterogeneous Multi-core Chip -- OpenMP Implementation of Parallel Linear Solver for Reservoir Simulation -- Parallel Data Flow Analysis for OpenMP Programs -- Design and Implementation of OpenMPD: An OpenMP-Like Programming Language for Distributed Memory Systems -- A New Memory Allocation Model for Parallel Search Space Data Structures with OpenMP -- Implementation of OpenMP Work-Sharing on the Cell Broadband Engine Architecture -- Toward an Automatic Code Layout Methodology -- An Efficient OpenMP Runtime System for Hierarchical Architectures -- Problems, Workarounds and Possible Solutions Implementing the Singleton Pattern with C++ and OpenMP -- Web Service Call Parallelization Using OpenMP -- Distributed Implementation of OpenMP Based on Checkpointing Aided Parallel Execution. |
Record Nr. | UNISA-996465586903316 |
Available at: Univ. di Salerno | |
|
A practical programming model for the multi-core era : 3rd International Workshop on OpenMP, IWOMP 2007, Beijing, China, June 3-7, 2007 : proceedings / Barbara Chapman ... [et al.] (eds.) |
Edition | [1st ed. 2008.] |
Publication/distribution | Berlin : Springer, 2008 |
Physical description | 1 online resource (XII, 207 p.) |
Discipline | 004.6 |
Other authors (Persons) | Chapman, Barbara |
Series |
Lecture notes in computer science
LNCS sublibrary. SL 1, Theoretical computer science and general issues |
Topical subject |
Application program interfaces (Computer software)
Parallel programming (Computer science)
OpenMP |
ISBN | 3-540-69303-3 |
Format | Printed material |
Bibliographic level | Monograph |
Publication language | eng |
Contents note | A Proposal for Task Parallelism in OpenMP -- Support for Fine Grained Dependent Tasks in OpenMP -- Performance Evaluation of a Multi-zone Application in Different OpenMP Approaches -- Transactional Memory and OpenMP -- OpenMP on Multicore Architectures -- Supporting OpenMP on Cell -- CMP Cache Architecture and the OpenMP Performance -- Exploiting Loop-Level Parallelism for SIMD Arrays Using OpenMP -- OpenMP Extensions for Irregular Parallel Applications on Clusters -- Optimization Strategies Using Hybrid MPI+OpenMP Parallelization for Large-Scale Data Visualization on Earth Simulator -- An Investigation on Testing of Parallelized Code with OpenMP -- Loading OpenMP to Cell: An Effective Compiler Framework for Heterogeneous Multi-core Chip -- OpenMP Implementation of Parallel Linear Solver for Reservoir Simulation -- Parallel Data Flow Analysis for OpenMP Programs -- Design and Implementation of OpenMPD: An OpenMP-Like Programming Language for Distributed Memory Systems -- A New Memory Allocation Model for Parallel Search Space Data Structures with OpenMP -- Implementation of OpenMP Work-Sharing on the Cell Broadband Engine Architecture -- Toward an Automatic Code Layout Methodology -- An Efficient OpenMP Runtime System for Hierarchical Architectures -- Problems, Workarounds and Possible Solutions Implementing the Singleton Pattern with C++ and OpenMP -- Web Service Call Parallelization Using OpenMP -- Distributed Implementation of OpenMP Based on Checkpointing Aided Parallel Execution. |
Variant titles | IWOMP 2007 |
Record Nr. | UNINA-9910768432803321 |
Available at: Univ. Federico II | |
|