PARCO 2011: Ghent, Belgium
Koen De Bosschere, Erik H. D'Hollander, Gerhard R. Joubert, David A. Padua, Frans J. Peters, Mark Sawyer (Eds.): Applications, Tools and Techniques on the Road to Exascale Computing, Proceedings of the conference ParCo 2011, 31 August - 3 September 2011, Ghent, Belgium. IOS Press 2012 Advances in Parallel Computing 22 ISBN 978-1-61499-040-6
Keynotes

Thomas Lippert, Thomas Eickermann, Dietmar W. Erwin: PRACE: Europe's Supercomputing Research Infrastructure. 7-18
José Luis Vázquez-Poletti, Rafael Moreno-Vozmediano, Ignacio Martín Llorente: Comparison of Admission Control Policies for Service Provision in public Clouds. 19-28
Jack B. Dennis: Program Execution Models for Massively Parallel Computing. 29-40
Andrew Adamatzky: Advances in Physarum Machines Gates, Hulls, Mazes and Routing with Slime Mould. 41-54
Algorithms
Robert Speck, Rolf Krause, Paul Gibbon: Parallel remeshing in tree codes for vortex particle methods. 57-64
Antonio J. Dios, Angeles G. Navarro, Rafael Asenjo, Francisco Corbera, Emilio L. Zapata: A case study of the task-based parallel wavefront pattern. 65-72
Chun-Sheng Chen, Nauful Shaikh, Panitee Charoenrattanaruk, Christoph F. Eick, Nouhad J. Rizk, Edgar Gabriel: Design and Evaluation of a Parallel Execution Framework for the CLEVER Clustering Algorithm. 73-80
Ashley Zebrowski, Frank Löffler, Erik Schnetter: The BL-Octree: An Efficient Data Structure for Discretized Block-Based Adaptive Mesh Refinement. 81-88
Automatic Parallelisation

Barnali Basak, Sandeep Dasgupta, Amey Karkare: Heap Dependence Analysis for Sequential Programs. 99-106
Cloud Computing
Mehdi Sheikhalishahi, Ignacio Martín Llorente, Lucio Grandinetti: Energy Aware Consolidation Policies. 109-116
Pelle Jakovits, Satish Narayans Srirama, Eero Vainikko: MapReduce for Scientific Computing - Viability for non-embarrassingly parallel algorithms. 117-124
Giuseppe Papuzzo, Giandomenico Spezzano: An Autonomic Management System for Choreography-based Workflows on Grids and Clouds. 125-132
Holger Endt, Kay Weckemann: Remote Utilization of OpenCL for Flexible Computation Offloading using Embedded ECUs, CE Devices and Cloud Servers. 133-140
GPU Applications

Vicente Galiano Ibarra, Otoniel López, Manuel P. Malumbres, Héctor Migallón Gomis: Speeding-up the discrete wavelet transform computation with multicore and GPU-based algorithms. 151-158
Usman Dastgeer, Christoph W. Kessler, Samuel Thibault: Flexible Runtime Support for Efficient Skeleton Programming on Heterogeneous GPU-based Systems. 159-166
Alan Gray, Alistair Hart, Alan Richardson, Kevin Stratford: Lattice Boltzmann for Large-Scale GPU Systems. 167-174
Christopher Scannell, Jonathan Decker, Joseph Collins, William Smith: High-fidelity Real-time Antiship Cruise Missile Modeling on the GPU. 175-182
Juan Gómez-Luna, Holger Endt, Walter Stechele, José María González-Linares, José Ignacio Benavides, Nicolas Guil: Egomotion compensation and moving objects detection algorithm on GPU. 183-190
Paul Albuquerque, Pierre Künzli, Xavier Meyer: Performance Model for a Cellular Automata Implementation on a GPU Cluster. 191-198
Volkmar Wieser, Clemens Grelck, Holger Schöner, Peter Haslinger, Karoly Bosa, Bernhard Moser: GPU-Based Image Processing Use Cases: A High-Level Approach. 199-206
Heterogeneous Computing
Sverre Jarp, Alfio Lazzaro, Julien Leduc, Andrzej Nowak, Yngve Sneen Lindal: Parallel Likelihood Function Evaluation on Heterogeneous Many-core Systems. 209-216
Holger Endt, Lothar Stolz, Martin Wechs, Walter Stechele: A Model-Based Software Generation Approach Qualified for Heterogeneous GPGPU-Enabled Platforms. 217-223
High Performance Applications
Nicolas Berr, Dirk Schmidl, Jens Henrik Göbbert, Stefan Lankes, Dieter an Mey, Thomas Bemmerl, Christian H. Bischof: Trajectory-Search on ScaleMP's vSMP Architecture. 227-234
Vladimir A. Gasilov, Alexei S. Boldarev, Sergey Dyachenko, Olga G. Olkhovskaya, Elena Kartasheva, Gennadiy Bagdasarov, Sergei Boldyrev, Irina Gasilova, Valeriy Shmyrov, Svetlana Tkachenko, Julien Grunenwald, Thierry Maillard: Towards an Application of High-Performance Computer Systems to 3D Simulations of High Energy Density Plasmas in Z-Pinches. 235-242
Laurent Berenguer, Thomas Dufaud, Toan Pham, Damien Tromeur-Dervout: On-the-fly Singular Value Decomposition for Aitken's Acceleration of the Schwarz Domain Decomposition Method. 243-250
Oliver Meister, Kaveh Rahnema, Michael Bader: A Software Concept for Cache-Efficient Simulation on Dynamically Adaptive Structured Triangular Grids. 251-260
Antoine Pedron, Lionel Lacassagne, Victor Barbillon, Franck Bimbard, Gilles Rougeron, Stéphane Le Berre: Performance Analysis of an Ultrasound Reconstruction Algorithm for Non Destructive Testing. 261-268
Languages
Juhana Helovuo, Jarkko Niittylahti, Heikki Berg: Corento SIMD Parallelism from Portable High-Level Code. 271-280
David Henty: A Parallel Benchmark Suite for Fortran Coarrays. 281-288
Eric Holk, William E. Byrd, Nilesh Mahajan, Jeremiah Willcock, Arun Chauhan, Andrew Lumsdaine: Declarative Parallel Programming for GPUs. 297-304
Load Balancing
Jörg Keller, Mudassar Majeed, Christoph W. Kessler: Balancing CPU Load for Irregular MPI Applications. 307-316
Roel Wuyts, Karl Meerbergen, Pascal Costanza: Reactive Rebalancing for Scientific Simulations running on ExaScale High Performance Computers. 317-324
Massive Parallelism
Andrew D. Brown, Jeffrey Reeve, Stephen B. Furber, David R. Lester: Processing with a million cores. 327-334
Jack B. Dennis, Guang R. Gao, Xiao X. Meng, Brian Lucas, Joshua Slocum: The Fresh Breeze Program Execution Model. 335-342
Wim Heirman, Trevor E. Carlson, Souradip Sarkar, Pieter Ghysels, Wim Vanroose, Lieven Eeckhout: Using Fast and Accurate Simulation to Explore Hardware/Software Trade-offs in the Multi-Core Era. 343-350
Marek Blazewicz, Steven R. Brandt, Peter Diener, David M. Koppelman, Krzysztof Kurowski, Frank Löffler, Erik Schnetter, Jian Tao: A Massive Data Parallel Computational Framework for Petascale/Exascale Hybrid Computer Systems. 351-358
Multicores
Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Philippas Tsigas, Andrew Richards, Raymond Namyst, Beverly Bachmayer, Christoph W. Kessler, David Moloney, Peter Sanders: The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures. 361-368
Álvaro de Vega, Diego Andrade, Basilio B. Fraguela: An efficient parallel set container for multicore architectures. 369-376
Carlos Amaral Hölbig, Andriele Busatto do Carmo, Viviane Linck Lara, Luis Paulo Arendt: Use of High Accuracy and Interval Arithmetic on Multicore Processors. 377-384
Clemens Grelck, Kevin Hammond, Heinz Hertlein, Philip K. F. Hölzenspies, Chris R. Jesshope, Raimund Kirner, Bernd Scheuermann, Alexander V. Shafarenko, Iraneus te Boekhorst, Volkmar Wieser: Engineering Concurrent Software Guided by Statistical Performance Analysis. 385-394
Numerical Algorithms
Hatem Ltaief, Piotr Luszczek, Azzam Haidar, Jack Dongarra: Solving the Generalized Symmetric Eigenvalue Problem using Tile Algorithms on Multicore Architectures. 397-404
Marek Karwacki, Przemyslaw Stpiczynski: Improving Performance of Triangular Matrix-Vector BLAS Routines on GPUs. 405-412
Irene Sánchez-Linares, Horacio Emilio Pérez Sánchez, José M. García: Accelerating Grid Kernels for Virtual Screening on Graphics Processing Units. 413-420
Edgardo Mejía-Roa, Carlos García, José Ignacio Gómez, Manuel Prieto, Christian Tenllado, Alberto D. Pascual-Montano, Francisco Tirado: Parallelism on the Nonnegative Matrix Factorization. 421-428
Jack Dongarra, Mathieu Faverge, Hatem Ltaief, Piotr Luszczek: Exploiting Fine-Grain Parallelism in Recursive LU Factorization. 429-436
Parallel I/O
Rafael Larrosa, Rafael Asenjo, Angeles G. Navarro, Bradford L. Chamberlain: A First Implementation of Parallel IO in Chapel for Block Data Distribution. 447-454
Michael Kuhn, Julian M. Kunkel, Yuichi Tsujita, Hidetaka Muguruma, Thomas Ludwig: Optimizations for Two-Phase Collective I/O. 455-462
Performance Modelling and Analysis
A. Galonska, Wolfgang Frings, Paul Gibbon, D. Borodin, A. Kirschner: JuBE-based Automatic Testing and Performance Measurement System for Fusion Codes. 465-472
Dominic Eschweiler, Michael Wagner, Markus Geimer, Andreas Knüpfer, Wolfgang E. Nagel, Felix Wolf: Open Trace Format 2: The Next Generation of Scalable Trace Formats and Support Libraries. 481-490
Frederik Vandeputte: Tools for Analyzing the Behavior and Performance of Parallel Applications. 491-498
Jan G. Cornelis, Jan Lemeire: Benchmarks Based on Anti-Parallel Patterns for the Evaluation of GPUs. 499-506
Skeleton Programming
Steffen Ernsting, Herbert Kuchen: Data Parallel Skeletons for GPU Clusters and Multi-GPU Systems. 509-518
Marco Danelutto, Luca Deri, D. De Sensi: Network Monitoring on Multicores with Algorithmic Skeletons. 519-526
Thread Management
Adnan, Mitsuhisa Sato: Experience Using Lazy Task Creation in OpenMP Task for the UTS Benchmark. 529-536
Lukas Arnold: Folding applications into high dimensional torus networks. 537-544
Andrey Marochko, Alexey Kukanov: Composable Parallelism Foundations in the Intel Threading Building Blocks Task Scheduler. 545-554
Industrial Papers

Florian Niebling, Andreas Kopecki, Martin Aumüller: Integrated Simulation Workflows in Computer Aided Engineering on HPC Resources. 565-572
Mini-Symposium ParaFPGA
Erik H. D'Hollander, Dirk Stroobandt, Abdellah Touhafi: ParaFPGA 2011 High Performance Computing with Multiple FPGAs: Design Methodology and Applications. 575-577
Michiel W. van Tol, Zdenek Pohl, Milan Tichý: A Framework for Self-adaptive Collaborative Computing on Reconfigurable Platforms. 579-586
Ron Sass, Andrew G. Schmidt, Scott Buscemi: Reconfigurable Computing Cluster - A Five-Year Perspective of the Project. 595-602
Junyan Tan, Virginie Fresse, Frédéric Rousseau: From mono-FPGA to multi-FPGA emulation platform for NoC performance evaluations. 603-610
Tom Davidson, Mattias Merlier, Karel Bruneel, Dirk Stroobandt: A Dynamically Reconfigurable Pattern Matcher for Regular Expressions on FPGA. 611-618
Mini-Symposium Exascale
Jesús Labarta, Vladimir Marjanovic, Eduard Ayguadé, Rosa M. Badia, Mateo Valero: Hybrid Parallel Programming with MPI/StarSs. 621-628
Mirko Rahn: GPI Global Address Space Programming Interface - Experiences on Scalability. 629-637
Steffen Brinkmann, José Gracia, Christoph Niethammer, Rainer Keller: TEMANEJO a debugger for task based parallel programming models. 639-645
Sameer Shende, Allen D. Malony, Wyatt Spear, Karen Schuchardt: Characterizing I/O Performance Using the TAU Performance System. 647-655
Rosa M. Badia, Jesús Labarta, Vladimir Marjanovic, Alberto F. Martín, Rafael Mayo, Enrique S. Quintana-Ortí, Ruymán Reyes: Symmetric Rank-k Update on Clusters of Multicore Processors with SMPSs. 657-664



