PARCO 2007: Jülich / Aachen, Germany
Christian H. Bischof, H. Martin Bücker, Paul Gibbon, Gerhard R. Joubert, Thomas Lippert, Bernd Mohr, Frans J. Peters (Eds.): Parallel Computing: Architectures, Algorithms and Applications, ParCo 2007, Forschungszentrum Jülich and RWTH Aachen University, Germany, 4-7 September 2007. IOS Press 2008 Advances in Parallel Computing 15 ISBN 978-1-58603-796-3
Invited Talks
Barbara M. Chapman, Lei Huang: Enhancing OpenMP and Its Implementation for Programming Multicore Systems. 3-18
Marek Behr, Mike Nicolai, Markus Probst: Efficient Parallel Simulations in Support of Medical Device Design. 19-26
Particle and Atomistic Simulation
Maxime Barrault, Guy Bencteux, Eric Cancès, William W. Hager, Claude Le Bris: Domain Decomposition for Electronic Structure Computations. 29-36
Florian Fleissner, Peter Eberhard: Load Balanced Parallel Simulation of Particle-Fluid DEM-SPH Systems with Moving Boundaries. 37-44
Godehard Sutmann, Florian Janoschek: Communication and Load Balancing of Force-Decomposition Algorithms for Parallel Molecular Dynamics. 45-52
Martin Bernreuther, Martin Buchholz, Hans-Joachim Bungartz: Aspects of a Parallel Molecular Dynamics Software for Nano-Fluidics. 53-60
Marcus Richter, Guido Arnold, Binh Trieu, Thomas Lippert: Massively Parallel Quantum Computer Simulations: Towards Realistic Systems. 61-68
Image Processing and Visualization
Daniel Stødle, Phuong Hoai Ha, John Markus Bjørndalen, Otto J. Anshus: Lessons Learned Using a Camera Cluster to Detect and Locate Objects. 71-78
Marc Wolter, Marc Schirski, Torsten Kuhlen: Hybrid Parallelization for Interactive Exploration in Virtual Environments. 79-86
Performance Modeling and Tools
Darren J. Kerbyson, Kevin J. Barker, Kei Davis: Analysis of the Weather Research and Forecasting (WRF) Model on Large-Scale Systems. 89-98
Diego Rodriguez Martínez, Vicente Blanco Pérez, Marcos Boullón, José Carlos Cabaleiro, Tomás F. Pena: Analytical Performance Models of Parallel Programs in Clusters. 99-106
Robert W. Numrich: Computational Force: A Unifying Concept for Scalability Analysis. 107-112
Michael Gerndt, Sebastian Strohhäcker: Distribution of Periscope Analysis Agents on ALTIX 4700. 113-120
Jost Berthold, Rita Loogen: Visualizing Parallel Functional Program Runs: Case Studies with the Eden Trace Viewer. 121-128
Biomedical Applications
Antonella Galizia, Federica Viti, Daniele D'Agostino, Ivan Merelli, Luciano Milanesi, Andrea Clematis: Experimenting Grid Protocols to Improve Privacy Preservation in Efficient Distributed Image Processing. 139-146
Daniele D'Agostino, Ivan Merelli, Andrea Clematis, Luciano Milanesi, Alessandro Orro: A Parallel Workflow for the Reconstruction of Molecular Surfaces. 147-154
Tony Stöcker, Kaveh Vahedipour, Nadim Joni Shah: HPC Simulation of Magnetic Resonance Imaging. 155-164
José Antonio Álvarez, Javier Roca Piera, José-Jesús Fernández: A Load Balancing Framework in Multithreaded Tomographic Reconstruction. 165-172
Parallel Algorithms
Michael Bader, Sebastian Hanigk, Thomas Huckle: Parallelisation of Block-Recursive Matrix Multiplication in Prefix Computations. 175-184

Parallel Programming Models
Michael Süß, Claudia Leopold: Implementing Data-Parallel Patterns for Shared Memory with OpenMP. 203-210
Bjoern Knafla, Claudia Leopold: Parallelizing a Real-Time Steering Simulation for Computer Games with OpenMP. 219-226
Christoph W. Kessler, Welf Löwe: A Framework for Performance-Aware Composition of Explicitly Parallel Components. 227-234
Marco Aldinucci, Marco Danelutto, Peter Kilpatrick: A Framework for Prototyping and Reasoning about Distributed Systems. 235-242
Joel Falcou, Jocelyn Sérot: Formal Semantics Applied to the Implementation of a Skeleton-Based Parallel Programming Library. 243-252
Numerical Algorithms and Automatic Differentiation
José M. Badía, Peter Benner, Maribel Castillo, Heike Faßbender, Rafael Mayo, Enrique S. Quintana-Ortí, Gregorio Quintana-Ortí: Strategies for Parallelizing the Solution of Rational Matrix Equations. 255-262
Francisco-Jose Martínez-Zaldívar, Antonio-Manuel Vidal-Maciá, Alberto González: A Heterogeneous Pipelined Parallel Algorithm for Minimum Mean Squared Error Estimation with Ordered Successive Interference Cancellation. 263-270
Andreas Honecker, Josef Schüle: OpenMP Implementation of the Householder Reduction for Large Complex Hermitian Eigenvalue Problems. 271-278
Carlos García, Manuel Prieto, Francisco Tirado: Multigrid Smoothers on Multicore Architectures. 279-286
José Ignacio Aliaga, Matthias Bollhöfer, Alberto F. Martín, Enrique S. Quintana-Ortí: Parallelization of Multilevel Preconditioners Constructed from Inverse-Based ILUs on Shared-Memory Multiprocessors. 287-294
Arno Rasch, H. Martin Bücker, Christian H. Bischof: Automatic Computation of Sensitivities for a Parallel Aerodynamic Simulation. 303-310
Scheduling
Jörg Dümmler, Raphael Kunis, Gudula Rünger: Layer-Based Scheduling Algorithms for Multiprocessor-Tasks with Precedence Constraints. 321-328
N. Peter Drakenberg, Sven Trautmann: Unified Scheduling of I/O- and Computation-Jobs for Climate Research Environments. 329-336
Fault Tolerance
Vinod Tipparaju, Manojkumar Krishnan, Bruce Palmer, Fabrizio Petrini, Jarek Nieplocha: Towards Fault Resilient Global Arrays. 339-345
Diego Sevilla, José M. García, Antonio Gómez: Using AOP to Automatically Provide Distribution, Fault Tolerance, and Load Balancing to the CORBA-LC Component Model. 347-354
Marco Aldinucci, Marco Danelutto, Massimo Torquati, Francesco Polzella, Gianmarco Spinatelli, Marco Vanneschi, Alessandro Gervaso, Manuel Cacitti, Pierfrancesco Zuccato: VirtuaLinux: Virtualized High-Density Clusters with no Single Point of Failure. 355-362
Performance Analysis
Robert Schöne, Wolfgang E. Nagel, Stefan Pflüger: Analyzing Cache Bandwidth on the Intel Core 2 Architecture. 365-372
Rick Janda, Matthias S. Müller, Wolfgang E. Nagel, Bernd Trenkler: Analyzing Mutual Influences of High Performance Computing Programs on SGI Altix 3700 and 4700 Systems with PARbench. 373-380
Carolina Bonacic, Mauricio Marín: Comparative Study of Concurrency Control on Bulk-Synchronous Parallel Search Engines. 389-396
Stylianos Bounanos, Martin Fleury: Gb Ethernet Protocols for Clusters: An OpenMPI, TIPC, GAMMA Case Study. 397-404
Michael Hofmann, Gudula Rünger: Performance Measurements and Analysis of the BlueGene/L MPI Implementation. 405-412
Rafik A. Salama, Ahmed Sameh: Potential Performance Improvement of Collective Operations in UPC. 413-422
Parallel Data Distribution and I/O
Jan Seidel, Rudolf Berrendorf, Ace Crngarov, Marc-André Hermanns: Optimization Strategies for Data Distribution Schemes in a Parallel File System. 425-432

Fluid and Magnetohydrodynamics Simulation
Andreas Wolf, Volker Rath, H. Martin Bücker: Parallelisation of a Geothermal Simulation Package: A Case Study on Four Multicore Architectures. 451-458
Yusuke Arai, Ryo Sawai, Yoshiki Yamaguchi, Tsutomu Maruyama, Moritoshi Yasunaga: A Lattice Gas Cellular Automata Simulator on the Cell Broadband Engine. 459-466
Lukas Arnold, Christoph Beetz, Jürgen Dreher, Holger Homann, Christian Schwarz, Rainer Grauer: Massively Parallel Simulations of Solar Flares and Plasma Turbulence. 467-474
Vladimir A. Gasilov, Sergei V. D'yachenko, Olga G. Olkhovskaya, Alexei S. Boldarev, Elena L. Kartasheva, Sergei Boldyrev: Object-Oriented Programming and Parallel Computing in Radiative Magnetohydrodynamics Simulations. 475-482
Axelle Viré, Dmitry Krasnov, Bernard Knaepen, Thomas Boeck: Parallel Simulation of Turbulent Magneto-hydrodynamic Flows. 483-490
Parallel Tools and Middleware
Ivan Rodero, Francesc Guim, Julita Corbalán, Jesús Labarta: Design and Implementation of a General-Purpose API of Progress and Performance Indicators. 501-508
José Manuel Velasco, David Atienza, Katzalin Olcoz, Francisco Tirado: Efficient Object Placement including Node Selection in a Distributed Virtual Machine. 509-516
Rainer Keller, Shiqing Fan, Michael M. Resch: Memory Debugging of MPI-Parallel Applications in Open MPI. 517-523
Hyperscalable Applications
Abhinav Verma, Srinivasa M. Gopal, Alexander Schug, Jung S. Oh, Konstantin V. Klenin, Kyu H. Lee, Wolfgang Wenzel: Massively Parallel All Atom Protein Folding in a Single Day. 527-534
Thomas Streuer, Hinnerk Stüben: Simulations of QCD in the Era of Sustained Tflop/s Computing. 535-542
Stefan Krieg: Optimizing Lattice QCD Simulations on BlueGene/L. 543-550
Parallel Computing with FPGAs
Francesco Belletti, Maria Cotallo, Andres Cruz Flor, Luis Antonio Fernandez, Antonio Gordillo, Andrea Maiorano, Filippo Mantovani, Enzo Marinari, Victor Martin-Mayor, Antonio Munoz Sudupe, Denis Navarro, Sergio Perez Gaviro, Mauro Rossi, Juan Jesus Ruiz-Lorenzo, Sebastiano Fabio Schifano, Daniele Sciretti, Alfonso Tarancón, Raffaele Tripiccione, Jose Luis Velasco: IANUS: Scientific Computing on an FPGA-Based Architecture. 553-560
Ling Zhuo, Viktor K. Prasanna: Optimizing Matrix Multiplication on Heterogeneous Reconfigurable Systems. 561-568
Mini-Symposium "The Future of OpenMP in the Multi-Core Era"

Van Bui, Oscar Hernandez, Barbara M. Chapman, Rick Kufrin, Danesh Tafti, Pradeep Gopalkrishnan: Towards an Implementation of the OpenMP Collector API. 573-580
Mini-Symposium "Scaling Science Applications on Blue Gene"
William D. Gropp, Wolfgang Frings, Marc-André Hermanns, Ed Jedlicka, Kirk E. Jordan, Fred Mintzer, Boris Orth: Scaling Science Applications on Blue Gene. 583-584
Kevin Stratford, Jean Christophe Desplat: Large Simulations of Shear Flow in Mixtures via the Lattice Boltzmann Equation. 593-600
Andreas Dolfen, Yuan Lung Luo, Erik Koch: Simulating Materials with Strong Correlations on BlueGene/L. 601-608
Jeffrey J. Fox, Gregery T. Buzzard, Robert Miller, Fernando Siso-Nadal: Massively Parallel Simulation of Cardiac Electrical Wave Propagation on Blue Gene. 609-616
Mini-Symposium "Scalability and Usability of HPC Programming Tools"
Felix Wolf, Daniel Becker, Bettina Krammer, Dieter an Mey, Shirley Moore, Matthias S. Müller: Scalability and Usability of HPC Programming Tools. 619-620
Gregory L. Lee, Dong H. Ahn, Dorian C. Arnold, Bronis R. de Supinski, Barton P. Miller, Martin Schulz: Benchmarking the Stack Trace Analysis Tool for BlueGene/L. 621-628
Kevin A. Huck, Allen D. Malony, Sameer Shende, Alan Morris: Scalable, Automated Performance Analysis with TAU and PerfExplorer. 629-636
Matthias S. Müller, Andreas Knüpfer, Matthias Jurenz, Matthias Lieber, Holger Brunst, Hartmut Mix, Wolfgang E. Nagel: Developing Scalable Applications with Vampir, VampirServer and VampirTrace. 637-644
Markus Geimer, Björn Kuhlmann, Farzona Pulatova, Felix Wolf, Brian J. N. Wylie: Scalable Collation and Presentation of Call-Path Profile Data with CUBE. 645-652
Bettina Krammer, Valentin Himmler, David Lecomber: Coupling DDT and Marmot for Debugging of MPI Applications. 653-660
Oscar Hernandez, Haoqiang Jin, Barbara M. Chapman: Compiler Support for Efficient Instrumentation. 661-668
Christian Terboven: Comparing Intel Thread Checker and Sun Thread Analyzer. 669-676
Mini-Symposium "DEISA: Extreme Computing in an Advanced Supercomputing Environment"
Hermann Lederer, Gavin J. Pringle, Denis Girou, Marc-André Hermanns, Giovanni Erbacci: DEISA: Extreme Computing in an Advanced Supercomputing Environment. 687-688
Hermann Lederer, Victor Alessandrini: DEISA: Enabling Cooperative Extreme Computing in Europe. 689-696
Alice E. Koniges, Brian T. N. Gunney, Robert W. Anderson, Aaron C. Fisher, Nathan D. Masters: Development Strategies for Modern Predictive Simulation Codes. 697-704
Gavin J. Pringle, Terence M. Sloan, Elena Breitmoser, Odysseas Bournas, Arthur S. Trew: Submission Scripts for Scientific Simulations on DEISA. 705-711
Hermann Lederer, Reinhard Tisma, Roman Hatzky, Alberto Bottino, Frank Jenko: Application Enabling in DEISA: Petascaling of Plasma Turbulence Codes. 713-720
Alessandra S. Lanotte, Federico Toschi: HEAVY: A High Resolution Numerical Experiment in Lagrangian Turbulence. 721-728
Elmar Krieger, Laurent Leger, Marie-Pierre Durrieu, Nada Taib, Peter Bond, Michel Laguerre, Richard Lavery, Mark S. P. Sansom, Marc Baaden: Atomistic Modeling of the Membrane-Embedded Synaptic Fusion Complex: a Grand Challenge Project on the DEISA HPC Infrastructure. 729-736
Mini-Symposium "Parallel Computing with FPGAs"
Erik H. D'Hollander, Dirk Stroobandt, Abdellah Touhafi: Parallel Computing with FPGAs - Concepts and Applications. 739-740
Tim Güneysu, Christof Paar, Jan Pelzl, Gerd Pfeiffer, Manfred Schimmler, Christian Schleiffer: Parallel Computing with Low-Cost FPGAs: A Framework for COPACOBANA. 741-748
Tobias Schumacher, Enno Lübbers, Paul Kaufmann, Marco Platzner: Accelerating the Cube Cut Problem with an FPGA-Augmented Compute Cluster. 749-756
Jeff Furlong, Andrew Felch, Jayram Moorkanikara Nageswaran, Nikil Dutt, Alex Nicolau, Alexander V. Veidenbaum, Ashok Chandrashekar, Richard Granger: Novel Brain-Derived Algorithms Scale Linearly with Number of Processing Elements. 767-776
Martin Botteck, Holger Blume, Jörg von Livonius, Martin Neuenhahn, Tobias G. Noll: Programmable Architectures for Realtime Music Decompression. 777-784
Alessandro Marongiu, Paolo Palazzari: The HARWEST High Level Synthesis Flow to Design a Special-Purpose Architecture to Simulate the 3D Ising Model. 785-792
Séamas McGettrick, Dermot Geraghty, Ciarán McElroy: Towards an FPGA Solver for the PageRank Eigenvector Problem. 793-800



