![]() | ![]() |
| 2012 | ||
|---|---|---|
| 63 | Kishor Kharbas, Donghoon Kim, Torsten Hoefler, Frank Mueller: Assessing HPC Failure Detectors for MPI Jobs. PDP 2012: 81-88 | |
| 62 | Torsten Hoefler, Timo Schneider: Communication-centric optimizations by dynamically detecting collective operations. PPOPP 2012: 305-306 | |
| 61 | Fredrik Kjolstad, Torsten Hoefler, Marc Snir: Automatic datatype generation and optimization. PPOPP 2012: 327-328 | |
| 60 | Torsten Hoefler: Extensions for next-generation parallel programming models. Parallel Computing 38(1-2): 1 (2012) | |
| 2011 | ||
| 59 | Timo Schneider, Sven Eckelmann, Torsten Hoefler, Wolfgang Rehm: Kernel-Based Offload of Collective Operations - Implementation, Evaluation and Lessons Learned. Euro-Par (2) 2011: 264-275 | |
| 58 | William Gropp, Torsten Hoefler, Rajeev Thakur, Jesper Larsson Träff: Performance Expectations and Guidelines for MPI Derived Datatypes. EuroMPI 2011: 150-159 | |
| 57 | Torsten Hoefler, Marc Snir: Writing Parallel Libraries with MPI - Common Practice, Issues, and Extensions. EuroMPI 2011: 345-355 | |
| 56 | Vishwanath Venkatesan, Mohamad Chaarawi, Edgar Gabriel, Torsten Hoefler: Design and Evaluation of Nonblocking Collective I/O Operations. EuroMPI 2011: 90-98 | |
| 55 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: Active pebbles: parallel programming for data-driven applications. ICS 2011: 235-244 | |
| 54 | Torsten Hoefler, Marc Snir: Generic topology mapping strategies for large-scale parallel architectures. ICS 2011: 75-84 | |
| 53 | Jens Domke, Torsten Hoefler, Wolfgang E. Nagel: Deadlock-Free Oblivious Routing for Arbitrary Topologies. IPDPS 2011: 616-627 | |
| 52 | Torsten Hoefler: HIPS Introduction. IPDPS Workshops 2011: 1139-1140 | |
| 51 | Eric Holk, William E. Byrd, Jeremiah Willcock, Torsten Hoefler, Arun Chauhan, Andrew Lumsdaine: Kanor - A Declarative Language for Explicit Communication. PADL 2011: 190-204 | |
| 50 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: Active pebbles: a programming model for highly parallel fine-grained data-driven computations. PPOPP 2011: 305-306 | |
| 49 | Torsten Hoefler, Rolf Rabenseifner, Hubert Ritzdorf, Bronis R. de Supinski, Rajeev Thakur, Jesper Larsson Träff: The scalable process topology interface of MPI 2.2. Concurrency and Computation: Practice and Experience 23(4): 293-310 (2011) | |
| 48 | Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff: Mpi on millions of Cores. Parallel Processing Letters 21(1): 45-60 (2011) | |
| 2010 | ||
| 47 | Torsten Hoefler: Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC. Euro-Par Workshops 2010: 483-491 | |
| 46 | Torsten Hoefler, Steven Gottlieb: Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient Using MPI Datatypes. EuroMPI 2010: 132-141 | |
| 45 | Torsten Hoefler, William Gropp, Rajeev Thakur, Jesper Larsson Träff: Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues. EuroMPI 2010: 21-30 | |
| 44 | Torsten Hoefler, Greg Bronevetsky, Brian Barrett, Bronis R. de Supinski, Andrew Lumsdaine: Efficient MPI Support for Advanced Hybrid Programming Models. EuroMPI 2010: 50-61 | |
| 43 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: LogGOPSim: simulating large-scale applications in the LogGOPS model. HPDC 2010: 597-604 | |
| 42 | Nick Edmonds, Torsten Hoefler, Andrew Lumsdaine: A space-efficient parallel algorithm for computing betweenness centrality in distributed memory. HiPC 2010: 1-10 | |
| 41 | L. Baba Arimilli, Ravi Arimilli, Vicente Chung, Scott Clark, Wolfgang E. Denzel, Ben C. Drerup, Torsten Hoefler, Jody B. Joyner, Jerry Lewis, Jian Li, Nan Ni, Ramakrishnan Rajamony: The PERCS High-Performance Interconnect. Hot Interconnects 2010: 75-82 | |
| 40 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: AM++: a generalized active message framework. PACT 2010: 401-410 | |
| 39 | Torsten Hoefler, Christian Siebert, Andrew Lumsdaine: Scalable communication protocols for dynamic sparse data exchange. PPOPP 2010: 159-168 | |
| 38 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Characterizing the Influence of System Noise on Large-Scale Applications by Simulation. SC 2010: 1-11 | |
| 37 | Torsten Hoefler: Software and Hardware Techniques for Power-Efficient HPC Networking. Computing in Science and Engineering 12(6): 30-37 (2010) | |
| 36 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Accurately measuring overhead, communication time and progression of blocking and nonblocking collective operations at massive scale. IJPEDS 25(4): 241-258 (2010) | |
| 2009 | ||
| 35 | Prabhanjan Kambadur, Anshul Gupta, Torsten Hoefler, Andrew Lumsdaine: Demand-driven execution of static directed acyclic graphs using task parallelism. HiPC 2009: 284-293 | |
| 34 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Optimized Routing for Large-Scale InfiniBand Networks. Hot Interconnects 2009: 103-111 | |
| 33 | Torsten Hoefler, Christian Siebert, Andrew Lumsdaine: Group Operation Assembly Language - A Flexible Way to Express Collective Communication. ICPP 2009: 574-581 | |
| 32 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: A power-aware, application-based performance study of modern commodity cluster interconnection networks. IPDPS 2009: 1-7 | |
| 31 | Christian Kaiser, Torsten Hoefler, Boris Bierbaum, Thomas Bemmerl: Implementation and analysis of nonblocking collective operations on SCI networks. IPDPS 2009: 1-7 | |
| 30 | Torsten Hoefler, Jesper Larsson Träff: Sparse collective operations for MPI. IPDPS 2009: 1-8 | |
| 29 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: The impact of network noise at large-scale communication performance. IPDPS 2009: 1-8 | |
| 28 | Torsten Hoefler, Andrew Lumsdaine, Jack Dongarra: Towards Efficient MapReduce Using MPI. PVM/MPI 2009: 240-249 | |
| 27 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: The Effect of Network Noise on Large-Scale Collective Communications. Parallel Processing Letters 19(4): 573-593 (2009) | |
| 26 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: LogGP in theory and practice - An in-depth analysis of modern interconnection networks and benchmarking methods for collective operations. Simulation Modelling Practice and Theory 17(9): 1511-1521 (2009) | |
| 2008 | ||
| 25 | Torsten Hoefler, Andrew Lumsdaine: Overlapping Communication and Computation with High Level Communication Routines. CCGRID 2008: 572-577 | |
| 24 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Multistage switches are not crossbars: Effects of static routing in high-performance networks. CLUSTER 2008: 116-125 | |
| 23 | Torsten Hoefler, Andrew Lumsdaine: Message progression in parallel computing - to thread or not to thread? CLUSTER 2008: 213-222 | |
| 22 | Patrick Geoffray, Torsten Hoefler: Adaptive Routing Strategies for Modern High Performance Networks. Hot Interconnects 2008: 165-172 | |
| 21 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Accurately measuring collective operations at massive scale. IPDPS 2008: 1-8 | |
| 20 | Torsten Hoefler, Andrew Lumsdaine: Optimizing non-blocking collective operations for infiniband. IPDPS 2008: 1-8 | |
| 19 | Timo Schneider, Torsten Hoefler, Simon Wunderlich, Torsten Mehlan, Wolfgang Rehm: An Optimized ZGEMM Implementation for the Cell BE. PASA 2008: 113-122 | |
| 18 | Torsten Hoefler, Florian Lorenzen, Andrew Lumsdaine: Sparse Non-blocking Collectives in Quantum Mechanical Calculations. PVM/MPI 2008: 55-63 | |
| 17 | Torsten Hoefler, Maraike Schellmann, Sergei Gorlatch, Andrew Lumsdaine: Communication Optimization for Medical Image Reconstruction Algorithms. PVM/MPI 2008: 75-83 | |
| 16 | Torsten Hoefler, Peter Gottschling, Andrew Lumsdaine: Leveraging non-blocking collective communication in high-performance applications. SPAA 2008: 113-115 | |
| 2007 | ||
| 15 | Torsten Hoefler, Torsten Mehlan, Andrew Lumsdaine, Wolfgang Rehm: Netgauge: A Network Performance Measurement Framework. HPCC 2007: 659-671 | |
| 14 | Torsten Hoefler, Christian Siebert, Wolfgang Rehm: A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast. IPDPS 2007: 1-8 | |
| 13 | Torsten Hoefler, Andre Lichei, Wolfgang Rehm: Low-Overhead LogGP Parameter Assessment for Modern Interconnection Networks. IPDPS 2007: 1-8 | |
| 12 | Torsten Hoefler, Prabhanjan Kambadur, Richard L. Graham, Galen M. Shipman, Andrew Lumsdaine: A Case for Standard Non-blocking Collective Operations. PVM/MPI 2007: 125-134 | |
| 11 | Torsten Hoefler, Andrew Lumsdaine, Wolfgang Rehm: Implementation and performance analysis of non-blocking collective operations for MPI. SC 2007: 52 | |
| 10 | Torsten Hoefler, Peter Gottschling, Andrew Lumsdaine, Wolfgang Rehm: Optimizing a conjugate gradient solver with non-blocking collective operations. Parallel Computing 33(9): 624-633 (2007) | |
| 2006 | ||
| 9 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters. ARCS Workshops 2006: 343-350 | |
| 8 | Frank Mietke, Robert Rex, Robert Baumgartl, Torsten Mehlan, Torsten Hoefler, Wolfgang Rehm: Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack. Euro-Par 2006: 124-133 | |
| 7 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Fast barrier synchronization for InfiniBand/spl trade/. IPDPS 2006 | |
| 6 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: LogfP - a model for small messages in InfiniBand. IPDPS 2006 | |
| 5 | Torsten Hoefler, Jeffrey M. Squyres, Wolfgang Rehm, Andrew Lumsdaine: A Case for Non-blocking Collective Operations. ISPA Workshops 2006: 155-164 | |
| 4 | Torsten Mehlan, Jochen Strunk, Torsten Hoefler, Frank Mietke, Wolfgang Rehm: IRS - A Portable Interface for Reconfigurable Systems. PARELEC 2006: 187-191 | |
| 3 | Torsten Hoefler, Carsten Viertel, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Assessing Single-Message and Multi-Node Communication Performance of InfiniBand. PARELEC 2006: 227-232 | |
| 2 | Torsten Hoefler, Peter Gottschling, Wolfgang Rehm, Andrew Lumsdaine: Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations. PVM/MPI 2006: 374-382 | |
| 2005 | ||
| 1 | Torsten Hoefler, Lavinio Cerquetti, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: A Practical Approach to the Rating of Barrier Algorithms Using the LogP Model and Open MPI. ICPP Workshops 2005: 562-569 | |
Colors in the list of coauthors
Last update Thu May 31 18:55:10 2012 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page