Jesper Larsson Träff
Person information
- affiliation: TU Wien, Vienna, Austria
2020 – today
- 2024
- [c123]Ioannis Vardas, Sascha Hunold, Philippe Swartvagher, Jesper Larsson Träff:
Improved Parallel Application Performance and Makespan by Colocation and Topology-aware Process Mapping. CCGrid 2024: 119-124 - [i36]Jesper Larsson Träff:
Optimal Broadcast Schedules in Logarithmic Time with Applications to Broadcast, All-Broadcast, Reduction and All-Reduction. CoRR abs/2407.18004 (2024) - [i35]Jesper Larsson Träff:
Lectures on Parallel Computing. CoRR abs/2407.18795 (2024) - [i34]Jesper Larsson Träff:
Optimal, Non-pipelined Reduce-scatter and Allreduce Algorithms. CoRR abs/2410.14234 (2024) - 2023
- [j38]Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Realizing multioperations and multiprefixes in Thick Control Flow processors. Microprocess. Microsystems 98: 104807 (2023) - [c122]Jesper Larsson Träff, Sascha Hunold, Ioannis Vardas, Nikolaus Manes Funk:
Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI. CLUSTER 2023: 284-294 - [c121]Ioannis Vardas, Sascha Hunold, Philippe Swartvagher, Jesper Larsson Träff:
Exploring Mapping Strategies for Co-allocated HPC Applications. Euro-Par Workshops 2023: 271-276 - [c120]Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Preliminary Performance and Memory Access Scalability Study of Thick Control Flow Processors. NorCAS 2023: 1-7 - [c119]Jesper Larsson Träff, Ioannis Vardas:
Library Development with MPI: Attributes, Request Objects, Group Communicator Creation, Local Reductions, and Datatypes. EuroMPI 2023: 5:1-5:10 - [c118]Philippe Swartvagher, Sascha Hunold, Jesper Larsson Träff, Ioannis Vardas:
Using Mixed-Radix Decomposition to Enumerate Computational Resources of Deeply Hierarchical Architectures. SC Workshops 2023: 404-415 - [i33]Jesper Larsson Träff:
Round-optimal n-Block Broadcast Schedules in Logarithmic Time. CoRR abs/2312.11236 (2023) - 2022
- [j37]Martti Forsell, Sara Nikula, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Performance and programmability comparison of the thick control flow architecture and current multicore processors. J. Supercomput. 78(3): 3152-3183 (2022) - [c117]Jesper Larsson Träff:
Fast(er) Construction of Round-optimal $n$-Block Broadcast Schedules. CLUSTER 2022: 142-151 - [c116]Sascha Hunold, Jordy I. Ajanohoun, Ioannis Vardas, Jesper Larsson Träff:
An Overhead Analysis of MPI Profiling and Tracing Tools. PERMAVOST@HPDC 2022: 5-13 - [c115]Ioannis Vardas, Sascha Hunold, Jordy I. Ajanohoun, Jesper Larsson Träff:
mpisee: MPI Profiling for Communication and Communicator Structure. IPDPS Workshops 2022: 520-529 - [c114]Jesper Larsson Träff:
Brief Announcement: Fast(er) Construction of Round-optimal n-Block Broadcast Schedules. SPAA 2022: 143-146 - [i32]Jesper Larsson Träff:
(Poly)Logarithmic Time Construction of Round-optimal n-Block Broadcast Schedules for Broadcast and irregular Allgather in MPI. CoRR abs/2205.10072 (2022) - 2021
- [j36]Jesper Larsson Träff, Sascha Hunold, Guillaume Mercier, Daniel J. Holmes:
MPI collective communication through a single set of interfaces: A case for orthogonality. Parallel Comput. 107: 102826 (2021) - [c113]Jesper Larsson Träff, Manuel Pöter:
A more pragmatic implementation of the lock-free, ordered, linked list. PPoPP 2021: 457-459 - [i31]Jesper Larsson Träff:
A Doubly-pipelined, Dual-root Reduction-to-all Algorithm and Implementation. CoRR abs/2109.12626 (2021) - 2020
- [j35]Konrad von Kirchbach, Christian Schulz, Jesper Larsson Träff:
Better Process Mapping and Sparse Quadratic Assignment. ACM J. Exp. Algorithmics 25: 1-19 (2020) - [j34]Jesper Larsson Träff, Torsten Hoefler:
Special issue: Selected papers from EuroMPI 2019. Parallel Comput. 99: 102695 (2020) - [c112]Konrad von Kirchbach, Markus Lehr, Sascha Hunold, Christian Schulz, Jesper Larsson Träff:
Efficient Process-to-Node Mapping Algorithms for Stencil Computations. CLUSTER 2020: 1-11 - [c111]Jesper Larsson Träff, Sascha Hunold:
Decomposing MPI Collectives for Exploiting Multi-lane Communication. CLUSTER 2020: 270-280 - [c110]Martti Forsell, Jussi Roivainen, Jesper Larsson Träff:
Optimizing Memory Access in TCF Processors with Compute-Update Operations. IPDPS Workshops 2020: 577-586 - [c109]Jesper Larsson Träff, Sascha Hunold, Guillaume Mercier, Daniel J. Holmes:
Collectives and Communicators: A Case for Orthogonality: (Or: How to get rid of MPI neighbor and enhance Cartesian collectives). EuroMPI 2020: 31-38 - [c108]Jesper Larsson Träff:
Signature Datatypes for Type Correct Collective Operations, Revisited. EuroMPI 2020: 81-88 - [c107]Marcelo Fonseca Faraj, Alexander van der Grinten, Henning Meyerhenke, Jesper Larsson Träff, Christian Schulz:
High-Quality Hierarchical Process Mapping. SEA 2020: 4:1-4:15 - [i30]Marcelo Fonseca Faraj, Alexander van der Grinten, Henning Meyerhenke, Jesper Larsson Träff, Christian Schulz:
High-Quality Hierarchical Process Mapping. CoRR abs/2001.07134 (2020) - [i29]Sascha Hunold, Konrad von Kirchbach, Markus Lehr, Christian Schulz, Jesper Larsson Träff:
Efficient Process-to-Node Mapping Algorithms for Stencil Computations. CoRR abs/2005.09521 (2020) - [i28]Jesper Larsson Träff:
k-ported vs. k-lane Broadcast, Scatter, and Alltoall Algorithms. CoRR abs/2008.12144 (2020) - [i27]Jesper Larsson Träff, Manuel Pöter:
A more Pragmatic Implementation of the Lock-free, Ordered, Linked List. CoRR abs/2010.15755 (2020)
2010 – 2019
- 2019
- [j33]Qiao Kang, Jesper Larsson Träff, Reda Al-Bahrani, Ankit Agrawal, Alok N. Choudhary, Wei-keng Liao:
Scalable Algorithms for MPI Intergroup Allgather and Allgatherv. Parallel Comput. 85: 220-230 (2019) - [j32]Jesper Larsson Träff:
On Optimal Trees for Irregular Gather and Scatter Collectives. IEEE Trans. Parallel Distributed Syst. 30(9): 2060-2074 (2019) - [c106]Jesper Larsson Träff, Sascha Hunold:
Cartesian Collective Communication. ICPP 2019: 48:1-48:11 - [c105]Carlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer, Jesper Larsson Träff:
How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures. ICPP 2019: 67:1-67:10 - [c104]Jesper Larsson Träff, Torsten Hoefler:
Foreword EuroMPI 2019. EuroMPI 2019: 1:1-1:2 - [e10]Torsten Hoefler, Jesper Larsson Träff:
Proceedings of the 26th European MPI Users' Group Meeting, EuroMPI 2019, Zürich, Switzerland, September 11-13, 2019. ACM 2019, ISBN 978-1-4503-7175-9 - [i26]Michael Kainer, Jesper Larsson Träff:
More Parallelism in Dijkstra's Single-Source Shortest Path Algorithm. CoRR abs/1903.12085 (2019) - [i25]Carlos Pachajoa, Markus Levonyak, Wilfried N. Gansterer, Jesper Larsson Träff:
How to Make the Preconditioned Conjugate Gradient Method Resilient Against Multiple Node Failures. CoRR abs/1907.13077 (2019) - [i24]Jesper Larsson Träff:
Decomposing Collectives for Exploiting Multi-lane Communication. CoRR abs/1910.13373 (2019) - 2018
- [j31]Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Supporting concurrent memory access in TCF processor architectures. Microprocess. Microsystems 63: 226-236 (2018) - [j30]Jesper Larsson Träff:
Practical, distributed, low overhead algorithms for irregular gather and scatter collectives. Parallel Comput. 75: 100-117 (2018) - [c103]Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Implementation of Multioperations in Thick Control Flow Processors. IPDPS Workshops 2018: 744-752 - [c102]Manuel Pöter, Jesper Larsson Träff:
Stamp-it, amortized constant-time memory reclamation in comparison to five other schemes. PPoPP 2018: 413-414 - [c101]Qiao Kang, Jesper Larsson Träff, Reda Al-Bahrani, Ankit Agrawal, Alok N. Choudhary, Wei-keng Liao:
Full-Duplex Inter-Group All-to-All Broadcast Algorithms with Optimal Bandwidth. EuroMPI 2018: 1:1-1:10 - [c100]Manuel Pöter, Jesper Larsson Träff:
Brief Announcement: Stamp-it, a more Thread-efficient, Concurrent Memory Reclamation Scheme in the C++ Memory Model. SPAA 2018: 355-358 - [i23]Manuel Pöter, Jesper Larsson Träff:
Memory Models for C/C++ Programmers. CoRR abs/1803.04432 (2018) - [i22]Jesper Larsson Träff:
Parallel Quicksort without Pairwise Element Exchange. CoRR abs/1804.07494 (2018) - [i21]Manuel Pöter, Jesper Larsson Träff:
Stamp-it: A more Thread-efficient, Concurrent Memory Reclamation Scheme in the C++ Memory Model. CoRR abs/1805.08639 (2018) - 2017
- [j29]Alexandra Carpen-Amarie, Sascha Hunold, Jesper Larsson Träff:
On expected and observed communication performance with MPI derived datatypes. Parallel Comput. 69: 98-117 (2017) - [c99]Seyed Hessam Mirsadeghi, Jesper Larsson Träff, Pavan Balaji, Ahmad Afsahi:
Exploiting Common Neighborhoods to Optimize MPI Neighborhood Collectives. HiPC 2017: 348-357 - [c98]Martti Forsell, Jussi Roivainen, Ville Leppänen, Jesper Larsson Träff:
Supporting concurrent memory access in TCF-aware processor architectures. NORCAS 2017: 1-6 - [c97]Jesper Larsson Träff:
Practical, linear-time, fully distributed algorithms for irregular gather and scatter. EuroMPI/USA 2017: 1:1-1:10 - [c96]Christian Schulz, Jesper Larsson Träff:
Better Process Mapping and Sparse Quadratic Assignment. SEA 2017: 4:1-4:15 - [i20]Christian Schulz, Jesper Larsson Träff:
Better Process Mapping and Sparse Quadratic Assignment. CoRR abs/1702.04164 (2017) - [i19]Jesper Larsson Träff:
Practical, Linear-time, Fully Distributed Algorithms for Irregular Gather and Scatter. CoRR abs/1702.05967 (2017) - [i18]Christian Schulz, Jesper Larsson Träff:
VieM v1.00 - Vienna Mapping and Sparse Quadratic Assignment User Guide. CoRR abs/1703.05509 (2017) - [i17]Jesper Larsson Träff:
On Optimal Trees for Irregular Gather and Scatter Collectives. CoRR abs/1711.08731 (2017) - [i16]Manuel Pöter, Jesper Larsson Träff:
A new and five older Concurrent Memory Reclamation Schemes in Comparison (Stamp-it). CoRR abs/1712.06134 (2017) - 2016
- [j28]Jesper Larsson Träff:
(Mis)managing parallel computing research through EU project funding. Commun. ACM 59(12): 46-48 (2016) - [j27]Christian Lengauer, Luc Bougé, Jesper Larsson Träff:
Special issue: Euro-Par 2015. Concurr. Comput. Pract. Exp. 28(12): 3445-3446 (2016) - [c95]Sascha Hunold, Alexandra Carpen-Amarie, Felix Donatus Lübbe, Jesper Larsson Träff:
Automatic Verification of Self-consistent MPI Performance Guidelines. Euro-Par 2016: 433-446 - [c94]Robert Ganian, Martin Kalany, Stefan Szeider, Jesper Larsson Träff:
Polynomial-Time Construction of Optimal MPI Derived Datatype Trees. IPDPS 2016: 638-647 - [c93]Jesper Larsson Träff:
A Library for Advanced Datatype Programming. EuroMPI 2016: 98-107 - [c92]Alexandra Carpen-Amarie, Sascha Hunold, Jesper Larsson Träff:
On the Expected and Observed Communication Performance with MPI Derived Datatypes. EuroMPI 2016: 108-120 - [c91]Jakob Gruber, Jesper Larsson Träff, Martin Wimmer:
Brief Announcement: Benchmarking Concurrent Priority Queues. SPAA 2016: 361-362 - [c90]Stefano Markidis, Ivy Bo Peng, Jesper Larsson Träff, Antoine Rougier, Valeria Bartsch, Rui Machado, Mirko Rahn, Alistair Hart, Daniel J. Holmes, Mark Bull, Erwin Laure:
The EPiGRAM Project: Preparing Parallel Programming Models for Exascale. ISC Workshops 2016: 56-68 - [e9]Jack J. Dongarra, Daniel J. Holmes, Antonia B. K. Collis, Jesper Larsson Träff, Lorna Smith:
Proceedings of the 23rd European MPI Users' Group Meeting, EuroMPI 2016, Edinburgh, United Kingdom, September 25-28, 2016. ACM 2016, ISBN 978-1-4503-4234-6 - [i15]Jakob Gruber, Jesper Larsson Träff, Martin Wimmer:
Benchmarking Concurrent Priority Queues: Performance of k-LSM and Related Data Structures. CoRR abs/1603.05047 (2016) - [i14]Sascha Hunold, Alexandra Carpen-Amarie, Felix Donatus Lübbe, Jesper Larsson Träff:
PGMPI: Automatically Verifying Self-Consistent MPI Performance Guidelines. CoRR abs/1606.00215 (2016) - [i13]Jesper Larsson Träff, Alexandra Carpen-Amarie, Sascha Hunold, Antoine Rougier:
Message-Combining Algorithms for Isomorphic, Sparse Collective Communication. CoRR abs/1606.07676 (2016) - [i12]Alexandra Carpen-Amarie, Sascha Hunold, Jesper Larsson Träff:
MPI Derived Datatypes: Performance Expectations and Status Quo. CoRR abs/1607.00178 (2016) - 2015
- [c89]Martin Wimmer, Jakob Gruber, Jesper Larsson Träff, Philippas Tsigas:
The lock-free k-LSM relaxed priority queue. PPoPP 2015: 277-278 - [c88]Martin Kalany, Jesper Larsson Träff:
Efficient, Optimal MPI Datatype Reconstruction for Vector and Index Types. EuroMPI 2015: 5:1-5:10 - [c87]Jesper Larsson Träff, Felix Donatus Lübbe, Antoine Rougier, Sascha Hunold:
Isomorphic, Sparse MPI-like Collective Communication Operations for Parallel Stencil Computations. EuroMPI 2015: 10:1-10:10 - [c86]Jesper Larsson Träff, Felix Donatus Lübbe:
Specification Guideline Violations by MPI_Dims_create. EuroMPI 2015: 19:1-19:2 - [e8]Jesper Larsson Träff, Sascha Hunold, Francesco Versaci:
Euro-Par 2015: Parallel Processing - 21st International Conference on Parallel and Distributed Computing, Vienna, Austria, August 24-28, 2015, Proceedings. Lecture Notes in Computer Science 9233, Springer 2015, ISBN 978-3-662-48095-3 - [i11]Martin Wimmer, Jakob Gruber, Jesper Larsson Träff, Philippas Tsigas:
The Lock-free k-LSM Relaxed Priority Queue. CoRR abs/1503.05698 (2015) - [i10]Robert Ganian, Martin Kalany, Stefan Szeider, Jesper Larsson Träff:
Polynomial-time Construction of Optimal Tree-structured Communication Data Layout Descriptions. CoRR abs/1506.09100 (2015) - [i9]Jesper Larsson Träff:
The Shortest Path Problem with Edge Information Reuse is NP-Complete. CoRR abs/1509.05637 (2015) - 2014
- [j26]Jesper Larsson Träff, Siegfried Benkner:
Selected Papers from EuroMPI 2012 - 19th European MPI Users' Group Meeting. Computing 96(4): 259-261 (2014) - [j25]Christian Siebert, Jesper Larsson Träff:
Perfectly Load-Balanced, Stable, Synchronization-Free Parallel Merge. Parallel Process. Lett. 24(1) (2014) - [c85]Jesper Larsson Träff, Antoine Rougier, Sascha Hunold:
Implementing a classic: zero-copy all-to-all communication with mpi datatypes. ICS 2014: 135-144 - [c84]Martin Wimmer, Francesco Versaci, Jesper Larsson Träff, Daniel Cederman, Philippas Tsigas:
Data structures for task-based priority scheduling. PPoPP 2014: 379-380 - [c83]Jesper Larsson Träff, Antoine Rougier:
MPI Collectives and Datatypes for Hierarchical All-to-all Communication. EuroMPI/ASIA 2014: 27 - [c82]Jesper Larsson Träff:
Optimal MPI Datatype Normalization for Vector and Index-block Types. EuroMPI/ASIA 2014: 33 - [c81]Jesper Larsson Träff, Antoine Rougier:
Zero-copy, Hierarchical Gather is not possible with MPI Datatypes and Collectives. EuroMPI/ASIA 2014: 39 - [c80]Sascha Hunold, Alexandra Carpen-Amarie, Jesper Larsson Träff:
Reproducible MPI Micro-Benchmarking Isn't As Easy As You Think. EuroMPI/ASIA 2014: 69 - [i8]Jesper Larsson Träff, Martin Wimmer:
An improved, easily computable combinatorial lower bound for weighted graph bipartitioning. CoRR abs/1410.0462 (2014) - 2013
- [c79]Martin Wimmer, Manuel Pöter, Jesper Larsson Träff:
The Pheet Task-Scheduling Framework on the Intel® Xeon Phi Coprocessor and other Multicore Architectures. IPDPS Workshops 2013: 1587-1596 - [c78]Martin Wimmer, Daniel Cederman, Jesper Larsson Träff, Philippas Tsigas:
Work-stealing with configurable scheduling strategies. PPoPP 2013: 315-316 - [i7]Christian Siebert, Jesper Larsson Träff:
Perfectly load-balanced, optimal, stable, parallel merge. CoRR abs/1303.4312 (2013) - [i6]Jesper Larsson Träff:
A Note on (Parallel) Depth- and Breadth-First Search by Arc Elimination. CoRR abs/1305.1222 (2013) - [i5]Martin Wimmer, Daniel Cederman, Jesper Larsson Träff, Philippas Tsigas:
Configurable Strategies for Work-stealing. CoRR abs/1305.6474 (2013) - [i4]Sascha Hunold, Jesper Larsson Träff:
On the State and Importance of Reproducible Experimental Research in Parallel Computing. CoRR abs/1308.3648 (2013) - [i3]Martin Wimmer, Daniel Cederman, Francesco Versaci, Jesper Larsson Träff, Philippas Tsigas:
Data Structures for Task-based Priority Scheduling. CoRR abs/1312.2501 (2013) - 2012
- [j24]Torsten Hoefler, Patrick Geoffray, Fabrizio Petrini, Jesper Larsson Träff:
Top Picks from Hot Interconnects 2011: Petascale Network Architectures. IEEE Micro 32(1): 4-7 (2012) - [j23]Jesper Larsson Träff:
Alternative, uniformly expressive and more scalable interfaces for collective communication in MPI. Parallel Comput. 38(1-2): 26-36 (2012) - [c77]Christoph W. Kessler, Usman Dastgeer, Samuel Thibault, Raymond Namyst, Andrew Richards, Uwe Dolinsky, Siegfried Benkner, Jesper Larsson Träff, Sabri Pllana:
Programmability and performance portability aspects of heterogeneous multi-/manycore systems. DATE 2012: 1403-1408 - [c76]Jesper Larsson Träff:
mpicroscope: Towards an MPI Benchmark Tool for Performance Guideline Verification. EuroMPI 2012: 100-109 - [c75]Christian Siebert, Jesper Larsson Träff:
Efficient MPI Implementation of a Parallel, Stable Merge Algorithm. EuroMPI 2012: 204-213 - [c74]Christoph W. Kessler, Usman Dastgeer, Mudassar Majeed, Nathalie Furmento, Samuel Thibault, Raymond Namyst, Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Martin Wimmer:
Abstract: Leveraging PEPPHER Technology for Performance Portable Supercomputing. SC Companion 2012: 1395-1396 - [c73]Christoph W. Keßler, Usman Dastgeer, Mudassar Majeed, Nathalie Furmento, Samuel Thibault, Raymond Namyst, Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Martin Wimmer:
Poster: Leveraging PEPPHER Technology for Performance Portable Supercomputing. SC Companion 2012: 1397 - [e7]Michael Alexander, Pasqua D'Ambra, Adam Belloum, George Bosilca, Mario Cannataro, Marco Danelutto, Beniamino Di Martino, Michael Gerndt, Emmanuel Jeannot, Raymond Namyst, Jean Roman, Stephen L. Scott, Jesper Larsson Träff, Geoffroy Vallée, Josef Weidendorfer:
Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29 - September 2, 2011, Revised Selected Papers, Part I. Lecture Notes in Computer Science 7155, Springer 2012, ISBN 978-3-642-29736-6 - [e6]Michael Alexander, Pasqua D'Ambra, Adam Belloum, George Bosilca, Mario Cannataro, Marco Danelutto, Beniamino Di Martino, Michael Gerndt, Emmanuel Jeannot, Raymond Namyst, Jean Roman, Stephen L. Scott, Jesper Larsson Träff, Geoffroy Vallée, Josef Weidendorfer:
Euro-Par 2011: Parallel Processing Workshops - CCPI, CGWS, HeteroPar, HiBB, HPCVirt, HPPC, HPSS, MDGS, ProPer, Resilience, UCHPC, VHPC, Bordeaux, France, August 29 - September 2, 2011, Revised Selected Papers, Part II. Lecture Notes in Computer Science 7156, Springer 2012, ISBN 978-3-642-29739-7 - [e5]Jesper Larsson Träff, Siegfried Benkner, Jack J. Dongarra:
Recent Advances in the Message Passing Interface - 19th European MPI Users' Group Meeting, EuroMPI 2012, Vienna, Austria, September 23-26, 2012. Proceedings. Lecture Notes in Computer Science 7490, Springer 2012, ISBN 978-3-642-33517-4 - [i2]Jesper Larsson Träff:
Simplified, stable parallel merging. CoRR abs/1202.6575 (2012) - 2011
- [j22]Torsten Hoefler, Rolf Rabenseifner, Hubert Ritzdorf, Bronis R. de Supinski, Rajeev Thakur, Jesper Larsson Träff:
The scalable process topology interface of MPI 2.2. Concurr. Comput. Pract. Exp. 23(4): 293-310 (2011) - [j21]Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Philippas Tsigas, Uwe Dolinsky, Cédric Augonnet, Beverly Bachmayer, Christoph W. Kessler, David Moloney, Vitaly Osipov:
PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems. IEEE Micro 31(5): 28-41 (2011) - [j20]Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff:
MPI on Millions of Cores. Parallel Process. Lett. 21(1): 45-60 (2011) - [c72]Jesper Larsson Träff, Brice Goglin, Ulrich Brüning, Fabrizio Petrini:
Introduction. Euro-Par (2) 2011: 263 - [c71]Jesper Larsson Träff:
A (Radical) Proposal Addressing the Non-scalability of the Irregular MPI Collective Interfaces. IPDPS Workshops 2011: 1199-1207 - [c70]Martin Wimmer, Jesper Larsson Träff:
An Extended Work-Stealing Framework for Mixed-Mode Parallel Applications. IPDPS Workshops 2011: 1683-1690 - [c69]Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Philippas Tsigas, Andrew Richards, Raymond Namyst, Beverly Bachmayer, Christoph W. Kessler, David Moloney, Peter Sanders:
The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures. PARCO 2011: 361-368 - [c68]Enes Bajrovic, Jesper Larsson Träff:
Using MPI Derived Datatypes in Numerical Libraries. EuroMPI 2011: 29-38 - [c67]William Gropp, Torsten Hoefler, Rajeev Thakur, Jesper Larsson Träff:
Performance Expectations and Guidelines for MPI Derived Datatypes. EuroMPI 2011: 150-159 - [c66]