


11th VECPAR 2014: Eugene, OR, USA
- Michel J. Daydé, Osni Marques, Kengo Nakajima: 
 High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30 - July 3, 2014, Revised Selected Papers. Lecture Notes in Computer Science 8969, Springer 2015, ISBN 978-3-319-17352-8
Algorithms for GPU and Manycores
- Langshi Chen, Serge G. Petiton, Leroy Anthony Drummond, Maxime R. Hugues: 
 A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUs. 3-16
- Ichitaro Yamazaki, Stanimire Tomov, Tingxing Dong, Jack J. Dongarra: 
 Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUs. 17-30
- Azzam Haidar, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra: 
 Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments. 31-42
- Fan Ye, Christophe Calvin, Serge G. Petiton: 
 A Study of SpMV Implementation Using MPI and OpenMP on Intel Many-Core Architecture. 43-56
- Masatoshi Kawai, Takeshi Iwashita, Hiroshi Nakashima: 
 SIMD Implementation of a Multiplicative Schwarz Smoother for a Multigrid Poisson Solver on an Intel Xeon Phi Coprocessor. 57-65
- Futoshi Mori, Masaharu Matsumoto, Takashi Furumura: 
 Performance Optimization of the 3D FDM Simulation of Seismic Wave Propagation on the Intel Xeon Phi Coprocessor Using the ppOpen-APPL/FDM Library. 66-76
Large-Scale Applications
- Prasanna Balaprakash, Yuri Alexeev, Sheri A. Mickelson, Sven Leyffer, Robert L. Jacob, Anthony P. Craig: 
 Machine-Learning-Based Load Balancing for Community Ice Code Component in CESM. 79-91
- Timothy B. Costa, David Foster, Malgorzata Peszynska: 
 Domain Decomposition for Heterojunction Problems in Semiconductors. 92-101
- Heidi K. Thornquist, Sivasankaran Rajamanickam: 
 A Hybrid Approach for Parallel Transistor-Level Full-Chip Circuit Simulation. 102-111
Numerical Algorithms
- Hartwig Anzt, Dimitar Lukarski, Stanimire Tomov, Jack J. Dongarra: 
 Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures. 115-123
- Ziming Zheng, Andrew A. Chien, Keita Teranishi: 
 Fault Tolerance in an Inner-Outer Solver: A GVR-Enabled Case Study. 124-132
Direct/Hybrid Methods for Solving Sparse Matrices
- Marc Baboulin, Xiaoye S. Li, François-Henry Rouet: 
 Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct Methods. 135-144
- Joshua Dennis Booth, Padma Raghavan: 
 Hybrid Sparse Linear Solutions with Substituted Factorization. 145-155
- Patrick Amestoy, Jean-Yves L'Excellent, François-Henry Rouet, Wissam M. Sid-Lakhdar: 
 Modeling 1D Distributed-Memory Dense Kernels for an Asynchronous Multifrontal Sparse Solver. 156-169
Performance Tuning
- Steven H. Langer, Ian Karlin, Michael M. Marinak: 
 Performance Characteristics of HYDRA - A Multi-physics Simulation Code from LLNL. 173-181
- Mark Gates, Azzam Haidar, Jack J. Dongarra: 
 Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue Problem. 182-191
- Kenji Ono, Shuichi Chiba, Shunsuke Inoue, Kazuo Minami: 
 Low Byte/Flop Implementation of Iterative Solver for Sparse Matrices Derived from Stencil Computations. 192-205
The Ninth International Workshop on Automatic Performance Tuning
- Yu Lin, Franjo Ivancic, Pallavi Joshi, Gogul Balakrishnan, Malay K. Ganai, Aarti Gupta: 
 Environment-Sensitive Performance Tuning for Distributed Service Orchestration. 209-223
- Shahzeb Siddiqui, Fatemah AlZayer, Saber Feki: 
 Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications. 224-235
- Richard Veras, Franz Franchetti: 
 Capturing the Expert: Generating Fast Matrix-Multiply Kernels with Spiral. 236-244
- Elmar Peise, Paolo Bientinesi: 
 A Study on the Influence of Caching: Sequences of Dense Linear Algebra Kernels. 245-258
- France Boillod-Cerneux, Serge G. Petiton, Christophe Calvin, Leroy Anthony Drummond: 
 Toward Restarting Strategies Tuning for a Krylov Eigenvalue Solver. 259-268
- Takeshi Fukaya, Toshiyuki Imamura, Yusaku Yamamoto: 
 Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm Selection. 269-283
- Takeshi Minami, Motoharu Hibino, Tasuku Hiraishi, Takeshi Iwashita, Hiroshi Nakashima: 
 Automatic Parameter Tuning of Three-Dimensional Tiled FDTD Kernel. 284-297
- Alfian Amrizal, Shoichi Hirasawa, Hiroyuki Takizawa, Hiroaki Kobayashi: 
 Automatic Parameter Tuning of Hierarchical Incremental Checkpointing. 298-309















