default search action
Hartwig Anzt
- > Home > Persons > Hartwig Anzt
Publications
- 2023
- [c65]Wissam M. Sid-Lakhdar, Sébastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Piotr Luszczek, Mark Gates, Stanimire Tomov, Hans Johansen, David B. Williams-Young, Timothy A. Davis, Jack J. Dongarra, Hartwig Anzt:
PAQR: Pivoting Avoiding QR factorization. IPDPS 2023: 322-332 - [c62]Ahmad Abdelfattah, Stanimire Tomov, Piotr Luszczek, Hartwig Anzt, Jack J. Dongarra:
GPU-based LU Factorization and Solve on Batches of Matrices with Band Structure. SC Workshops 2023: 1670-1679 - [c61]Dalal Sukkari, Mark Gates, Mohammed A. Al Farhan, Hartwig Anzt, Jack J. Dongarra:
Task-Based Polar Decomposition Using SLATE on Massively Parallel Systems with Hardware Accelerators. SC Workshops 2023: 1680-1687 - 2021
- [j30]Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin C. Carson, Terry Cojean, Jack J. Dongarra, Alyson Fox, Mark Gates, Nicholas J. Higham, Xiaoye S. Li, Jennifer A. Loe, Piotr Luszczek, Srikara Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry F. Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov, Yaohung M. Tsai, Ulrike Meier Yang:
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic. Int. J. High Perform. Comput. Appl. 35(4) (2021) - 2020
- [j24]Hartwig Anzt, Terry Cojean, Chen Yen-Chen, Jack J. Dongarra, Goran Flegar, Pratik Nayak, Stanimire Tomov, Yuhsiang M. Tsai, Weichung Wang:
Load-balancing Sparse Matrix Vector Product Kernels on GPUs. ACM Trans. Parallel Comput. 7(1): 2:1-2:26 (2020) - [c45]Piotr Luszczek, Yaohung M. Tsai, Neil Lindquist, Hartwig Anzt, Jack J. Dongarra:
Scalable Data Generation for Evaluating Mixed-Precision Solvers. HPEC 2020: 1-6 - [c44]Hartwig Anzt, Yuhsiang M. Tsai, Ahmad Abdelfattah, Terry Cojean, Jack J. Dongarra:
Evaluating the Performance of NVIDIA's A100 Ampere GPU for Sparse and Batched Computations. PMBS@SC 2020: 26-38 - [i6]Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin C. Carson, Terry Cojean, Jack J. Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Xiaoye Sherry Li, Neil Lindquist, Yang Liu, Jennifer A. Loe, Piotr Luszczek, Pratik Nayak, Srikara Pranesh, Sivasankaran Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen J. Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang:
A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic. CoRR abs/2007.06674 (2020) - 2019
- [j22]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Nicholas J. Higham, Enrique S. Quintana-Ortí:
Adaptive precision in block-Jacobi preconditioning for iterative sparse linear system solvers. Concurr. Comput. Pract. Exp. 31(6) (2019) - [j20]Heike Jagode, Anthony Danalis, Hartwig Anzt, Jack J. Dongarra:
PAPI software-defined events for in-depth performance analysis. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j19]Hartwig Anzt, Jack J. Dongarra, Enrique S. Quintana-Ortí:
Fine-grained bit-flip protection for relaxation methods. J. Comput. Sci. 36 (2019) - [j18]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-size batched Gauss-Jordan elimination for block-Jacobi preconditioning on graphics processors. Parallel Comput. 81: 131-146 (2019) - [c41]Hartwig Anzt, Tobias Ribizel, Goran Flegar, Edmond Chow, Jack J. Dongarra:
ParILUT - A Parallel Threshold ILU for GPUs. IPDPS 2019: 231-241 - [c38]Hartwig Anzt, Yen-Chen Chen, Terry Cojean, Jack J. Dongarra, Goran Flegar, Pratik Nayak, Enrique S. Quintana-Ortí, Yuhsiang M. Tsai, Weichung Wang:
Towards Continuous Benchmarking: An Automated Performance Evaluation Framework for High Performance Software. PASC 2019: 9:1-9:11 - 2018
- [j17]Hartwig Anzt, Moritz Kreutzer, Eduardo Ponce, Gregory D. Peterson, Gerhard Wellein, Jack J. Dongarra:
Optimization and performance evaluation of the IDR iterative Krylov solver on GPUs. Int. J. High Perform. Comput. Appl. 32(2): 220-230 (2018) - [j16]Edmond Chow, Hartwig Anzt, Jennifer A. Scott, Jack J. Dongarra:
Using Jacobi iterations and blocking for solving sparse triangular systems in incomplete factorization preconditioning. J. Parallel Distributed Comput. 119: 219-230 (2018) - [j15]Hartwig Anzt, Thomas K. Huckle, Jürgen Bräckle, Jack J. Dongarra:
Incomplete Sparse Approximate Inverses for Parallel Preconditioning. Parallel Comput. 71: 1-22 (2018) - [j14]Hartwig Anzt, Edmond Chow, Jack J. Dongarra:
ParILUT - A New Parallel Threshold ILU Factorization. SIAM J. Sci. Comput. 40(4): C503-C519 (2018) - [c36]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Thomas Grützmacher:
Variable-Size Batched Condition Number Calculation on GPUs. SBAC-PAD 2018: 132-139 - [c35]Hartwig Anzt, Jack J. Dongarra:
A Jaccard Weights Kernel Leveraging Independent Thread Scheduling on GPUs. SBAC-PAD 2018: 229-232 - 2017
- [j13]Jack J. Dongarra, Stanimire Tomov, Piotr Luszczek, Jakub Kurzak, Mark Gates, Ichitaro Yamazaki, Hartwig Anzt, Azzam Haidar, Ahmad Abdelfattah:
With Extreme Computing, the Rules Have Changed. Comput. Sci. Eng. 19(3): 52-62 (2017) - [j12]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
On the performance and energy efficiency of sparse linear algebra on GPUs. Int. J. High Perform. Comput. Appl. 31(5): 375-390 (2017) - [j11]Hartwig Anzt, Mark Gates, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Preconditioned Krylov solvers on GPUs. Parallel Comput. 68: 32-44 (2017) - [c32]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí, Andrés E. Tomás:
Variable-Size Batched Gauss-Huard for Block-Jacobi Preconditioning. ICCS 2017: 1783-1792 - [c31]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. ICPP 2017: 91-100 - [c30]Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Batched Gauss-Jordan Elimination for Block-Jacobi Preconditioner Generation on GPUs. PMAM@PPoPP 2017: 1-10 - [c28]Hartwig Anzt, Gary Collins, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Flexible batched sparse matrix-vector product on GPUs. ScalA@SC 2017: 3:1-3:8 - [p2]Hartwig Anzt, Jack J. Dongarra, Mark Gates, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Ichitaro Yamazaki:
Bringing High Performance Computing to Big Data Algorithms. Handbook of Big Data Technologies 2017: 777-806 - 2016
- [j10]Ahmad Abdelfattah, Hartwig Anzt, Jack J. Dongarra, Mark Gates, Azzam Haidar, Jakub Kurzak, Piotr Luszczek, Stanimire Tomov, Ichitaro Yamazaki, Asim YarKhan:
Linear algebra software for large-scale accelerated multicore computing. Acta Numer. 25: 1-160 (2016) - [j9]Hartwig Anzt, Edmond Chow, Jens Saak, Jack J. Dongarra:
Updating incomplete factorization preconditioners for model order reduction. Numer. Algorithms 73(3): 611-630 (2016) - [j8]Jakub Kurzak, Hartwig Anzt, Mark Gates, Jack J. Dongarra:
Implementation and Tuning of Batched Cholesky Factorization and Solve for NVIDIA GPUs. IEEE Trans. Parallel Distributed Syst. 27(7): 2036-2048 (2016) - [c27]Chris J. Newburn, Gaurav Bansal, Michael Wood, Luis Crivelli, Judit Planas, Alejandro Duran, Paulo Souza, Leonardo Borges, Piotr Luszczek, Stanimire Tomov, Jack J. Dongarra, Hartwig Anzt, Mark Gates, Azzam Haidar, Yulu Jia, Khairul Kabir, Ichitaro Yamazaki, Jesús Labarta:
Heterogeneous Streaming. IPDPS Workshops 2016: 611-620 - [c26]Hartwig Anzt, Jack J. Dongarra, Moritz Kreutzer, Gerhard Wellein, Martin Koehler:
Efficiency of General Krylov Methods on GPUs - An Experimental Study. IPDPS Workshops 2016: 683-691 - [c25]Hartwig Anzt, Edmond Chow, Thomas Huckle, Jack J. Dongarra:
Batched Generation of Incomplete Sparse Approximate Inverses on GPUs. ScalA@SC 2016: 49-56 - [c24]Hartwig Anzt, Marc Baboulin, Jack J. Dongarra, Yvan Fournier, Frank Hülsemann, Amal Khabou, Yushan Wang:
Accelerating the Conjugate Gradient Algorithm with GPUs in CFD Simulations. VECPAR 2016: 35-43 - [p1]Hartwig Anzt, Edmond Chow, Daniel B. Szyld, Jack J. Dongarra:
Domain Overlap for Iterative Sparse Triangular Solves on GPUs. Software for Exascale Computing 2016: 527-545 - 2015
- [j6]Hartwig Anzt, Blake Haugen, Jakub Kurzak, Piotr Luszczek, Jack J. Dongarra:
Experiences in autotuning matrix multiplication for energy minimization on GPUs. Concurr. Comput. Pract. Exp. 27(17): 5096-5113 (2015) - [j5]Hartwig Anzt, Stanimire Tomov, Piotr Luszczek, William B. Sawyer, Jack J. Dongarra:
Acceleration of GPU-based Krylov solvers via data transfer reduction. Int. J. High Perform. Comput. Appl. 29(3): 366-383 (2015) - [c23]Mark Gates, Hartwig Anzt, Jakub Kurzak, Jack J. Dongarra:
Accelerating collaborative filtering using concepts from high performance computing. IEEE BigData 2015: 667-676 - [c22]Hartwig Anzt, Edmond Chow, Jack J. Dongarra:
Iterative Sparse Triangular Solves for Preconditioning. Euro-Par 2015: 650-661 - [c21]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
Energy efficiency and performance frontiers for sparse computations on GPU supercomputers. PMAM@PPoPP 2015: 1-10 - [c20]Hartwig Anzt, Jack J. Dongarra, Enrique S. Quintana-Ortí:
Tuning stationary iterative solvers for fault resilience. ScalA@SC 2015: 1:1-1:8 - [c19]Hartwig Anzt, Jack J. Dongarra, Enrique S. Quintana-Ortí:
Adaptive precision solvers for sparse linear systems. E2SC@SC 2015: 2:1-2:10 - [c18]Hartwig Anzt, Eduardo Ponce, Gregory D. Peterson, Jack J. Dongarra:
GPU-accelerated co-design of induced dimension reduction: algorithmic fusion and kernel overlap. Co-HPC@SC 2015: 5:1-5:8 - [c17]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
Accelerating the LOBPCG method on GPUs using a blocked sparse matrix vector product. SpringSim (HPS) 2015: 75-82 - [c16]Edmond Chow, Hartwig Anzt, Jack J. Dongarra:
Asynchronous Iterative Algorithm for Computing Incomplete Factorizations on GPUs. ISC 2015: 1-16 - 2014
- [c15]Dimitar Lukarski, Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra:
Hybrid Multi-elimination ILU Preconditioners on GPUs. IPDPS Workshops 2014: 7-16 - [c14]Ichitaro Yamazaki, Hartwig Anzt, Stanimire Tomov, Mark Hoemmen, Jack J. Dongarra:
Improving the Performance of CA-GMRES on Multicores with Multiple GPUs. IPDPS 2014: 382-391 - [c13]Hartwig Anzt, William B. Sawyer, Stanimire Tomov, Piotr Luszczek, Ichitaro Yamazaki, Jack J. Dongarra:
Optimizing Krylov Subspace Solvers on Graphics Processing Units. IPDPS Workshops 2014: 941-949 - [c12]Hartwig Anzt, Dimitar Lukarski, Stanimire Tomov, Jack J. Dongarra:
Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures. VECPAR 2014: 115-123 - 2013
- [j3]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra, Vincent Heuveline:
A block-asynchronous relaxation method for graphics processing units. J. Parallel Distributed Comput. 73(12): 1613-1626 (2013) - 2012
- [c9]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra, Vincent Heuveline:
Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems. Euro-Par Workshops 2012: 145-154 - [c8]Hartwig Anzt, Piotr Luszczek, Jack J. Dongarra, Vincent Heuveline:
GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement. Euro-Par 2012: 908-919 - [c7]Hartwig Anzt, Stanimire Tomov, Jack J. Dongarra, Vincent Heuveline:
A Block-Asynchronous Relaxation Method for Graphics Processing Units. IPDPS Workshops 2012: 113-124 - [c6]Hartwig Anzt, Stanimire Tomov, Mark Gates, Jack J. Dongarra, Vincent Heuveline:
Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems. ICCS 2012: 7-16
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-19 00:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint