default search action
Jeffrey S. Vetter
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
Books and Theses
- 2013
- [b1]Jeffrey S. Vetter:
Contemporary High Performance Computing - From Petascale toward Exascale. Chapman and Hall / CRC computational science series, CRC Press 2013, ISBN 978-1-4665-6834-1, pp. I-XXVII, 1-702
Journal Articles
- 2024
- [j60]William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Large language model evaluation for high-performance computing software development. Concurr. Comput. Pract. Exp. 36(26) (2024) - 2023
- [j59]Jeffrey S. Vetter, Prasanna Date, Farah Fahim, Shruti R. Kulkarni, Petro Maksymovych, A. Alec Talin, Marc González Tallada, Pruek Vanna-Iampikul, Aaron R. Young, David Brooks, Yu Cao, Gu-Yeon Wei, Sung Kyu Lim, Frank Liu, Matthew J. Marinella, Bobby G. Sumpter, Narasinga Rao Miniskar:
Abisko: Deep codesign of an architecture for spiking neural networks using novel neuromorphic materials. Int. J. High Perform. Comput. Appl. 37(3-4): 351-379 (2023) - 2022
- [j58]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Andrew A. Chien, Raymond Bair, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Applications. Comput. Sci. Eng. 24(2): 85-90 (2022) - 2021
- [j57]George Papadimitriou, Cong Wang, Karan Vahi, Rafael Ferreira da Silva, Anirban Mandal, Zhengchun Liu, Rajiv Mayani, Mats Rynge, Mariam Kiran, Vickie E. Lynch, Rajkumar Kettimuthu, Ewa Deelman, Jeffrey S. Vetter, Ian T. Foster:
End-to-end online performance data capture and analysis for scientific workflows. Future Gener. Comput. Syst. 117: 387-400 (2021) - [j56]Jacob Lambert, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
Optimization with the OpenACC-to-FPGA framework on the Arria 10 and Stratix 10 FPGAs. Parallel Comput. 104-105: 102784 (2021) - [j55]Gregory J. Herschlag, Seyong Lee, Jeffrey S. Vetter, Amanda Randles:
Analysis of GPU Data Access Patterns on Complex Geometries for the D3Q19 Lattice Boltzmann Algorithm. IEEE Trans. Parallel Distributed Syst. 32(10): 2400-2414 (2021) - 2019
- [j54]Jungwon Kim, Jeffrey S. Vetter:
Implementing efficient data compression and encryption in a persistent key-value store for HPC. Int. J. High Perform. Comput. Appl. 33(6) (2019) - [j53]Seyong Lee, John Gounley, Amanda Randles, Jeffrey S. Vetter:
Performance portability study for massively parallel computational fluid dynamics application on scalable heterogeneous architectures. J. Parallel Distributed Comput. 129: 1-13 (2019) - 2018
- [j52]Ewa Deelman, Tom Peterka, Ilkay Altintas, Christopher D. Carothers, Kerstin Kleese van Dam, Kenneth Moreland, Manish Parashar, Lavanya Ramakrishnan, Michela Taufer, Jeffrey S. Vetter:
The future of scientific workflows. Int. J. High Perform. Comput. Appl. 32(1): 159-175 (2018) - [j51]Mariam Umar, Shirley V. Moore, Jeremy S. Meredith, Jeffrey S. Vetter, Kirk W. Cameron:
Aspen-based performance and energy modeling frameworks. J. Parallel Distributed Comput. 120: 222-236 (2018) - [j50]Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Jeffrey S. Vetter, Pietro Cicotti, Erwin Laure, Stefano Markidis:
Characterizing the performance benefit of hybrid memory system for HPC applications. Parallel Comput. 76: 57-69 (2018) - 2017
- [j49]Sparsh Mittal, Jeffrey S. Vetter, Lei Jiang:
Addressing Read-Disturbance Issue in STT-RAM by Data Compression and Selective Duplication. IEEE Comput. Archit. Lett. 16(2): 94-98 (2017) - [j48]Ewa Deelman, Christopher D. Carothers, Anirban Mandal, Brian Tierney, Jeffrey S. Vetter, Ilya Baldin, Claris Castillo, Gideon Juve, Dariusz Król, Vickie E. Lynch, Benjamin Mayer, Jeremy S. Meredith, Thomas Proffen, Paul Ruth, Rafael Ferreira da Silva:
PANORAMA: An approach to performance modeling and diagnosis of extreme-scale workflows. Int. J. High Perform. Comput. Appl. 31(1): 4-18 (2017) - [j47]Jeffrey S. Vetter, Erik P. DeBenedictis, Thomas M. Conte:
Architectures for the Post-Moore Era. IEEE Micro 37(4): 6-8 (2017) - 2016
- [j46]Sparsh Mittal, Jeffrey S. Vetter:
Reliability Tradeoffs in Design of Volatile and Nonvolatile Caches. J. Circuits Syst. Comput. 25(11): 1650139:1-1650139:14 (2016) - [j45]Sparsh Mittal, Jeffrey S. Vetter:
A Survey of Techniques for Modeling and Improving Reliability of Computing Systems. IEEE Trans. Parallel Distributed Syst. 27(4): 1226-1238 (2016) - [j44]Sparsh Mittal, Jeffrey S. Vetter:
A Survey Of Architectural Approaches for Data Compression in Cache and Main Memory Systems. IEEE Trans. Parallel Distributed Syst. 27(5): 1524-1536 (2016) - [j43]Sparsh Mittal, Jeffrey S. Vetter:
A Survey of Software Techniques for Using Non-Volatile Memories for Storage and Main Memory Systems. IEEE Trans. Parallel Distributed Syst. 27(5): 1537-1550 (2016) - [j42]Sparsh Mittal, Jeffrey S. Vetter:
A Survey Of Techniques for Architecting DRAM Caches. IEEE Trans. Parallel Distributed Syst. 27(6): 1852-1863 (2016) - [j41]Sparsh Mittal, Jeffrey S. Vetter:
EqualWrites: Reducing Intra-Set Write Variations for Enhancing Lifetime of Non-Volatile Caches. IEEE Trans. Very Large Scale Integr. Syst. 24(1): 103-114 (2016) - 2015
- [j40]Sparsh Mittal, Jeffrey S. Vetter:
AYUSH: A Technique for Extending Lifetime of SRAM-NVM Hybrid Caches. IEEE Comput. Archit. Lett. 14(2): 115-118 (2015) - [j39]Jeffrey S. Vetter, Sparsh Mittal:
Opportunities for Nonvolatile Memory Systems in Extreme-Scale High-Performance Computing. Comput. Sci. Eng. 17(2): 73-82 (2015) - [j38]Sparsh Mittal, Jeffrey S. Vetter:
A Survey of CPU-GPU Heterogeneous Computing Techniques. ACM Comput. Surv. 47(4): 69:1-69:35 (2015) - [j37]Amit Sabne, Putt Sakdhnagool, Seyong Lee, Jeffrey S. Vetter:
Understanding Portability of a High-Level Programming Model on Contemporary Heterogeneous Architectures. IEEE Micro 35(4): 48-58 (2015) - [j36]Kyle L. Spafford, Jeffrey S. Vetter:
Automated Design Space Exploration with Aspen. Sci. Program. 2015: 157305:1-157305:10 (2015) - [j35]Sparsh Mittal, Jeffrey S. Vetter, Dong Li:
A Survey Of Architectural Approaches for Managing Embedded DRAM and Non-Volatile On-Chip Caches. IEEE Trans. Parallel Distributed Syst. 26(6): 1524-1537 (2015) - 2014
- [j34]Anthony Danalis, Piotr Luszczek, Gabriel Marin, Jeffrey S. Vetter, Jack J. Dongarra:
BlackjackBench: Portable Hardware Characterization with Automated Results' Analysis. Comput. J. 57(7): 1002-1016 (2014) - [j33]Yuan Tian, Cong Xu, Weikuan Yu, Jeffrey S. Vetter, Scott Klasky, Honggao Liu, Saad Biaz:
neCODEC: nearline data compression for scientific applications. Clust. Comput. 17(2): 475-486 (2014) - [j32]Sparsh Mittal, Jeffrey S. Vetter:
A Survey of Methods for Analyzing and Improving GPU Energy Efficiency. ACM Comput. Surv. 47(2): 19:1-19:23 (2014) - 2013
- [j31]Kyle Spafford, Jeffrey S. Vetter, Thomas Benson, Mike Parker:
Modeling synthetic aperture radar computation with Aspen. Int. J. High Perform. Comput. Appl. 27(3): 255-262 (2013) - [j30]Jeffrey S. Vetter, Padma Raghavan:
Special Issue: Selected Papers from Super Computing 2012. Sci. Program. 21(3-4): 63-64 (2013) - 2012
- [j29]Yuan Tian, Weikuan Yu, Jeffrey S. Vetter:
RXIO: Design and implementation of high performance RDMA-capable GridFTP. Comput. Electr. Eng. 38(3): 772-784 (2012) - [j28]Vinod Tipparaju, Edoardo Aprà, Weikuan Yu, Xinyu Que, Jeffrey S. Vetter:
Runtime Techniques to Enable a Highly-Scalable Global Address Space Model for Petascale Computing. Int. J. Parallel Program. 40(6): 633-655 (2012) - [j27]Weikuan Yu, Xinyu Que, Vinod Tipparaju, Jeffrey S. Vetter:
HiCOO: Hierarchical cooperation for scalable communication in Global Address Space programming models on Cray XT systems. J. Parallel Distributed Comput. 72(11): 1481-1492 (2012) - [j26]Anthony Danalis, Piotr Luszczek, Gabriel Marin, Jeffrey S. Vetter, Jack J. Dongarra:
BlackjackBench: portable hardware characterization. SIGMETRICS Perform. Evaluation Rev. 40(2): 74-79 (2012) - 2011
- [j25]Jeffrey S. Vetter, Richard Glassbrook, Jack J. Dongarra, Karsten Schwan, Bruce Loftis, Stephen Taylor McNally, Jeremy S. Meredith, James H. Rogers, Philip C. Roth, Kyle Spafford, Sudhakar Yalamanchili:
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community. Comput. Sci. Eng. 13(5): 90-95 (2011) - [j24]Jack J. Dongarra, Peter H. Beckman, Terry Moore, Patrick Aerts, Giovanni Aloisio, Jean-Claude Andre, David Barkai, Jean-Yves Berthou, Taisuke Boku, Bertrand Braunschweig, Franck Cappello, Barbara M. Chapman, Xuebin Chi, Alok N. Choudhary, Sudip S. Dosanjh, Thom H. Dunning, Sandro Fiore, Al Geist, Bill Gropp, Robert J. Harrison, Mark Hereld, Michael A. Heroux, Adolfy Hoisie, Koh Hotta, Zhong Jin, Yutaka Ishikawa, Fred Johnson, Sanjay Kale, Richard Kenway, David E. Keyes, Bill Kramer, Jesús Labarta, Alain Lichnewsky, Thomas Lippert, Bob Lucas, Barney Maccabe, Satoshi Matsuoka, Paul Messina, Peter Michielse, Bernd Mohr, Matthias S. Müller, Wolfgang E. Nagel, Hiroshi Nakashima, Michael E. Papka, Daniel A. Reed, Mitsuhisa Sato, Edward Seidel, John Shalf, David Skinner, Marc Snir, Thomas L. Sterling, Rick Stevens, Frederick H. Streitz, Bob Sugar, Shinji Sumimoto, William M. Tang, John A. Taylor, Rajeev Thakur, Anne E. Trefethen, Mateo Valero, Aad J. van der Steen, Jeffrey S. Vetter, Peg Williams, Robert W. Wisniewski, Katherine A. Yelick:
The International Exascale Software Project roadmap. Int. J. High Perform. Comput. Appl. 25(1): 3-60 (2011) - [j23]Jeremy S. Meredith, Philip C. Roth, Kyle Spafford, Jeffrey S. Vetter:
Performance Implications of Nonuniform Device Topologies in Scalable Heterogeneous Architectures. IEEE Micro 31(5): 66-75 (2011) - 2010
- [j22]Weikuan Yu, Xinyu Que, Vinod Tipparaju, Richard L. Graham, Jeffrey S. Vetter:
Cooperative server clustering for a scalable GAS model on petascale cray XT5 systems. Comput. Sci. Res. Dev. 25(1-2): 57-64 (2010) - [j21]Ken Alvin, Brian Barrett, Ron Brightwell, Sudip S. Dosanjh, Al Geist, K. Scott Hemmert, Michael A. Heroux, Douglas B. Kothe, Richard C. Murphy, Jeff Nichols, Ron A. Oldfield, Arun Rodrigues, Jeffrey S. Vetter:
On the Path to Exascale. Int. J. Distributed Syst. Technol. 1(2): 1-22 (2010) - 2009
- [j20]Rupak Biswas, Leonid Oliker, Jeffrey S. Vetter:
Revolutionary technologies for acceleration of emerging petascale applications. Parallel Comput. 35(3): 117-118 (2009) - [j19]Jeremy S. Meredith, Gonzalo Alvarez, Thomas A. Maier, Thomas C. Schulthess, Jeffrey S. Vetter:
Accuracy and performance of graphics processors: A Quantum Monte Carlo application case study. Parallel Comput. 35(3): 151-163 (2009) - 2008
- [j18]Jack J. Dongarra, Robert B. Graybill, William J. Harrod, Robert F. Lucas, Ewing L. Lusk, Piotr Luszczek, Janice McMahon, Allan Snavely, Jeffrey S. Vetter, Katherine A. Yelick, Sadaf R. Alam, Roy L. Campbell, Laura Carrington, Tzu-Yi Chen, Omid Khalili, Jeremy S. Meredith, Mustafa M. Tikir:
DARPA's HPCS Program- History, Models, Tools, Languages. Adv. Comput. 72: 1-100 (2008) - [j17]Sadaf R. Alam, Richard F. Barrett, Mark R. Fahey, Jeffery A. Kuehn, O. E. Bronson Messer, Richard Tran Mills, Philip C. Roth, Jeffrey S. Vetter, Patrick H. Worley:
An Evaluation of the Oak Ridge National Laboratory Cray XT3. Int. J. High Perform. Comput. Appl. 22(1): 52-80 (2008) - [j16]Sadaf R. Alam, Pratul K. Agarwal, Jeffrey S. Vetter:
Performance characteristics of biomolecular simulations on high-end systems with multi-core processors. Parallel Comput. 34(11): 640-651 (2008) - 2007
- [j15]Sadaf R. Alam, Pratul K. Agarwal, Melissa C. Smith, Jeffrey S. Vetter, David Caliga:
Using FPGA Devices to Accelerate Biomolecular Simulations. Computer 40(3): 66-73 (2007) - [j14]Bernd Mohr, Luiz De Rose, Jeffrey S. Vetter:
A framework for performance analysis of Co-Array Fortran. Concurr. Comput. Pract. Exp. 19(17): 2207-2218 (2007) - [j13]Sadaf R. Alam, Pratul K. Agarwal, Jeffrey S. Vetter, Melissa C. Smith:
Throughput Improvement of Molecular Dynamics Simulations Using Reconfigurable Computing. Scalable Comput. Pract. Exp. 8(4) (2007) - [j12]Eric Grobelny, David Bueno, Ian A. Troxel, Alan D. George, Jeffrey S. Vetter:
FASE: A Framework for Scalable Performance Prediction of HPC Systems and Applications. Simul. 83(10): 721-745 (2007) - 2006
- [j11]Rod Fatoohi, Ken Kardys, Sumy Koshy, Soundarya Sivaramakrishnan, Jeffrey S. Vetter:
Performance evaluation of high-speed interconnects using dense communication patterns. Parallel Comput. 32(11-12): 794-807 (2006) - [j10]Hong Ong, Jeffrey S. Vetter, R. Scott Studham, Collin McCurdy, Bruce Walker, Alan L. Cox:
Kernel-level single system image for petascale computing. ACM SIGOPS Oper. Syst. Rev. 40(2): 50-54 (2006) - 2005
- [j9]Jeffrey S. Vetter, Bronis R. de Supinski, Lynn Kissel, John May, Sheila Vaidya:
Evaluating high-performance computers. Concurr. Pract. Exp. 17(10): 1239-1270 (2005) - [j8]Thomas H. Dunigan Jr., Jeffrey S. Vetter, James B. White III, Patrick H. Worley:
Performance Evaluation of the Cray X1 Distributed Shared-Memory Architecture. IEEE Micro 25(1): 30-40 (2005) - 2003
- [j7]Jeffrey S. Vetter, Frank Mueller:
Communication characteristics of large-scale scientific applications for contemporary cluster architectures. J. Parallel Distributed Comput. 63(9): 853-865 (2003) - 2000
- [j6]Jeffrey S. Vetter, Daniel A. Reed:
Real-Time Performance Monitoring, Adaptive Control, and Interactive Steering of Computational Grids. Int. J. High Perform. Comput. Appl. 14(4): 357-366 (2000) - 1999
- [j5]Jeffrey S. Vetter, Karsten Schwan:
Techniques for high-performance computational steering. IEEE Concurr. 7(4): 63-74 (1999) - 1998
- [j4]Weiming Gu, Greg Eisenhauer, Karsten Schwan, Jeffrey S. Vetter:
Falcon: On-line monitoring for steering parallel programs. Concurr. Pract. Exp. 10(9): 699-736 (1998) - [j3]Beth Plale, Greg Eisenhauer, Karsten Schwan, Jeremy M. Heiner, Vernard Martin, Jeffrey S. Vetter:
From interactive applications to distributed laboratories. IEEE Concurr. 6(2): 78-90 (1998) - 1997
- [j2]Jeffrey S. Vetter:
Computational Steering Annoted Bibliography. ACM SIGPLAN Notices 32(6): 40-44 (1997) - 1994
- [j1]Weiming Gu, Jeffrey S. Vetter, Karsten Schwan:
An annotated bibliography of interactive program steering. ACM SIGPLAN Notices 29(9): 140-148 (1994)
Conference and Workshop Papers
- 2024
- [c216]Norihisa Fujita, Beau Johnston, Narasinga Rao Miniskar, Ryohei Kobayashi, Mohammad Alaul Haque Monil, Keita Teranishi, Seyong Lee, Jeffrey S. Vetter, Taisuke Boku:
CHARM-SYCL & IRIS: A Tool Chain for Performance Portability on Extremely Heterogeneous Systems. e-Science 2024: 1-10 - [c215]Pedro Valero-Lara, Seyong Lee, Joel E. Denny, Keita Teranishi, Jeffrey S. Vetter, Marc González Tallada:
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC. HPC Asia 2024: 23-34 - [c214]Beau Johnston, Narasinga Rao Miniskar, Aaron R. Young, Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter:
IRIS: Exploring Performance Scaling of the Intelligent Runtime System and its Dynamic Scheduling Policies. IPDPS (Workshops) 2024: 58-67 - [c213]Marc González Tallada, Joel E. Denny, Pedro Valero-Lara, Seyong Lee, Keita Teranishi, Jeffrey S. Vetter:
eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages. IPDPS (Workshops) 2024: 667-677 - [c212]Narasinga Rao Miniskar, Seyong Lee, Beau Johnston, Aaron R. Young, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Jeffrey S. Vetter:
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming. WAMTA 2024: 46-58 - [c211]Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Pedro Valero-Lara, Keita Teranishi, Jeffrey S. Vetter:
MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky). WAMTA 2024: 59-70 - 2023
- [c210]Zheming Jin, Jeffrey S. Vetter:
Accelerating Hyperdimensional Classifier with SYCL. CLUSTER Workshops 2023: i-ii - [c209]Zheming Jin, Jeffrey S. Vetter:
Understanding Portability of Automotive Workload: A Case Study with the Points-to-Image Kernel in SYCL on Heterogeneous Computing Platforms. GPGPU@PPoPP 2023: 14-20 - [c208]Anthony M. Cabrera, Yigit A. Yucesan, Frank Y. Liu, Willem Blokland, Jeffrey S. Vetter:
Errant Beam Detection Using the AMD Versal ACAP and Vitis AI. HPEC 2023: 1-6 - [c207]Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Y. Liu, Jeffrey S. Vetter:
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing. HPEC 2023: 1-7 - [c206]Shruti R. Kulkarni, Aaron R. Young, Prasanna Date, Narasinga Rao Miniskar, Jeffrey S. Vetter, Farah Fahim, Benjamin Parpillon, Jennet Dickinson, Nhan Tran, Jieun Yoo, Corrinne Mills, Morris Swartz, Petar Maksimovic, Catherine D. Schuman, Alice Bean:
On-Sensor Data Filtering using Neuromorphic Computing for High Energy Physics Experiments. ICONS 2023: 3:1-3:8 - [c205]Zheming Jin, Jeffrey S. Vetter:
Experience Deploying Graph Applications on GPUs with SYCL. ICPP Workshops 2023: 30-39 - [c204]William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation. ICPP Workshops 2023: 136-144 - [c203]Ahna Wurm, Rebecca Seay, Prasanna Date, Shruti R. Kulkarni, Aaron R. Young, Jeffrey S. Vetter:
Arithmetic Primitives for Efficient Neuromorphic Computing. ICRC 2023: 1-5 - [c202]Zheming Jin, Jeffrey S. Vetter:
Understanding SYCL Portability for Pseudorandom Number Generation: a Case Study with Gene-Expression Connectivity Mapping. IPDPS Workshops 2023: 295-298 - [c201]Zheming Jin, Jeffrey S. Vetter:
Understanding Performance Portability of SYCL Kernels: A Case Study with the All-Pairs Distance Calculation in Bioinformatics on GPUs. IPDPS Workshops 2023: 366-372 - [c200]William F. Godoy, Pedro Valero-Lara, T. Elise Dettling, Christian Trefftz, Ian Jorquera, Thomas Sheehy, Ross G. Miller, Marc González Tallada, Jeffrey S. Vetter, Valentin Churavy:
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes. IPDPS Workshops 2023: 373-382 - [c199]Narasinga Rao Miniskar, Pruek Vanna-Iampikul, Aaron R. Young, Sung Kyu Lim, Frank Liu, Jieun Yoo, Corrinne Mills, Nhan Tran, Farah Fahim, Jeffrey S. Vetter:
A 3D Implementation of Convolutional Neural Network for Fast Inference. ISCAS 2023: 1-5 - [c198]Zheming Jin, Jeffrey S. Vetter:
A Benchmark Suite for Improving Performance Portability of the SYCL Programming Model. ISPASS 2023: 325-327 - [c197]Pedro Valero-Lara, Jeffrey S. Vetter:
A MultiGPU Performance-Portable Solution for Array Programming Based on Kokkos. ARRAY@PLDI 2023: 1-12 - [c196]Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Liu, Jeffrey S. Vetter:
Tiling Framework for Heterogeneous Computing of Matrix based Tiled Algorithms. ExHET@PPoPP 2023: 1:1-1:6 - [c195]Pedro Valero-Lara, Ian Jorquera, Frank Liu, Jeffrey S. Vetter:
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores. SC Workshops 2023: 177-186 - [c194]Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Keita Teranishi, Jeffrey S. Vetter, Pedro Valero-Lara:
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime. SC Workshops 2023: 1081-1092 - [c193]Aristotle X. Martin, Geng Liu, William Ladd, Seyong Lee, John Gounley, Jeffrey S. Vetter, Saumil Patel, Silvio Rizzi, Victor A. Mateevitsi, Joseph A. Insley, Amanda Randles:
Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic Simulations. SC Workshops 2023: 1126-1137 - [c192]Sanil Rao, Mohammad Alaul Haque Monil, Het Mankad, Jeffrey S. Vetter, Franz Franchetti:
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code. SC Workshops 2023: 1635-1641 - [c191]Norihisa Fujita, Beau Johnston, Ryohei Kobayashi, Keita Teranishi, Seyong Lee, Taisuke Boku, Jeffrey S. Vetter:
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types. SC Workshops 2023: 1651-1661 - [c190]Pedro Valero-Lara, Jeffrey S. Vetter, John Gounley, Amanda Randles:
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs. SC Workshops 2023: 1697-1704 - [c189]William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter:
Julia as a unifying end-to-end workflow language on the Frontier exascale system. SC Workshops 2023: 1989-1999 - [c188]Zheming Jin, Jeffrey S. Vetter:
Experience Migrating OpenCL to SYCL: A Case Study on Searches for Potential Off-Target Sites of Cas9 RNA-Guided Endonucleases on AMD GPUs. SOCC 2023: 1-6 - [c187]Taisuke Boku, Ryuta Tsunashima, Ryohei Kobayashi, Norihisa Fujita, Seyong Lee, Jeffrey S. Vetter, Hitoshi Murai, Masahiro Nakao, Miwako Tsuji, Mitsuhisa Sato:
OpenACC Unified Programming Environment for Multi-hybrid Acceleration with GPU and FPGA. ISC Workshops 2023: 662-674 - 2022
- [c186]Zheming Jin, Jeffrey S. Vetter:
Performance portability study of epistasis detection using SYCL on NVIDIA GPU. BCB 2022: 69:1-69:8 - [c185]Zheming Jin, Jeffrey S. Vetter:
Understanding Performance Portability of Bioinformatics Applications in SYCL on an NVIDIA GPU. BIBM 2022: 2190-2195 - [c184]Daniel F. Puleri, Sayan Roychowdhury, Peter Balogh, John Gounley, Erik W. Draeger, Jeff Ames, Adebayo Adebiyi, Simbarashe Chidyagwai, Benjamín Hernández, Seyong Lee, Shirley V. Moore, Jeffrey S. Vetter, Amanda Randles:
High Performance Adaptive Physics Refinement to Enable Large-Scale Tracking of Cancer Cell Trajectory. CLUSTER 2022: 230-242 - [c183]Pedro Valero-Lara, Jungwon Kim, Jeffrey S. Vetter:
A Portable and Heterogeneous LU Factorization on IRIS. Euro-Par Workshops 2022: 17-31 - [c182]Narasinga Rao Miniskar, Aaron R. Young, Frank Liu, Willem Blokland, Anthony M. Cabrera, Jeffrey S. Vetter:
Ultra Low Latency Machine Learning for Scientific Edge Applications. FPL 2022: 1-7 - [c181]Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Liu, Jeffrey S. Vetter:
IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library. HIPC 2022: 256-261 - [c180]Zheming Jin, Jeffrey S. Vetter:
A Study on Atomics-based Integer Sum Reduction in HIP on AMD GPU. ICPP Workshops 2022: 6:1-6:10 - [c179]Prasanna Date, Shruti R. Kulkarni, Aaron R. Young, Catherine D. Schuman, Thomas E. Potok, Jeffrey S. Vetter:
Virtual Neuron: A Neuromorphic Approach for Encoding Numbers. ICRC 2022: 100-105 - [c178]Zheming Jin, Jeffrey S. Vetter:
Integer Sum Reduction with OpenMP on an AMD MI100 GPU. IPDPS Workshops 2022: 496-499 - [c177]Zheming Jin, Jeffrey S. Vetter:
Evaluating Unified Memory Performance in HIP. IPDPS Workshops 2022: 562-568 - [c176]Kazi Asifuzzaman, Mohammad Alaul Haque Monil, Frank Liu, Jeffrey S. Vetter:
Evaluating HPC Kernels for Processing in Memory. MEMSYS 2022: 1:1-1:6 - [c175]Jacob Lambert, Mohammad Alaul Haque Monil, Seyong Lee, Allen D. Malony, Jeffrey S. Vetter:
Leveraging Compiler-Based Translation to Evaluate a Diversity of Exascale Platforms. P3HPC@SC 2022: 14-25 - [c174]Anthony M. Cabrera, Aaron R. Young, Jeffrey S. Vetter:
Design and analysis of CXL performance models for tightly-coupled heterogeneous computing. ExHET@PPOPP 2022: 1:1-1:6 - [c173]Mohammad Alaul Haque Monil, Narasinga Rao Miniskar, Frank Y. Liu, Jeffrey S. Vetter, Pedro Valero-Lara:
LaRIS: Targeting Portability and Productivity for LAPACK Codes on Extreme Heterogeneous Systems by Using IRIS. RSDHA@SC 2022: 12-21 - [c172]Pedro Valero-Lara, Cameron Greenwalt, Jeffrey S. Vetter:
SparseLU, A Novel Algorithm and Math Library for Sparse LU Factorization. IA3@SC 2022: 25-31 - [c171]Aaron R. Young, Narasinga Rao Miniskar, Frank Liu, Willem Blokland, Jeffrey S. Vetter:
Adrastea: An Efficient FPGA Design Environment for Heterogeneous Scientific Computing and Machine Learning. SMC 2022: 227-243 - [c170]Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs. ISC 2022: 233-255 - [c169]Pedro Valero-Lara, Seyong Lee, Marc González Tallada, Joel E. Denny, Jeffrey S. Vetter:
KokkACC: Enhancing Kokkos with OpenACC. WACCPD@SC 2022: 32-42 - [c168]Zheming Jin, Jeffrey S. Vetter:
Evaluating Nonuniform Reduction in HIP and SYCL on GPUs. DRBSD@SC 2022: 37-43 - 2021
- [c167]Pedro Valero-Lara, Jungwon Kim, Oscar R. Hernandez, Jeffrey S. Vetter:
OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems. Euro-Par Workshops 2021: 445-455 - [c166]Leonel Toledo, Pedro Valero-Lara, Jeffrey S. Vetter, Antonio J. Peña:
Static Graphs for Coding Productivity in OpenACC. HiPC 2021: 364-369 - [c165]Anthony M. Cabrera, Seth Hitefield, Jungwon Kim, Seyong Lee, Narasinga Rao Miniskar, Jeffrey S. Vetter:
Toward Performance Portable Programming for Heterogeneous Systems on a Chip: A Case Study with Qualcomm Snapdragon SoC. HPEC 2021: 1-7 - [c164]Jungwon Kim, Seyong Lee, Beau Johnston, Jeffrey S. Vetter:
IRIS: A Portable Runtime System Exploiting Multiple Heterogeneous Programming Systems. HPEC 2021: 1-8 - [c163]Zheming Jin, Jeffrey S. Vetter:
Evaluating the Performance of Integer Sum Reduction in SYCL on GPUs. ICPP Workshops 2021: 34:1-34:8 - [c162]Zheming Jin, Jeffrey S. Vetter:
Evaluating CUDA Portability with HIPCL and DPCT. IPDPS Workshops 2021: 371-376 - [c161]Zheming Jin, Jeffrey S. Vetter:
Evaluating the Performance of Integer Sum Reduction on an Intel GPU. IPDPS Workshops 2021: 652-655 - [c160]Narasinga Rao Miniskar, Frank Liu, Jeffrey S. Vetter:
A Memory Efficient Lock-Free Circular Queue. ISCAS 2021: 1-5 - [c159]Anthony M. Cabrera, Aaron R. Young, Jacob Lambert, Zhili Xiao, Amy An, Seyong Lee, Zheming Jin, Jungwon Kim, Jeremy Buhler, Roger D. Chamberlain, Jeffrey S. Vetter:
Toward Evaluating High-Level Synthesis Portability and Performance between Intel and Xilinx FPGAs. IWOCL 2021: 7:1-7:9 - [c158]Valentin Clement, Jeffrey S. Vetter:
Flacc: Towards OpenACC support for Fortran in the LLVM Ecosystem. LLVM-HPC@SC 2021: 12-19 - [c157]Yuanchao Xu, Mehmet Esat Belviranli, Xipeng Shen, Jeffrey S. Vetter:
PCCS: Processor-Centric Contention-aware Slowdown Model for Heterogeneous System-on-Chips. MICRO 2021: 1282-1295 - [c156]Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
Comparing LLC-Memory Traffic between CPU and GPU Architectures. RSDHA@SC 2021: 8-16 - [c155]Narasinga Rao Miniskar, Frank Liu, Aaron R. Young, Dwaipayan Chakraborty, Jeffrey S. Vetter:
A Hierarchical Task Scheduler for Heterogeneous Computing. ISC 2021: 57-76 - 2020
- [c154]Mohammad Alaul Haque Monil, Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
MEPHESTO: Modeling Energy-Performance in Heterogeneous SoCs and Their Trade-Offs. PACT 2020: 413-425 - [c153]Frank Liu, Narasinga Rao Miniskar, Dwaipayan Chakraborty, Jeffrey S. Vetter:
Deffe: a data-efficient framework for performance characterization in domain-specific computing. CF 2020: 182-191 - [c152]Blaise Tine, Seyong Lee, Jeffrey S. Vetter, Hyesoon Kim:
Productive Hardware Designs using Hybrid HLS-RTL Development. FPGA 2020: 311 - [c151]Blaise Tine, Fares Elsabbagh, Seyong Lee, Jeffrey S. Vetter, Hyesoon Kim:
Cash: A Single-Source Hardware-Software Codesign Framework for Rapid Prototyping. FPGA 2020: 321 - [c150]Jacob Lambert, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
In-Depth Optimization with the OpenACC-to-FPGA Framework on an Arria 10 FPGA. IPDPS Workshops 2020: 460-470 - [c149]Roberto Gioiosa, Burcu Ozcelik Mutlu, Seyong Lee, Jeffrey S. Vetter, Giulio Picierro, Marco Cesati:
The Minos Computing Library: efficient parallel programming for extremely heterogeneous systems. GPGPU@PPoPP 2020: 1-10 - [c148]Camille Coti, Joel E. Denny, Kevin A. Huck, Seyong Lee, Allen D. Malony, Sameer Shende, Jeffrey S. Vetter:
OpenACC Profiling Support for Clang and LLVM using Clacc and TAU. HUST/ProTools@SC 2020: 38-48 - [c147]Beau Johnston, Jeffrey S. Vetter, Josh Milthorpe:
Evaluating the Performance and Portability of Contemporary SYCL Implementations. P3HPC@SC 2020: 45-56 - [c146]Mohammad Alaul Haque Monil, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
Understanding the Impact of Memory Access Patterns in Intel Processors. MCHPC@SC 2020: 52-61 - [c145]Jacob Lambert, Seyong Lee, Jeffrey S. Vetter, Allen D. Malony:
CCAMP: an integrated translation and optimization framework for OpenACC and OpenMP. SC 2020: 98 - 2019
- [c144]Blaise-Pascal Tine, Sudhakar Yalamanchili, Hyesoon Kim, Jeffrey S. Vetter:
POSTER: Tango: An Optimizing Compiler for Just-In-Time RTL Simulation. PACT 2019: 481-482 - [c143]Ivy Bo Peng, Jeffrey S. Vetter, Shirley Moore, Joydeep Rakshit, Stefano Markidis:
Analyzing the suitability of contemporary 3D-stacked PIM architectures for HPC scientific applications. CF 2019: 256-262 - [c142]Jacob Lambert, Seyong Lee, Allen D. Malony, Jeffrey S. Vetter:
CCAMP: OpenMP and OpenACC Interoperable Framework. Euro-Par Workshops 2019: 357-369 - [c141]Jiawen Liu, Dong Li, Gokcen Kestor, Jeffrey S. Vetter:
Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training. IPDPS 2019: 188-199 - [c140]Steven Wei Der Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter:
TensorFlow Doing HPC. IPDPS Workshops 2019: 509-518 - [c139]Forrest Shriver, Seyong Lee, Steven Hamilton, Jeffrey S. Vetter, Justin Watson:
Enhancing Monte Carlo proxy applications on GPUs. PMBS@SC 2019: 30-40 - 2018
- [c138]Ivy Bo Peng, Jeffrey S. Vetter, Shirley V. Moore, Seyong Lee:
Tuyere: enabling scalable memory workloads for system exploration. HPDC 2018: 180-191 - [c137]Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter:
Designing Algorithms for the EMU Migrating-threads-based Architecture. HPEC 2018: 1-7 - [c136]Jacob Lambert, Seyong Lee, Jungwon Kim, Jeffrey S. Vetter, Allen D. Malony:
Directive-Based, High-Level Programming and Optimizations for High-Performance Computing with FPGAs. ICS 2018: 160-171 - [c135]Kaixi Hou, Hao Wang, Wu-chun Feng, Jeffrey S. Vetter, Seyong Lee:
Highly Efficient Compensation-Based Parallelism for Wavefront Loops on GPUs. IPDPS 2018: 276-285 - [c134]Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, Jeffrey S. Vetter:
NVIDIA Tensor Core Programmability, Performance & Precision. IPDPS Workshops 2018: 522-531 - [c133]Gregory Herschlag, Seyong Lee, Jeffrey S. Vetter, Amanda Randles:
GPU Data Access on Complex Geometries for D3Q19 Lattice Boltzmann Method. IPDPS 2018: 825-834 - [c132]Mariam Umar, Shirley V. Moore, Jeffrey S. Vetter, Kirk W. Cameron:
Prometheus: Coherent Exploration of Hardware and Software Optimizations Using Aspen. MASCOTS 2018: 244-250 - [c131]Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Laxmi N. Bhuyan:
Juggler: a dependence-aware task-based execution framework for GPUs. PPoPP 2018: 54-67 - [c130]Pak Markthub, Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Satoshi Matsuoka:
DRAGON: breaking GPU memory capacity limits with direct NVM access. SC 2018: 32:1-32:13 - [c129]Ivy Bo Peng, Jeffrey S. Vetter:
Siena: exploring the design space of heterogeneous memory systems. SC 2018: 33:1-33:14 - 2017
- [c128]Vickie E. Lynch, Jose Borreguero Calvo, Ewa Deelman, Rafael Ferreira da Silva, Monojoy Goswami, Yawei Hui, Eric J. Lingerfelt, Jeffrey S. Vetter:
Distributed workflows for modeling experimental data. HPEC 2017: 1-5 - [c127]Jungwon Kim, Kittisak Sajjapongse, Seyong Lee, Jeffrey S. Vetter:
Design and Implementation of Papyrus: Parallel Aggregate Persistent Storage. IPDPS 2017: 1151-1162 - [c126]Joel Edward Denny, Seyong Lee, Jeffrey S. Vetter:
Language-Based Optimizations for Persistence on Nonvolatile Main Memory Systems. IPDPS 2017: 1163-1173 - [c125]Sparsh Mittal, Rajendra Bishnoi, Fabian Oboril, Haonan Wang, Mehdi Baradaran Tahoori, Adwait Jog, Jeffrey S. Vetter:
Architecting SOT-RAM Based GPU Register File. ISVLSI 2017: 38-44 - [c124]Christopher D. Carothers, Jeremy S. Meredith, Mark P. Blanco, Jeffrey S. Vetter, Misbah Mubarak, Justin M. LaPre, Shirley Moore:
Durango: Scalable Synthetic Workload Generation for Extreme-Scale Application Performance Modeling and Simulation. SIGSIM-PADS 2017: 97-108 - [c123]Jungwon Kim, Seyong Lee, Jeffrey S. Vetter:
PapyrusKV: a high-performance parallel key-value store for distributed NVM architectures. SC 2017: 57 - [c122]Sparsh Mittal, Haonan Wang, Adwait Jog, Jeffrey S. Vetter:
Design and Analysis of Soft-Error Resilience Mechanisms for GPU Register File. VLSID 2017: 409-414 - 2016
- [c121]Sparsh Mittal, Jeffrey S. Vetter:
Reducing Soft-error Vulnerability of Caches using Data Compression. ACM Great Lakes Symposium on VLSI 2016: 197-202 - [c120]Jeffrey S. Vetter:
Preparing for Supercomputing's Sixth Wave. HPDC 2016: 1 - [c119]Joel E. Denny, Seyong Lee, Jeffrey S. Vetter:
NVL-C: Static Analysis Techniques for Efficient, Correct Programming of Non-Volatile Main Memory Systems. HPDC 2016: 125-136 - [c118]Panruo Wu, Dong Li, Zizhong Chen, Jeffrey S. Vetter, Sparsh Mittal:
Algorithm-Directed Data Placement in Explicitly Managed Non-Volatile Memory. HPDC 2016: 141-152 - [c117]Jungwon Kim, Seyong Lee, Jeffrey S. Vetter:
IMPACC: A Tightly Integrated MPI+OpenACC Framework Exploiting Shared Memory Parallelism. HPDC 2016: 189-201 - [c116]Seyong Lee, Jungwon Kim, Jeffrey S. Vetter:
OpenACC to FPGA: A Framework for Directive-Based High-Performance Reconfigurable Computing. IPDPS 2016: 544-554 - [c115]Anirban Mandal, Paul Ruth, Ilya Baldin, Dariusz Król, Gideon Juve, Rajiv Mayani, Rafael Ferreira da Silva, Ewa Deelman, Jeremy S. Meredith, Jeffrey S. Vetter, Vickie E. Lynch, Benjamin Mayer, James Wynne, Mark P. Blanco, Christopher D. Carothers, Justin M. LaPre, Brian Tierney:
Toward an End-to-End Framework for Modeling, Monitoring and Anomaly Detection for Scientific Workflows. IPDPS Workshops 2016: 1370-1379 - [c114]Rishiraj A. Bheda, Thomas M. Conte, Jeffrey S. Vetter:
Improving DRAM Bandwidth Utilization with MLP-Aware OS Paging. MEMSYS 2016: 289-294 - [c113]Mariam Umar, Jeremy S. Meredith, Jeffrey S. Vetter, Kirk W. Cameron:
A Study of Power-Performance Modeling Using a Domain-Specific Language. SBAC-PAD 2016: 84-92 - 2015
- [c112]Matt Poremba, Sparsh Mittal, Dong Li, Jeffrey S. Vetter, Yuan Xie:
DESTINY: a tool for modeling emerging 3D NVM and eDRAM caches. DATE 2015: 1543-1546 - [c111]Philip C. Roth, Jeremy S. Meredith, Jeffrey S. Vetter:
Automated Characterization of Parallel Application Communication Patterns. HPDC 2015: 73-84 - [c110]Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, Jeffrey S. Vetter:
Enabling and Exploiting Flexible Task Assignment on GPU through SM-Centric Program Transformations. ICS 2015: 119-130 - [c109]Seyong Lee, Jeremy S. Meredith, Jeffrey S. Vetter:
COMPASS: A Framework for Automated Performance Modeling and Prediction. ICS 2015: 405-414 - [c108]Sparsh Mittal, Jeffrey S. Vetter:
AYUSH: Extending Lifetime of SRAM-NVM Way-Based Hybrid Caches Using Wear-Leveling. MASCOTS 2015: 112-121 - [c107]Jungwon Kim, Seyong Lee, Jeffrey S. Vetter:
An OpenACC-based unified programming model for multi-accelerator systems. PPoPP 2015: 257-258 - [c106]M. Graham Lopez, Jeffrey S. Young, Jeremy S. Meredith, Philip C. Roth, Mitchel D. Horton, Jeffrey S. Vetter:
Examining recent many-core architectures and programming models using SHOC. PMBS@SC 2015: 3:1-3:12 - [c105]Joel E. Denny, Seyong Lee, Jeffrey S. Vetter:
FITL: extending LLVM for the translation of fault-injection directives. LLVM@SC 2015: 4:1-4:11 - 2014
- [c104]Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, Jeffrey S. Vetter:
SM-centric transformation: circumventing hardware restrictions for flexible GPU scheduling. PACT 2014: 497-498 - [c103]Sparsh Mittal, Jeffrey S. Vetter, Dong Li:
WriteSmoothing: improving lifetime of non-volatile caches using intra-set wear-leveling. ACM Great Lakes Symposium on VLSI 2014: 139-144 - [c102]Sparsh Mittal, Jeffrey S. Vetter, Dong Li:
Improving energy efficiency of embedded DRAM caches for high-end computing systems. HPDC 2014: 99-110 - [c101]Seyong Lee, Jeffrey S. Vetter:
OpenARC: open accelerator research compiler for directive-based, efficient heterogeneous computing. HPDC 2014: 115-120 - [c100]Seyong Lee, Dong Li, Jeffrey S. Vetter:
Interactive Program Debugging and Optimization for Directive-Based, Efficient GPU Computing. IPDPS 2014: 481-490 - [c99]Jeffrey S. Vetter:
AsHES Keynote. IPDPS Workshops 2014: 907 - [c98]Sparsh Mittal, Jeffrey S. Vetter, Dong Li:
LastingNVCache: A Technique for Improving the Lifetime of Non-volatile Caches. ISVLSI 2014: 534-540 - [c97]Amit Sabne, Putt Sakdhnagool, Seyong Lee, Jeffrey S. Vetter:
Evaluating Performance Portability of OpenACC. LCPC 2014: 51-66 - [c96]Sparsh Mittal, Jeffrey S. Vetter:
EqualChance: Addressing Intra-set Write Variation to Increase Lifetime of Non-volatile Caches. INFLOW 2014 - [c95]Seyong Lee, Jeffrey S. Vetter:
OpenARC: extensible OpenACC compiler framework for directive-based accelerator programming study. WACCPD@SC 2014: 1-11 - [c94]Sarat Sreepathi, M. L. Grodowitz, Robert V. Lim, Philip Taffet, Philip C. Roth, Jeremy S. Meredith, Seyong Lee, Dong Li, Jeffrey S. Vetter:
Application characterization using Oxbow toolkit and PADS infrastructure. Co-HPC@SC 2014: 55-63 - [c93]Li Yu, Dong Li, Sparsh Mittal, Jeffrey S. Vetter:
Quantitatively Modeling Application Resilience with the Data Vulnerability Factor. SC 2014: 695-706 - [c92]Jeffrey S. Young, M. Graham Lopez, Mitchel D. Horton, Richard Glassbrook, Jeffrey S. Vetter:
Advanced Application Support for Improved GPU Utilization on Keeneland. XSEDE 2014: 6:1-6:6 - 2013
- [c91]Bin Wang, Bo Wu, Dong Li, Xipeng Shen, Weikuan Yu, Yizheng Jiao, Jeffrey S. Vetter:
Exploring hybrid memory for GPU energy efficiency through software-hardware co-design. PACT 2013: 93-102 - [c90]Dong Li, Seyong Lee, Jeffrey S. Vetter:
Evaluating the Viability of Application-Driven Cooperative CPU/GPU Fault Detection. Euro-Par Workshops 2013: 670-679 - [c89]Sparsh Mittal, Zhao Zhang, Jeffrey S. Vetter:
FlexiWay: A cache energy saving technique using fine-grained cache reconfiguration. ICCD 2013: 100-107 - [c88]Gabriel Marin, Collin McCurdy, Jeffrey S. Vetter:
Diagnosis and optimization of application prefetching performance. ICS 2013: 303-312 - [c87]Bin Wang, Yizheng Jiao, Weikuan Yu, Xipeng Shen, Dong Li, Jeffrey S. Vetter:
A Versatile Performance and Energy Simulation Tool for Composite GPU Global Memory. MASCOTS 2013: 298-302 - [c86]Jeffrey S. Vetter, Seyong Lee, Dong Li, Gabriel Marin, Collin McCurdy, Jeremy S. Meredith, Philip C. Roth, Kyle Spafford:
Quantifying Architectural Requirements of Contemporary Extreme-Scale Scientific Applications. PMBS@SC 2013: 3-24 - [c85]Dong Li, Zizhong Chen, Panruo Wu, Jeffrey S. Vetter:
Rethinking algorithm-based fault tolerance with a cooperative software-hardware approach. SC 2013: 44:1-44:12 - [c84]Collin McCurdy, Gabriel Marin, Jeffrey S. Vetter:
Characterizing the Impact of Prefetching on Scientific Application Performance. PMBS@SC 2013: 115-135 - 2012
- [c83]Vinod Tipparaju, Jeffrey S. Vetter:
GA-GPU: extending a library-based global address spaceprogramming model for scalable heterogeneouscomputing systems. Conf. Computing Frontiers 2012: 53-64 - [c82]Kyle Spafford, Jeremy S. Meredith, Seyong Lee, Dong Li, Philip C. Roth, Jeffrey S. Vetter:
The tradeoffs of fused memory hierarchies in heterogeneous computing architectures. Conf. Computing Frontiers 2012: 103-112 - [c81]Dong Li, Jeffrey S. Vetter, Gabriel Marin, Collin McCurdy, Cristian Cira, Zhuo Liu, Weikuan Yu:
Identifying Opportunities for Byte-Addressable Non-Volatile Memory in Extreme-Scale Scientific Applications. IPDPS 2012: 945-956 - [c80]Anthony Danalis, Collin McCurdy, Jeffrey S. Vetter:
Efficient Quality Threshold Clustering for Parallel Architectures. IPDPS 2012: 1068-1079 - [c79]Zhuo Liu, Bin Wang, Patrick Carpenter, Dong Li, Jeffrey S. Vetter, Weikuan Yu:
PCM-Based Durable Write Cache for Fast Disk I/O. MASCOTS 2012: 451-458 - [c78]Seyong Lee, Jeffrey S. Vetter:
Early evaluation of directive-based GPU programming models for productive exascale computing. SC 2012: 23 - [c77]Dong Li, Jeffrey S. Vetter, Weikuan Yu:
Classifying soft error vulnerabilities in extreme-scale scientific applications using a binary instrumentation tool. SC 2012: 57 - [c76]Kyle Spafford, Jeffrey S. Vetter:
Aspen: a domain specific language for performance modeling. SC 2012: 84 - 2011
- [c75]Kyle Spafford, Jeremy S. Meredith, Jeffrey S. Vetter:
Quantifying NUMA and contention effects in multi-GPU systems. GPGPU 2011: 11 - [c74]Xinyu Que, Weikuan Yu, Vinod Tipparaju, Jeffrey S. Vetter, Bin Wang:
Network-Friendly One-Sided Communication through Multinode Cooperation on Petascale Cray XT5 Systems. CCGRID 2011: 352-361 - [c73]Kyle Spafford, Jeremy S. Meredith, Jeffrey S. Vetter:
Quartile and Outlier Detection on Heterogeneous Clusters Using Distributed Radix Sort. CLUSTER 2011: 412-419 - [c72]Weikuan Yu, Vinod Tipparaju, Xinyu Que, Jeffrey S. Vetter:
Virtual Topologies for Scalable Resource Management and Contention Attenuation in a Global Address Space Model on the Cray XT5. ICPP 2011: 235-244 - [c71]Luiz DeRose, Jeffrey S. Vetter:
MTAAP Introduction. IPDPS Workshops 2011: 1659 - 2010
- [c70]Jeffrey S. Vetter:
Toward exascale computational science with heterogeneous processing. GPGPU 2010: 1 - [c69]Anthony Danalis, Gabriel Marin, Collin McCurdy, Jeremy S. Meredith, Philip C. Roth, Kyle Spafford, Vinod Tipparaju, Jeffrey S. Vetter:
The Scalable Heterogeneous Computing (SHOC) benchmark suite. GPGPU 2010: 63-74 - [c68]Vinod Tipparaju, Edoardo Aprà, Weikuan Yu, Jeffrey S. Vetter:
Enabling a highly-scalable global address space model for petascale computing. Conf. Computing Frontiers 2010: 207-216 - [c67]Kyle Spafford, Jeremy S. Meredith, Jeffrey S. Vetter:
Maestro: Data Orchestration and Tuning for OpenCL Devices. Euro-Par (2) 2010: 275-286 - [c66]Weikuan Yu, Yuan Tian, Jeffrey S. Vetter:
Efficient Zero-Copy Noncontiguous I/O for Globus on InfiniBand. ICPP Workshops 2010: 362-368 - [c65]Luiz De Rose, Jeffrey S. Vetter:
MTAAP 2010 Welcome. IPDPS Workshops 2010: 1 - [c64]Weikuan Yu, Jeffrey S. Vetter:
Initial characterization of parallel NFS implementations. IPDPS Workshops 2010: 1-6 - [c63]Collin McCurdy, Jeffrey S. Vetter:
Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms. ISPASS 2010: 87-96 - [c62]Abtin Rahimian, Ilya Lashuk, Shravan K. Veerapaneni, Aparna Chandramowlishwaran, Dhairya Malhotra, Logan Moon, Rahul S. Sampath, Aashay Shringarpure, Jeffrey S. Vetter, Richard W. Vuduc, Denis Zorin, George Biros:
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures. SC 2010: 1-11 - 2009
- [c61]Kyle Spafford, Jeremy S. Meredith, Jeffrey S. Vetter, Jacqueline Chen, Ray W. Grout, Ramanan Sankaran:
Accelerating S3D: A GPGPU Case Study. Euro-Par Workshops 2009: 122-131 - [c60]Heike Jagode, Jack J. Dongarra, Sadaf R. Alam, Jeffrey S. Vetter, Wyatt Spear, Allen D. Malony:
A Holistic Approach for Performance Measurement and Analysis for Petascale Applications. ICCS (2) 2009: 686-695 - [c59]Weikuan Yu, Oleg Drokin, Jeffrey S. Vetter:
Design, implementation, and evaluation of transparent pNFS on Lustre. IPDPS 2009: 1-9 - 2008
- [c58]Weikuan Yu, Jeffrey S. Vetter:
Xen-Based HPC: A Parallel I/O Perspective. CCGRID 2008: 154-161 - [c57]Weikuan Yu, Sarp Oral, Shane Canon, Jeffrey S. Vetter, Ramanan Sankaran:
Empirical Analysis of a Large-Scale Hierarchical Storage System. Euro-Par 2008: 130-140 - [c56]Jeffrey S. Vetter, Vinod Tipparaju, Weikuan Yu, Philip C. Roth:
HPC Interconnection Networks: The Key to Exascale Computing. High Performance Computing Workshop 2008: 95-106 - [c55]Weikuan Yu, Jeffrey S. Vetter:
ParColl: Partitioned Collective I/O on the Cray XT. ICPP 2008: 562-569 - [c54]Sadaf R. Alam, Pratul K. Agarwal, Scott S. Hampton, Hong Ong, Jeffrey S. Vetter:
Impact of multicores on large-scale molecular dynamics simulations. IPDPS 2008: 1-7 - [c53]Weikuan Yu, Jeffrey S. Vetter, Sarp Oral:
Performance characterization and optimization of parallel I/O on the Cray XT. IPDPS 2008: 1-11 - [c52]Collin McCurdy, Alan L. Cox, Jeffrey S. Vetter:
Investigating the TLB Behavior of High-end Scientific Applications on Commodity Microprocessors. ISPASS 2008: 95-104 - [c51]Weikuan Yu, Nageswara S. V. Rao, Jeffrey S. Vetter:
Experimental Analysis of InfiniBand Transport Services on WAN. NAS 2008: 233-240 - [c50]Charles W. Lively, Valerie E. Taylor, Sadaf R. Alam, Jeffrey S. Vetter:
A Methodology for Developing High Fidelity Communication Models for Large-Scale Applications Targeted on Multicore Systems. SBAC-PAD 2008: 55-62 - [c49]Nageswara S. V. Rao, Weikuan Yu, William R. Wing, Stephen W. Poole, Jeffrey S. Vetter:
Wide-area performance profiling of 10GigE and InfiniBand technologies. SC 2008: 14 - [c48]Sadaf R. Alam, Richard F. Barrett, M. Bast, Mark R. Fahey, Jeffery A. Kuehn, Collin McCurdy, James H. Rogers, Philip C. Roth, Ramanan Sankaran, Jeffrey S. Vetter, Patrick H. Worley, Weikuan Yu:
Early evaluation of IBM BlueGene/P. SC 2008: 23 - 2007
- [c47]Sadaf R. Alam, Jeffrey S. Vetter, Melissa C. Smith:
An Application Specific Memory Characterization Technique for Co-processor Accelerators. ASAP 2007: 353-358 - [c46]Sadaf R. Alam, Nikhil Bhatia, Jeffrey S. Vetter:
Sensitivity Analysis of Biomolecular Simulations using Symbolic Models. BIBE 2007: 294-300 - [c45]Weikuan Yu, Jeffrey S. Vetter, Shane Canon, Song Jiang:
Exploiting Lustre File Joining for Effective Collective IO. CCGRID 2007: 267-274 - [c44]Sadaf R. Alam, Jeremy S. Meredith, Jeffrey S. Vetter:
Balancing productivity and performance on the cell broadband engine. CLUSTER 2007: 149-158 - [c43]Nikhil Bhatia, Jeffrey S. Vetter:
Virtual Cluster Management with Xen. Euro-Par Workshops 2007: 185-194 - [c42]Sadaf R. Alam, Nikhil Bhatia, Jeffrey S. Vetter:
An Exploration of Performance Attributes for Symbolic Modeling of Emerging Processing Devices. HPCC 2007: 683-694 - [c41]Jeremy S. Meredith, Sadaf R. Alam, Jeffrey S. Vetter:
Analysis of a Computational Biology Simulation Technique on Emerging Processing Architectures. IPDPS 2007: 1-8 - [c40]Richard F. Barrett, Sadaf R. Alam, Jeffrey S. Vetter:
Performance evaluation of the cray XT3 configured with dual core opteron processors. PPoPP 2007: 148-149 - 2006
- [c39]Chokchai Leangsuksun, Tirumala Rao, Anand Tikotekar, Stephen L. Scott, Richard Libby, Jeffrey S. Vetter, Yung-Chin Fang, Hong Ong:
IPMI-based Efficient Notification Framework for Large Scale Cluster Computing. CCGRID 2006: 23 - [c38]Jesús Labarta, Bernd Mohr, Allan Snavely, Jeffrey S. Vetter:
Topic 2: Performance Prediction and Evaluation. Euro-Par 2006: 63 - [c37]Sadaf R. Alam, Jeffrey S. Vetter:
Hierarchical Model Validation of Symbolic Performance Models of Scientific Kernels. Euro-Par 2006: 65-77 - [c36]Sadaf R. Alam, Jeffrey S. Vetter:
An Analysis of System Balance Requirements for Scientific Applications. ICPP 2006: 229-236 - [c35]Sadaf R. Alam, Richard F. Barrett, Jeffery A. Kuehn, Philip C. Roth, Jeffrey S. Vetter:
Characterization of Scientific Workloads on Systems with Multi-Core Processors. IISWC 2006: 225-236 - [c34]Sadaf R. Alam, Jeffrey S. Vetter:
A framework to develop symbolic performance models of parallel applications. IPDPS 2006 - [c33]Jeffrey S. Vetter, Sadaf R. Alam, Thomas H. Dunigan Jr., Mark R. Fahey, Philip C. Roth, Patrick H. Worley:
Early evaluation of the Cray XT3. IPDPS 2006 - [c32]Sadaf R. Alam, Jeffrey S. Vetter, Pratul K. Agarwal, Al Geist:
Performance characterization of molecular dynamics techniques for biomolecular simulations. PPoPP 2006: 59-68 - 2005
- [c31]Xuejun Liang, Jeffrey S. Vetter, Melissa C. Smith, Arthur S. Bland:
Balancing FPGA Resource Utilities. ERSA 2005: 156-162 - [c30]Bernd Mohr, Luiz De Rose, Jeffrey S. Vetter:
A Performance Measurement Infrastructure for Co-array Fortran. Euro-Par 2005: 146-155 - [c29]Sadaf R. Alam, Jeffrey S. Vetter:
Performance and Scalability Analysis of Cray X1 Vectorization and Multistreaming Optimization. International Conference on Computational Science (1) 2005: 304-312 - [c28]Thomas H. Dunigan, Jeffrey S. Vetter, Patrick H. Worley:
Performance Evaluation of the SGI Altix 3700. ICPP 2005: 231-240 - [c27]Rod Fatoohi, Ken Kardys, Sumy Koshy, Soundarya Sivaramakrishnan, Jeffrey S. Vetter:
Performance Evaluation of High-Speed Interconnects Using Dense Communication Patterns. ICPP Workshops 2005: 554-561 - [c26]Melissa C. Smith, Jeffrey S. Vetter, Xuejun Liang:
Accelerating Scientific Applications with the SRC-6 Reconfigurable Computer: Methodologies and Analysis. IPDPS 2005 - [c25]Jeffrey S. Vetter, Nikhil Bhatia, E. M. Grobelny, Philip C. Roth:
Capturing Petascale Application Characteristics with the Sequoia Toolkit. PARCO 2005: 917-924 - 2004
- [c24]Wolfgang E. Nagel, Thomas Ludwig, Jeffrey S. Vetter, Lorenzo Donatiello:
Topic 2: Performance Evaluation. Euro-Par 2004: 117-118 - [c23]Thomas H. Dunigan Jr., Jeffrey S. Vetter, Patrick H. Worley:
Performance evaluation of the Cray X1 distributed shared memory architecture. Hot Interconnects 2004: 20-25 - 2002
- [c22]Ivan Corey, John R. Johnson, Jeffrey S. Vetter:
Local Discovery of System Architecture - Application Parameter Sensitivity: An Empirical Technique for Adaptive Grid Applications. HPDC 2002: 399- - [c21]Jeffrey S. Vetter, Frank Mueller:
Communication Characteristics of Large-Scale Scientific Applications for Contemporary Cluster Architectures. IPDPS 2002 - [c20]Narasimha R. Adiga, George Almási, George S. Almási, Yariv Aridor, Rajkishore Barik, Daniel K. Beece, Ralph Bellofatto, Gyan Bhanot, Randy Bickford, Matthias A. Blumrich, Arthur A. Bright, José R. Brunheroto, Calin Cascaval, José G. Castaños, Waiman Chan, Luis Ceze, Paul Coteus, Siddhartha Chatterjee, Dong Chen, George L.-T. Chiu, Thomas M. Cipolla, Paul Crumley, K. M. Desai, Alina Deutsch, Tamar Domany, Marc Boris Dombrowa, Wilm E. Donath, Maria Eleftheriou, C. Christopher Erway, J. Esch, Blake G. Fitch, Joseph Gagliano, Alan Gara, Rahul Garg, Robert S. Germain, Mark Giampapa, Balaji Gopalsamy, John A. Gunnels, Manish Gupta, Fred G. Gustavson, Shawn Hall, Ruud A. Haring, David F. Heidel, Philip Heidelberger, Lorraine Herger, Dirk Hoenicke, R. D. Jackson, T. Jamal-Eddine, Gerard V. Kopcsay, Elie Krevat, Manish P. Kurhekar, Alphonso P. Lanzetta, Derek Lieber, L. K. Liu, M. Lu, Mark P. Mendell, A. Misra, Yosef Moatti, Lawrence S. Mok, José E. Moreira, Ben J. Nathanson, Matthew Newton, Martin Ohmacht, Adam J. Oliner, Vinayaka Pandit, R. B. Pudota, Rick A. Rand, Richard D. Regan, Bradley Rubin, Albert E. Ruehli, Silvius Vasile Rus, Ramendra K. Sahoo, Alda Sanomiya, Eugen Schenfeld, M. Sharma, Edi Shmueli, Sarabjeet Singh, Peilin Song, Vijay Srinivasan, Burkhard D. Steinmacher-Burow, Karin Strauss, Christopher W. Surovic, Richard A. Swetz, Todd Takken, R. Brett Tremaine, Mickey Tsao, Arun R. Umamaheshwaran, P. Verma, Pavlos Vranas, T. J. Christopher Ward, Michael E. Wazlowski, W. Barrett, C. Engel, B. Drehmel, B. Hilgart, D. Hill, F. Kasemkhani, David J. Krolak, Chun-Tao Li, Thomas A. Liebsch, James A. Marcella, A. Muff, A. Okomo, M. Rouse, A. Schram, M. Tubbs, G. Ulsh, Charles D. Wait, J. Wittrup, Myung Bae, Kenneth A. Dockser, Lynn Kissel, Mark K. Seager, Jeffrey S. Vetter, K. Yates:
An overview of the BlueGene/L Supercomputer. SC 2002: 7:1-7:22 - [c19]Dong H. Ahn, Jeffrey S. Vetter:
Scalable analysis techniques for microprocessor performance counter metrics. SC 2002: 12:1-12:16 - [c18]Jeffrey S. Vetter, Patrick H. Worley:
Asserting performance expectations. SC 2002: 15:1-15:13 - [c17]Jeffrey S. Vetter, Andy B. Yoo:
An empirical performance evaluation of scalable scientific applications. SC 2002: 47:1-47:18 - [c16]Jeffrey S. Vetter:
Dynamic statistical profiling of communication activity in distributed applications. SIGMETRICS 2002: 240-250 - 2001
- [c15]Jeffrey S. Vetter, Michael O. McCracken:
Statistical scalability analysis of communication operations in distributed applications. PPoPP 2001: 123-132 - [c14]Jay P. Hoeflinger, Bob Kuhn, Wolfgang E. Nagel, Paul Petersen, Hrabri Rajic, Sanjiv Shah, Jeffrey S. Vetter, Michael Voss, Renee Woo:
An Integrated Performance Visualizer for MPI/OpenMP Programs. WOMPAT 2001: 40-52 - [c13]Jordi Caubet, Judit Giménez, Jesús Labarta, Luiz De Rose, Jeffrey S. Vetter:
A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications. WOMPAT 2001: 53-67 - 2000
- [c12]Daniel A. Reed, Luiz De Rose, Mario Pantano, Jeffrey S. Vetter:
Performance Issues in Parallel Processing Systems. Performance Evaluation 2000: 141-159 - [c11]Jeffrey S. Vetter:
Performance analysis of distributed applications using automatic classification of communication inefficiencies. ICS 2000: 245-254 - [c10]Jeffrey S. Vetter, Bronis R. de Supinski:
Dynamic Software Testing of MPI Applications with Umpire. SC 2000: 51 - 1999
- [c9]Jeffrey S. Vetter, Karsten Schwan:
Optimizations for Language-Directed Computational Steering. IPPS/SPDP 1999: 486-491 - [c8]Jeffrey S. Vetter:
Experiences with Computational Steering on Existing Scientific Applications. PP 1999 - [c7]Jeffrey S. Vetter, Daniel A. Reed:
Managing Performance Analysis with Dynamic Statistical Projection Pursuit. SC 1999: 44 - 1998
- [c6]Eileen T. Kraemer, Jeffrey S. Vetter:
Computational Steering. HICSS (7) 1998: 126 - [c5]Randy L. Ribler, Jeffrey S. Vetter, Huseyin Simitci, Daniel A. Reed:
Autopilot: Adaptive Control of Distributed Applications. HPDC 1998: 172-179 - [c4]Jeffrey S. Vetter, Karsten Schwan:
Techniques for Delayed Binding of Monitoring Mechanisms to Application-Specific Instrumentation Points. ICPP 1998: 477-484 - 1997
- [c3]Jeffrey S. Vetter, Karsten Schwan:
High Performance Computational Steering of Physical Simulations. IPPS 1997: 128- - 1996
- [c2]Jeffrey S. Vetter, Karsten Schwan:
Models for computational steering. CDS 1996: 100-107 - 1995
- [c1]Jeffrey S. Vetter, Karsten Schwan:
Progress: A Toolkit for Interactive Program Steering. ICPP (2) 1995: 139-142
Editorship
- 2015
- [e2]Jackie Kern, Jeffrey S. Vetter:
Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC 2015, Austin, TX, USA, November 15-20, 2015. ACM 2015, ISBN 978-1-4503-3723-6 [contents] - 2011
- [e1]Jeffrey S. Vetter, Madanlal Musuvathi, Xipeng Shen:
Proceedings of the 2011 ACM SIGPLAN workshop on Memory Systems Performance and Correctness: held in conjunction with PLDI '11, San Jose, CA, USA, June 5, 2011. ACM 2011, ISBN 978-1-4503-0794-9 [contents]
Informal and Other Publications
- 2023
- [i13]William F. Godoy, Pedro Valero-Lara, T. Elise Dettling, Christian Trefftz, Ian Jorquera, Thomas Sheehy, Ross G. Miller, Marc González Tallada, Jeffrey S. Vetter, Valentin Churavy:
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes. CoRR abs/2303.06195 (2023) - [i12]William F. Godoy, Pedro Valero-Lara, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation. CoRR abs/2306.15121 (2023) - [i11]Shruti R. Kulkarni, Aaron R. Young, Prasanna Date, Narasinga Rao Miniskar, Jeffrey S. Vetter, Farah Fahim, Benjamin Parpillon, Jennet Dickinson, Nhan Tran, Jieun Yoo, Corrinne Mills, Morris Swartz, Petar Maksimovic, Catherine D. Schuman, Alice Bean:
On-Sensor Data Filtering using Neuromorphic Computing for High Energy Physics Experiments. CoRR abs/2307.11242 (2023) - [i10]Pedro Valero-Lara, Alexis Huante, Mustafa Al-Lail, William F. Godoy, Keita Teranishi, Prasanna Balaprakash, Jeffrey S. Vetter:
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation. CoRR abs/2309.07103 (2023) - [i9]William F. Godoy, Pedro Valero-Lara, Caira Anderson, Katrina W. Lee, Ana Gainaru, Rafael Ferreira da Silva, Jeffrey S. Vetter:
Julia as a unifying end-to-end workflow language on the Frontier exascale system. CoRR abs/2309.10292 (2023) - 2022
- [i8]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Ray Bair, Andrew A. Chien, Jeffrey S. Vetter, John Shalf:
Preparing for the Future - Rethinking Proxy Apps. CoRR abs/2204.07336 (2022) - [i7]Prasanna Date, Shruti R. Kulkarni, Aaron R. Young, Catherine D. Schuman, Thomas E. Potok, Jeffrey S. Vetter:
Encoding Integers and Rationals on Neuromorphic Computers using Virtual Neuron. CoRR abs/2208.07468 (2022) - [i6]Valentin Churavy, William F. Godoy, Carsten Bauer, Hendrik Ranocha, Michael Schlottke-Lakemper, Ludovic Räss, Johannes P. Blaschke, Mosè Giordano, Erik Schnetter, Samuel Omlin, Jeffrey S. Vetter, Alan Edelman:
Bridging HPC Communities through the Julia Programming Language. CoRR abs/2211.02740 (2022) - 2019
- [i5]Steven Wei Der Chien, Stefano Markidis, Vyacheslav Olshevsky, Yaroslav Bulatov, Erwin Laure, Jeffrey S. Vetter:
TensorFlow Doing HPC. CoRR abs/1903.04364 (2019) - 2018
- [i4]Stefano Markidis, Steven Wei Der Chien, Erwin Laure, Ivy Bo Peng, Jeffrey S. Vetter:
NVIDIA Tensor Core Programmability, Performance & Precision. CoRR abs/1803.04014 (2018) - [i3]Jiawen Liu, Dong Li, Gokcen Kestor, Jeffrey S. Vetter:
Runtime Concurrency Control and Operation Scheduling for High Performance Neural Network Training. CoRR abs/1810.08955 (2018) - 2017
- [i2]Anshu Dubey, Paul H. J. Kelly, Bernd Mohr, Jeffrey S. Vetter:
Performance Portability in Extreme Scale Computing (Dagstuhl Seminar 17431). Dagstuhl Reports 7(10): 84-110 (2017) - 2014
- [i1]Sparsh Mittal, Jeffrey S. Vetter:
A Survey of Methods For Analyzing and Improving GPU Energy Efficiency. CoRR abs/1404.4629 (2014)
Coauthor Index
aka: Frank Y. Liu
aka: Kyle L. Spafford
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-11 22:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint