default search action
HPEC 2023: Boston, MA, USA
- IEEE High Performance Extreme Computing Conference, HPEC 2023, Boston, MA, USA, September 25-29, 2023. IEEE 2023, ISBN 979-8-3503-0860-0
- Elaheh Hassani, Md Taufique Hussain, Ariful Azad:
Parallel Algorithms for Computing Jaccard Weights on Graphs using Linear Algebra. 1-7 - Dewang Sun, Naifeng Zhang, Franz Franchetti:
Optimization and Performance Analysis of Shor's Algorithm in Qiskit. 1-7 - Nathaniel Tomczak, Sanmukh Kuppannagari:
Automated Indexing Of TEM Diffraction Patterns Using Machine Learning. 1-7 - Mario Vega, Xiaokun Yang, John Shalf, Doru-Thom Popovici:
Towards a Flexible Hardware Implementation for Mixed-Radix Fourier Transforms. 1-7 - Yevhen Pankevych, Oleg Farenyuk:
High-Level Framework for Solving Systems of the PDEs on Distributed Systems. 1-5 - Yuttapichai Kerdcharoen, Upasana Sridhar, Tze Meng Low:
Exploiting Fusion Opportunities in Linear Algebraic Graph Query Engines. 1-7 - Jeremy Kepner, Michael Jones, Phil Dykstra, Chansup Byun, Timothy Davis, Hayden Jananthan, William Arcand, David Bestor, William Bergeron, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Charles Yee, Peter Michaleas:
Focusing and Calibration of Large Scale Network Sensors Using GraphBLAS Anonymized Hypersparse Matrices. 1-9 - Samuel Wiggins, Yuan Meng, Rajgopal Kannan, Viktor K. Prasanna:
Accelerating Multi-Agent DDPG on CPU-FPGA Heterogeneous Platform. 1-7 - Vasileios Kalantzis, Mark S. Squillante, Chai Wah Wu, Anshul Gupta, Shashanka Ubaru, Tayfun Gokmen, Lior Horesh:
Solving Sparse Linear Systems via Flexible GMRES with In-Memory Analog Preconditioning. 1-7 - Marc Solé, Ivan Rodriguez-Ferrandez, David Steenari, Leonidas Kosmidis:
Acceleration of Synthetic Aperture Radar for On-board Space Systems. 1-7 - Abu Asaduzzaman, Luke Mercer, Md. Raihan Uddin, Yoel Woldeyes:
Modeling and Analyzing Wind Velocity at Entrance Doors to Avoid Accidents. 1-5 - Bruno Silva, Luiz Guerreiro Lopes:
A Massively Parallel BWP Algorithm for Solving Large-Scale Systems of Nonlinear Equations. 1-6 - Michael Vai, David Whelihan, Eric Simpson, Donato Kava, Alice Lee, Huy Nguyen, Jeffrey J. Hughes, Gabriel Torres, Jeffery Lim, Ben Nahill, Roger Khazan, Fred Schneider:
Zero Trust Architecture Approach for Developing Mission Critical Embedded Systems. 1-5 - Jacob Fein-Ashley, Tian Ye, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart:
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition. 1-6 - Frank Pacini, Allison Gunby-Mann, Sarel Cohen, Peter Chin:
ANEDA: Adaptable Node Embeddings for Shortest Path Distance Approximation. 1-7 - Evan Vogelbaum, Rumen Dangovski, Li Jing, Marin Soljacic:
Contextualizing Enhances Gradient Based Meta Learning for Few Shot Image Classification. 1-13 - Soroush Vahidi, Baruch Schieber, Zhihui Du, David A. Bader:
Parallel Longest Common SubSequence Analysis In Chapel. 1-6 - Praneeth Vepakomma, Yulia Kempner, Rodmy Paredes Alfaro, Ramesh Raskar:
Parallel Quasi-Concave Set Function Optimization for Scalability Even Without Submodularity. 1-8 - Bin Lei, Caiwen Ding, Le Chen, Pei-Hung Lin, Chunhua Liao:
Creating a Dataset for High-Performance Computing Code Translation using LLMs: A Bridge Between OpenMP Fortran and C++. 1-7 - Leonardo Fraccaroli, Rosalba Giugno, Samuele Cancellieri, Federico Busato, Nicola Bombieri:
FAST-CON: a Multi-source Approach for Efficient S- T Connectivity on Sparse Graphs. 1-6 - Timothy Chong, Venkata Krishnan:
Addressing Endpoint-Induced Congestion for Accelerator Scale-Out in a Medium-Scale Domain. 1-8 - Justin Deters, Peyton Gozon, Max Camp-Oberhauser, Ron K. Cytron:
Feature-Oriented FSMs for FPGAs. 1-7 - Cody J. Balos, Steven Roberts, David J. Gardner:
Leveraging Mixed Precision in Exponential Time Integration Methods. 1-8 - Benoît Dupont de Dinechin, Julien Hascoët, Orégane Desrentes:
In-Place Multicore SIMD Fast Fourier Transforms. 1-6 - Robert Munafo, Hafsah Shahzad, Ahmed Sanaullah, Sanjay Arora, Ulrich Drepper, Martin C. Herbordt:
Improved Models for Policy-Agent Learning of Compiler Directives in HLS. 1-8 - Khaled Abdelaal, Richard Veras:
A Framework for Analyzing the Robustness of Graph Models. 1-6 - Adam Michaleas, Darrell O. Ricke:
Scalable and Portable Pipelines for Predicting 3D Protein Structures on Standalone and HPC Systems. 1-4 - Stephen J. Young, Joshua Suetterlein, Jesun Firoz, Joseph B. Manzano, Kevin J. Barker:
Finding Your Niche: An Evolutionary Approach to HPC Topologies. 1-9 - Joshua Geyster, Karen Gettings, Paul Monticciolo, Matthew Rebholz:
Leveraging Mathworks Tools to Accelerate the Prototyping of Custom 5G Applications in Hardware. 1-6 - Roy Gulla:
A look into a GraphBLAS Entry Point into an LLVM Lowering Pass, with A Precision Formatting Example. 1-4 - Het Mankad, Sanil Rao, Phillip Colella, Brian van Straalen, Franz Franchetti:
ProtoX: A First Look. 1-6 - Daniel Edelman, Siddharth Samsi, Joseph McDonald, Adam Michaleas, Vijay Gadepally:
An Analysis of Energy Requirement for Computer Vision Algorithms. 1-7 - Kevin Vogt-Lowell, Noah Lee, Theodoros Tsiligkaridis, Marc Vaillant:
Robust Fine-Tuning of Vision-Language Models for Domain Generalization. 1-7 - Vyacheslav Romanov:
UNet Performance with Wafer Scale Engine (Optimization Case Study). 1-6 - Zhibin Wang, Ziheng Meng, Xue Li, Xi Lin, Long Zheng, Chen Tian, Sheng Zhong:
SMOG: Accelerating Subgraph Matching on GPUs. 1-7 - Anthony M. Cabrera, Yigit A. Yucesan, Frank Y. Liu, Willem Blokland, Jeffrey S. Vetter:
Errant Beam Detection Using the AMD Versal ACAP and Vitis AI. 1-6 - Narasinga Rao Miniskar, Mohammad Alaul Haque Monil, Pedro Valero-Lara, Frank Y. Liu, Jeffrey S. Vetter:
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing. 1-7 - Siddharth Samsi, Dan Zhao, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally:
From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference. 1-9 - Hayden Jananthan, Jeremy Kepner, Michael Jones, William Arcand, David Bestor, William Bergeron, Chansup Byun, Timothy Davis, Vijay Gadepally, Daniel Grant, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Guillermo Morales, Andrew Morris, Julie Mullen, Ritesh Patel, Alex Pentland, Sandeep Pisharody, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Tyler Trigg, Gabriel Wachman, Charles Yee, Peter Michaleas:
Mapping of Internet "Coastlines" via Large Scale Anonymized Network Source Correlations. 1-9 - Xiao Zhang, Zhanhong Huang, Xinming Huang:
MAVR: Multi-Functional Point Cloud Annotations Using Virtual Reality. 1-6 - Anthony Angone, Xiaoyuan Liu, Ruslan Shaydulin, Ilya Safro:
Hybrid Quantum-Classical Multilevel Approach for Maximum Cuts on Graphs. 1-7 - Krish Matta, Xiaoyuan Liu, Ilya Safro:
Decomposition Based Refinement for the Network Interdiction Problem. 1-8 - Shohei Minami, Toshio Endo, Akihiro Nomura:
The Aggressive Oversubscribing Scheduling for Interactive Jobs on a Supercomputing System. 1-7 - Alexander E. Siemenn, Tonio Buonassisi:
Decreasing the Computing Time of Bayesian Optimization Using Generalizable Memory Pruning. 1-7 - David A. Bader:
Fast Triangle Counting. 1-6 - Ashish Bisht, Deepika H. V, Haribabu P, S. A. Kumar, S. D. Sudarsan:
A Holistic Optimisation - Success Mantra for HPC Performance. 1-6 - Tal Kadosh, Niranjan Hasabnis, Timothy G. Mattson, Yuval Pinter, Gal Oren:
Quantifying OpenMP: Statistical Insights into Usage and Adoption. 1-7 - Lance G. Fletcher, Trevor Steil, Roger Pearce:
Optimizing a Distributed Graph Data Structure for K - Path Centrality Estimation on HPC. 1-7 - Atharva Gondhalekar, Wu-chun Feng:
On the Three P's of Parallel Programming for Heterogeneous Computing: Performance, Productivity, and Portability. 1-7 - Helen Xu, Tao B. Schardl, Michael Pellauer, Joel S. Emer:
Optimizing Compression Schemes for Parallel Sparse Tensor Algebra. 1-7 - Piotr Sielski, Akif Çördük, Hugo Linsenmaier, Alexandre Fender:
A GPU Parallel Algorithm for Finding a Negative Subset Disjoint Cycle in a Graph. 1-7 - Justin Kawakami, Dominik Zajac, Miriam Leeser:
Selective Encryption of Compressed Image Regions on the Edge with Reconfigurable Hardware. 1-6 - Ahsen J. Uppal, Thomas B. Rolinger, H. Howie Huang:
Decontentioned Stochastic Block Partition. 1-6 - Frank Wanye, Vitaliy Gleyzer, Edward K. Kao, Wu-chun Feng:
An Integrated Approach for Accelerating Stochastic Block Partitioning. 1-7 - Edwin Lee, Michael Parker, Michael Cervantes, Ben Plotner:
Machine Learning at the Edge Using Neural Network Processor. 1-4 - Chansup Byun, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Hayden Jananthan, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Guillermo Morales, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Jeremy Kepner:
pPython Performance Study. 1-7 - Alan Ehret, Peter Moore, Milan Stojkov, Michel A. Kinsy:
Hardware Root-of-Trust Support for Operational Technology Cybersecurity in Critical Infrastructures. 1-7 - Mark Barnell, Courtney Raymond, Lisa Loomis, Darrek Isereau, Daniel Brown, Francesca Vidal, Steven Smiley:
Advanced Ultra Low-Power Deep Learning Applications with Neuromorphic Computing. 1-4 - Chih-Chun Chang, Tsung-Wei Huang:
uSAP: An Ultra-Fast Stochastic Graph Partitioner. 1-7 - Igor Betkier, Mateusz Oszczypala, Janusz Pobozniak, Sergiusz Sobieski:
Performance Analysis of Graph Neural Network (GNN) for Manufacturing Feature Recognition Problem. 1-6 - Marika E. Schubert, David Langerman, Alan D. George:
High-Level Frameworks: Effect on Transformer Inference Time and Power on Embedded GPU Devices. 1-8 - Xin Wang, Wei Zhang:
Build Energy-Efficient GPU Computing Environment for Machine Learning Algorithms with Register File Packing Technique. 1-7 - Syamantak Payra, Gabriel Loke, Yoel Fink, Joseph D. Steinmeyer:
Pruning Binarized Neural Networks Enables Low-Latency, Low-Power FPGA-Based Handwritten Digit Classification. 1-8 - Chang-Hung Wu, Che-Rung Lee:
Multi-Sweep-Line Algorithm for Rectangle Union on GPU and Its Application for VLSI Density Calculation. 1-7 - Ivan Williams, Eric Polizzi:
Automatic Differentiation for Inverse Problems with Applications in Quantum Transport. 1-5 - Yashash Jain, Utsav Banerjee:
Tyche: A Compact and Configurable Accelerator for Scalable Probabilistic Computing on FPGA. 1-7 - Avik Pal, Alan Edelman, Chris Rackauckas:
Continuous Deep Equilibrium Models: Training Neural ODEs Faster by Integrating Them to Infinity. 1-9 - Naifeng Zhang, Austin Ebel, Negar Neda, Patrick Brinich, Benedict Reynwar, Andrew G. Schmidt, Mike Franusich, Jeremy Johnson, Brandon Reagen, Franz Franchetti:
Generating High-Performance Number Theoretic Transform Implementations for Vector Architectures. 1-7 - Heliezer J. D. Espinoza, Jennifer A. Loe, Erik G. Boman:
Fast Spectral Graph Partitioning with a Randomized Eigensolver. 1-7 - Piotr Luszczek, Tokey Tahmid:
Towards the FAIR Asset Tracking Across Models, Datasets, and Performance Evaluation Scenarios. 1-6 - Andy Vidan, Lars H. Fiedler:
A Composable Just-In-Time Programming Framework with LLMs and FBP. 1-8 - Noah Lee, Patrick W. Moore, Laura J. Brattain:
Scalable Deep Learning for Pilot Performance Analysis Using Multimodal Physiological Time Series. 1-6 - Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart:
Accelerating GNN-Based SAR Automatic Target Recognition on HBM-Enabled FPGA. 1-7 - Jonathan Levine, Leonard MacEachern:
Accelerating Training Data Generation Using Optimal Parallelization and Thread Counts. 1-7 - Brody Williams, Yong Chen, Wendy Poole, Stephen W. Poole:
Exploring Challenges Associated with Employing SmartNICs as General-Purpose HPC Accelerators. 1-7 - Kuan-Lin Chiu, Davide Giri, Luca Piccolboni, Luca P. Carloni:
An Analysis of Accelerator Data-Transfer Modes in NoC-Based SoC Architectures. 1-7 - Jian Hu, Matthew Curtis-Maury, Vinay Devadas:
Dynamic Data Partitioning in the WAFL File System. 1-7 - Lakshmi Nair, David Widemann, Brad Turcott, Nick Moore, Alexandra Wleklinski, Darius Bunandar, Ioannis Papavasileiou, Shihu Wang, Eric Logan:
Photonic Accelerators for Image Segmentation in Autonomous Driving and Defect Detection. 1-9 - Shui Jiang, Tsung-Wei Huang, Tsung-Yi Ho:
GLARE: Accelerating Sparse DNN Inference Kernels with Global Memory Access Reduction. 1-7 - Li Jing, Rumen Dangovski, Marin Soljacic:
Asymmetric Grouped Convolutions for Logarithmic Scale Efficient Convolutional Neural Networks. 1-12 - Ming Dun, Xu Zhang, Huawei Cao, Yuan Zhang, Junying Huang, Xiaochun Ye:
Adaptive Sparse Deep Neural Network Inference on Resource-Constrained Cost-Efficient GPUs. 1-7 - S. Biplab Raut:
AOCL-Compression - A High Performance Optimized Lossless Data Compression Library. 1-7 - Yu Gao, Meng Qin, Yibin Ding, Li Zeng, Chaorui Zhang, Weixi Zhang, Wei Han, Rongqian Zhao, Bo Bai:
RaftGP: Random Fast Graph Partitioning. 1-7 - Michael Jones, Jeremy Kepner, Andrew Prout, Timothy Davis, William Arcand, David Bestor, William Bergeron, Chansup Byun, Vijay Gadepally, Micheal Houle, Matthew Hubbell, Hayden Jananthan, Anna Klein, Lauren Milechin, Guillermo Morales, Julie Mullen, Ritesh Patel, Sandeep Pisharody, Albert Reuther, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas:
Deployment of Real-Time Network Traffic Analysis Using GraphBLAS Hypersparse Matrices and D4M Associative Arrays. 1-8 - Nikos Pitsianis, Dimitris Floros, Tiancheng Liu, Xiaobai Sun:
Parallel Clustering with Resolution Variation. 1-8 - Dhruv Parikh, Bingyi Zhang, Rajgopal Kannan, Viktor K. Prasanna, Carl E. Busart:
Performance of Graph Neural Networks for Point Cloud Applications. 1-7 - Jianshen Liu, Carlos Maltzahn, Craig D. Ulmer:
Opportunistic Query Execution on SmartNICs for Analyzing In-Transit Data. 1-7 - Ian Peitzsch, Mark Ciora, Alan D. George:
Multiarchitecture Hardware Acceleration of Hyperdimensional Computing. 1-7 - David A. Bader, Fuhuan Li, Anya Ganeshan, Ahmet Gündogdu, Jason Lew, Oliver Alvarado Rodriguez, Zhihui Du:
Triangle Counting Through Cover-Edges. 1-7 - Kai Huang, Mehmet Güngör, Suranga Handagala, Stratis Ioannidis, Miriam Leeser:
Accelerating Garbled Circuits in the Open Cloud Testbed with Multiple Network-Attached FPGAs. 1-8 - Neelesh Gupta, Pengmiao Zhang, Rajgopal Kannan, Viktor K. Prasanna:
PaCKD: Pattern-Clustered Knowledge Distillation for Compressing Memory Access Prediction Models. 1-7 - Jon Roose, Miheer Vaidya, Ponnuswamy Sadayappan, Sivasankaran Rajamanickam:
TenSQL: An SQL Database Built on GraphBLAS. 1-8 - Ileana Rugina, Rumen Dangovski, Mark Veillette, Pooya Khorrami, Brian Cheung, Olga Simek, Marin Soljacic:
Meta-Learning and Self-Supervised Pretraining for Storm Event Imagery Translation. 1-9 - Shakir Showkat Sofi, Nadezhda Alsahanova:
Image Segmentation with Topological Priors. 1-6 - Li Jing, Lay Jain, Rumen Dangovski, Marin Soljacic:
Manifold Transfer Networks for Lens Distortion Rectification. 1-8 - Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner:
Lincoln AI Computing Survey (LAICS) Update. 1-7 - Abhiram Rao Gorle, Pengmiao Zhang, Rajgopal Kannan, Viktor K. Prasanna:
G-MAP: A Graph Neural Network-Based Framework for Memory Access Prediction. 1-7 - Shachi Khadilkar, Ahmed Sanaullah, Martin Margala:
Quantifying the Gap between Open-Source and Vendor FPGA Place and Route Tools. 1-6 - Dana Diaconu, Yanyue Xie, Mehmet Güngör, Suranga Handagala, Xue Lin, Miriam Leeser:
Machine Learning Across Network-Connected FPGAs. 1-7 - Sadasivan Shankar:
Energy Estimates Across Layers of Computing: From Devices to Large-Scale Applications in Machine Learning for Natural Language Processing, Scientific Computing, and Cryptocurrency Mining1. 1-6 - Oliver Alvarado Rodriguez, Fernando Vera Buschmann, Zhihui Du, David A. Bader:
Property Graphs in Arachne. 1-7
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.