default search action
HPEC 2018: Waltham, MA, USA
- 2018 IEEE High Performance Extreme Computing Conference, HPEC 2018, Waltham, MA, USA, September 25-27, 2018. IEEE 2018, ISBN 978-1-5386-5989-2
- Mauro Bisson, Massimiliano Fatica:
Update on Static Graph Challenge on GPU. 1-8 - Henry Kvinge, Elin Farnell, Michael Kirby, Chris Peterson:
Too many secants: a hierarchical approach to secant-based dimensionality reduction on large data sets. 1-7 - Jacob Leemaster, Michael Vai, David Whelihan, Haley Whitman, Roger Khazan:
Functionality and Security Co-design Environment for Embedded Systems. 1-5 - Vikram S. Mailthody, Ketan Date, Zaid Qureshi, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu:
Collaborative (CPU + GPU) Algorithms for Triangle Counting and Truss Decomposition. 1-7 - Alok Tripathy, Oded Green:
Scaling Betweenness Centrality in Dynamic Graphs. 1-7 - Dimitris Floros, Tiancheng Liu, Nikos Pitsianis, Xiaobai Sun:
Sparse Dual of the Density Peaks Algorithm for Cluster Analysis of High-dimensional Data. 1-14 - Louis Jenkins, Tanveer Hossain Bhuiyan, Sarah Harun, Christopher Lightsey, David Mentgen, Sinan G. Aksoy, Timothy Stavcnger, Marcin Zalewski, Hugh R. Medal, Cliff A. Joslyn:
Chapel HyperGraph Library (CHGL). 1-6 - Ahmed Sanaullah, Chen Yang, Yuri Alexeev, Kazutomo Yoshii, Martin C. Herbordt:
Application Aware Tuning of Reconfigurable Multi-Layer Perceptron Architectures. 1-9 - Plamen Krastev, Albert Reuther, Chansup Byun, Michael Chrisp:
A Parallel Implementation of FANO using OpenMP and MPI. 1-5 - Ahsen J. Uppal, H. Howie Huang:
Fast Stochastic Block Partition for Streaming Graphs. 1-6 - Zachary K. Baker, Vinay Ramakrishnaiah, Josh Payne, Jon Woodring, Nicholas Dallmann, William Junor:
Accelerated Aperture Synthesis from Free-flying Collectors. 1-6 - Yehia Arafa, Atanu Barai, Mai Zheng, Abdel-Hameed A. Badawy:
Fault Tolerance Performance Evaluation of Large-Scale Distributed Storage Systems HDFS and Ceph Case Study. 1-7 - Xin Wang, Wei Zhang:
Energy-Efficient DNN Computing on GPUs Through Register File Management. 1-7 - Siddharth Samsi, Vijay Gadepally, Michael B. Hurley, Michael Jones, Edward K. Kao, Sanjeev Mohindra, Paul Monticciolo, Albert Reuther, Steven Thomas Smith, William Song, Diane Staheli, Jeremy Kepner:
GraphChallenge.org: Raising the Bar on Graph Analytic Performance. 1-7 - Rohit Varkey Thankachan, Brian Paul Swenson, James P. Fairbanks:
Performance Effects of Dynamic Graph Data Structures in Community Detection Algorithms. 1-7 - Donghe Kang, Vedang Patel, Kalyan Khandrika, Spyros Blanas, Yang Wang, Srinivasan Parthasarathy:
Characterizing I/O optimization opportunities for array-centric applications on HDFS. 1-2 - Justin Sanchez, Nasim Soltani, Ramachandra Vikas Chamarthi, Adarsh Sawant, Hamed Tabkhi:
A Novel 1D-Convolution Accelerator for Low-Power Real-time CNN processing on the Edge. 1-8 - Hao Wen, Wei Zhang:
Regression Based WCET Analysis For Sampling Based Motion Planning. 1-6 - Austin P. Arechiga, Alan J. Michaels:
The Robustness of Modern Deep Learning Architectures against Single Event Upset Errors. 1-6 - Tianyun Sun, Yizhuang Xie, Bingyi Li, He Chen, Xiaoning Liu, Liang Chen:
Efficient and Flexible 2-D Data Controller for SAR Imaging System. 1-6 - Sayan Ghosh, Mahantesh Halappanavar, Antonino Tumeo, Ananth Kalyanaraman, Assefaw H. Gebremedhin:
Scalable Distributed Memory Community Detection Using Vite. 1-7 - Matthew Overlin, Christopher Smith:
High Performance Computing Techniques with Power Systems Simulations. 1-8 - Sitao Huang, Mohamed El-Hadedy, Cong Hao, Qin Li, Vikram S. Mailthody, Ketan Date, Jinjun Xiong, Deming Chen, Rakesh Nagi, Wen-Mei Hwu:
Triangle Counting and Truss Decomposition using FPGA. 1-7 - Mihailo Isakov, Alan Ehret, Michel A. Kinsy:
Chameleon: A Generalized Reconfigurable Open-Source Architecture for Deep Neural Network Training. 1-7 - Timothy A. Davis:
Graph algorithms via SuiteSparse: GraphBLAS: triangle counting and K-truss. 1-6 - Jordi Ros-Giralt, Alan Commike, Peter Cullen, Richard Lethin:
Accelerating Dijkstra's Algorithm Using Multiresolution Priority Queues. 1-7 - Evan Donato, Ming Ouyang, Cristian Peguero-Isalguez:
Triangle Counting with A Multi-Core Computer. 1-7 - Federico Busato, Oded Green, Nicola Bombieri, David A. Bader:
Hornet: An Efficient Data Structure for Dynamic Sparse Graphs and Matrices on GPUs. 1-7 - Kevin Verma, Chong Peng, Kamil Szewc, Robert Wille:
AMulti-GPU PCISPH Implementation with Efficient Memory Transfers. 1-7 - Nishith Tirpankar, Hari Sundar:
Towards Triangle Counting on GPU using Stable Radix binning. 1-6 - Aditya Gudibanda, Tom Henretty, Muthu Manikandan Baskaran, James R. Ezick, Richard Lethin:
All-at-once Decomposition of Coupled Billion-scale Tensors in Apache Spark. 1-8 - Hao Wen, Wei Zhang:
Exploiting GPU with 3D Stacked Memory to Boost Performance for Data-Intensive Applications. 1-6 - Ryo Matsumiya, Toshio Endo:
Scalable RMA-based Communication Library Featuring Node-local NVMs. 1-7 - Max Carlson, Hari Sundar:
Utilizing GPU Parallelism to Improve Fast Spherical Harmonic Transforms. 1-6 - Qing Dong, Kartik Lakhotia, Hanqing Zeng, Rajgopal Karman, Viktor K. Prasanna, Guna Seetharaman:
A Fast and Efficient Parallel Algorithm for Pruned Landmark Labeling. 1-7 - Siddharth Samsi, Bea Yu, Darrell O. Ricke, Philip Fremont-Smith, Jeremy Kepner, Albert Reuther:
Large-Scale Bayesian Kinship Analysis. 1-4 - Zikun Xiang, Tianqi Wang, Tong Geng, Tian Xiang, Xi Jin, Martin C. Herbordt:
Soft-Core. Multiple-Lane, FPGA-based ADCs for a Liquid Helium Environment. 1-6 - Tong Geng, Erkan Diken, Tianqi Wang, Lech Józwiak, Martin C. Herbordt:
An Access-Pattern-Aware On-Chip Vector Memory System with Automatic Loading for SIMD Architectures. 1-7 - Vijay Gadepally, Jeremy Kepner, Lauren Milechin, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Michael Jones, Peter Michaleas, Julie Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Siddharth Samsi, Albert Reuther:
Hyperscaling Internet Graph Analysis with D4M on the MIT SuperCloud. 1-6 - Jonas Larsson:
Server-class devices for Space Time Adaptive Processing. 1-7 - Andrew Prout, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther, Jeremy Kepner:
Measuring the Impact of Spectre and Meltdown. 1-5 - Tiancheng Liu, Yuchen Qian, Xi Chen, Xiaobai Sun:
Damping Effect on PageRank Distribution. 1-11 - Yanji Chen, Mehmet Güngör, Shweta Singh, Alex Tazin, Mieczyslaw M. Kokar, Miriam Leeser:
Dynamic Deployment of Communication Applications to Different Hardware Platforms using Ontological Representations. 1-6 - John Terragnoli, Miriam Leeser, Paul Monticciolo:
Stripmap SAR Pulse Interleaved Scheduling. 1-7 - Jenna Wise, Emily Lederman, Manoj Kumar, Pratap Pattnaik:
Performance of Graph Analytics Applications on Many-Core Processors. 1-7 - Jianjun Cao, Guojun Lv, Yuling Shang, Nianfeng Weng, Chen Chang, Yi Liu:
An Ensemble Classifier Based on Feature Selection Using Ant Colony Optimization. 1-7 - John D. Leidel, Xi Wang, Yong Chen:
GoblinCore-64: A RISC-V Based Architecture for Data Intensive Computing. 1-8 - Fazle Sadi, Joe Sweeney, Scott McMillan, Tze Meng Low, James C. Hoe, Larry T. Pileggi, Franz Franchetti:
PageRank Acceleration for Large Graphs with Scalable Hardware and Two-Step SpMV. 1-7 - Abdurrahman Yasar, Sivasankaran Rajamanickam, Michael M. Wolf, Jonathan W. Berry, Ümit V. Çatalyürek:
Fast Triangle Counting Using Cilk. 1-7 - Bingyi Li, Changjin Li, Yizhuang Xie, Liang Chen, Hao Shi, Yi Deng:
A SoPC based Fixed Point System for Spaceborne SAR Real-Time Imaging Processing. 1-6 - Amani AlOnazi, Marcin Rogowski, Ahmed Al-Zawawi, David E. Keyes:
Performance Assessment of Hybrid Parallelism for Large-Scale Reservoir Simulation on Multi- and Many-core Architectures. 1-7 - Brian Wheatman, Helen Xu:
Packed Compressed Sparse Row: A Dynamic Graph Representation. 1-7 - Lauren Milechin, Vijay Gadepally, Jeremy Kepner:
Database Operations in D4M.j1. 1-5 - Kaushik Velusamy, Thomas B. Rolinger, Janice McMahon, Tyler A. Simon:
Exploring Parallel Bitonic Sort on a Migratory Thread Architecture. 1-7 - Jiyuan Zhang, Daniele G. Spampinato, Scott McMillan, Franz Franchetti:
Preliminary Exploration of Large-Scale Triangle Counting on Shared-Memory Multicore System. 1-6 - Yang Hu, Hang Liu, H. Howie Huang:
High-Performance Triangle Counting on GPUs. 1-5 - Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter:
Designing Algorithms for the EMU Migrating-threads-based Architecture. 1-7 - Roger Pearce, Geoffrey Sanders:
K-truss decomposition for Scale-Free Graphs at Scale in Distributed Memory. 1-6 - James Hanford, Andrew J. Weinert:
New Computing Frontiers Enabled via Photovoltaic Fiber Energy Generation. 1-7 - Vít Ruzicka, Franz Franchetti:
Fast and accurate object detection in high resolution 4K and 8K video using GPUs. 1-7 - Alyson Fox, Geoffrey Sanders, Andrew Knyazev:
Investigation of Spectral Clustering for Signed Graph Matrix Representations. 1-7 - Ahmad Abdelfattah, Azzam Haidar, Stanimire Tomov, Jack J. Dongarra:
Optimizing GPU Kernels for Irregular Batch Workloads: A Case Study for Cholesky Factorization. 1-7 - Jeremy Kepner, Ron Brightwell, Alan Edelman, Vijay Gadepally, Hayden Jananthan, Michael Jones, Sam Madden, Peter Michaleas, Hamed Okhravi, Kevin T. Pedretti, Albert Reuther, Thomas L. Sterling, Mike Stonebraker:
TabulaROSA: Tabular Operating System Architecture for Massively Parallel Heterogeneous Compute Engines. 1-8 - Christian Palmiero, Giuseppe Di Guglielmo, Luciano Lavagno, Luca P. Carloni:
Design and Implementation of a Dynamic Information Flow Tracking Architecture to Secure a RISC-V Core for IoT Applications. 1-7 - Majid Rasouli, Vidhi Zala, Robert M. Kirby, Hari Sundar:
Improving Performance and Scalability of Algebraic Multigrid through a Specialized MATVEC. 1-7 - Oded Green, James Fox, Alex Watkins, Alok Tripathy, Kasimir Gabert, Euna Kim, Xiaojing An, Kumar Aatish, David A. Bader:
Logarithmic Radix Binning and Vectorized Triangle Counting. 1-7 - Yijie Huangfu, Wei Zhang:
WCET Analysis of GPU L1 Data Caches. 1-7 - Zheming Jin, Hal Finkel:
Evaluating an OpenCL FPGA Platform for HPC: a Case Study with the HACCmk Kernel. 1-6 - Shounak Dhar, David Z. Pan:
GDP: GPU accelerated Detailed Placement. 1-7 - Chun-Yen Kuo, Ching Nam Hang, Pei-Duo Yu, Chee-Wei Tan:
Parallel Counting of Triangles in Large Graphs: Pruning and Hierarchical Clustering Algorithms. 1-6 - Ini Oguntola, Subby Olubeko, Christopher Sweeney:
SlimNets: An Exploration of Deep Model Compression and Acceleration. 1-6 - Albert Reuther, Jeremy Kepner, Chansup Byun, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Anna Klein, Lauren Milechin, Julia S. Mullen, Andrew Prout, Antonio Rosa, Charles Yee, Peter Michaleas:
Interactive Supercomputing on 40, 000 Cores for Machine Learning and Data Analysis. 1-6 - Géraud Krawezik, Peter M. Kogge, Timothy J. Dysart, Shannon K. Kuntz, Janice O. McMahon:
Implementing the Jaccard Index on the Migratory Memory-Side Processing Emu Architecture. 1-6 - Peter Jamieson, Ahmed Sanaullah, Martin C. Herbordt:
Benchmarking Heterogeneous HPC Systems Including Reconfigurable Fabrics: Community Aspirations for Ideal Comparisons. 1-6 - Qingqing Xiong, Emre Ates, Martin C. Herbordt, Ayse K. Coskun:
Tangram: Colocating HPC Applications with Oversubscription. 1-7 - Ahmed Sanaullah, Martin C. Herbordt:
Unlocking Performance-Programmability by Penetrating the Intel FPGA OpenCL Toolflow. 1-8 - Spencer Drakontaidis, Michael Stanchi, Gabriel Glazer, Jason Hussey, Aaron St. Leger, Suzanne J. Matthews:
Towards Energy-Proportional Anomaly Detection in the Smart Grid. 1-7 - Pierre-David Letourneau, Muthu Manikandan Baskaran, Tom Henretty, James R. Ezick, Richard Lethin:
Computationally Efficient CP Tensor Decomposition Update Framework for Emerging Component Discovery in Streaming Data. 1-8 - Steve Roberts, Pradeep Ramanna, John Walthour:
AC922 Data Movement for CORAL. 1-5 - Tze Meng Low, Daniele G. Spampinato, Anurag Kutuluru, Upasana Sridhar, Doru-Thom Popovici, Franz Franchetti, Scott McMillan:
Linear Algebraic Formulation of Edge-centric K-truss Algorithms with Adjacency Matrices. 1-7 - Benjamin W. Priest, Roger Pearce, Geoffrey Sanders:
Estimating Edge-Local Triangle Count Heavy Hitters in Edge-Linear Time and Almost-Vertex-Linear Space. 1-7 - Alessio Conte, Daniele De Sensi, Roberto Grossi, Andrea Marino, Luca Versari:
Discovering $k$-Trusses in Large-Scale Networks. 1-6 - Julien Hascoet, Benoît Dupont de Dinechin, Karol Desnos, Jean-François Nezan:
A Distributed Framework for Low-Latency OpenVX over the RDMA NoC of a Clustered Manycore. 1-7 - Jeremy Kepner, Vijay Gadepally, Hayden Jananthan, Lauren Milechin, Sid Samsi:
Sparse Deep Neural Network Exact Solutions. 1-8 - James Fox, Oded Green, Kasimir Gabert, Xiaojing An, David A. Bader:
Fast and Adaptive List Intersections on the GPU. 1-7 - V. M. Krushnarao Kotteda, Vinod Kumar, William F. Spotz, Daniel Sunderland:
Performance portability of a fluidized bed solver. 1-7 - Michael Jones, Jeremy Kepner, Bradley Orchard, Albert Reuther, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Anna Klein, Lauren Milechin, Julia S. Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Peter Michaleas:
Interactive Launch of 16, 000 Microsoft Windows Instances on a Supercomputer. 1-6 - Mark Barnell, Courtney Raymond, Christopher Capraro, Darrek Isereau, Chris Cicotta, Nathan Stokes:
High-Performance Computing (HPC) and Machine Learning Demonstrated in Flight Using Agile Condor®. 1-4 - Kimberlee Chestnut Chang, Nicole Lane, Andrew Uhmeyer, Michael Jones, Matthew Hubbell, Albert Reuther, Robert Seater:
Simulation Approach to Sensor Placement Using Unity 3D. 1-6
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.