


Остановите войну!
for scientists:


default search action
Wen-Mei W. Hwu
Wen-mei W. Hwu – Wen-Mei Hwu
Person information

- affiliation: University of Illinois at Urbana-Champaign, Department of Electrical and Computer Engineering
- award (1999): Grace Murray Hopper Award
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [c232]Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seungwon Min, Amna Masood, Jeongmin Park, Jinjun Xiong, C. J. Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William J. Dally, Wen-Mei W. Hwu:
GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture. ASPLOS (2) 2023: 325-339 - [i54]Kun Wu, Mert Hidayetoglu, Xiang Song, Sitao Huang, Da Zheng, Israt Nisa, Wen-Mei W. Hwu:
PIGEON: Optimizing CUDA Code Generator for End-to-End Training and Inference of Relational Graph Neural Networks. CoRR abs/2301.06284 (2023) - 2022
- [j81]Omer Anjum, Mohammad Almasri, Simon Garcia de Gonzalo, Wen-Mei W. Hwu:
An efficient GPU implementation and scaling for higher-order 3D stencils. Inf. Sci. 586: 326-343 (2022) - [j80]Xiaofan Zhang
, Yuan Ma
, Jinjun Xiong
, Wen-Mei W. Hwu
, Volodymyr V. Kindratenko
, Deming Chen
:
Exploring HW/SW Co-Design for Video Analysis on CPU-FPGA Heterogeneous Systems. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(6): 1606-1619 (2022) - [j79]Mert Hidayetoglu
, Tekin Biçer
, Simon Garcia de Gonzalo, Bin Ren, Doga Gürsoy
, Rajkumar Kettimuthu, Ian T. Foster
, Wen-Mei W. Hwu
:
MemXCT: Design, Optimization, Scaling, and Reproducibility of X-Ray Tomography Imaging. IEEE Trans. Parallel Distributed Syst. 33(9): 2014-2031 (2022) - [c231]Jie Huang, Kevin Chang, Jinjun Xiong, Wen-Mei Hwu:
Open Relation Modeling: Learning to Define Relations between Entities. ACL (Findings) 2022: 297-308 - [c230]Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu, Wen-Mei Hwu, Izzat El Hajj:
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs. CGO 2022: 1-13 - [c229]Mohammad Almasri, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu:
Parallel K-clique counting on GPUs. ICS 2022: 21:1-21:14 - [c228]Vibhor Dodeja, Mohammad Almasri, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu:
PARSEC: PARallel Subgraph Enumeration in CUDA. IPDPS 2022: 168-178 - [c227]Seungwon Min, Kun Wu, Mert Hidayetoglu, Jinjun Xiong, Xiang Song, Wen-Mei Hwu:
Graph Neural Network Training and Data Tiering. KDD 2022: 3555-3565 - [c226]Xiangdong Wei, Mohamed El-Hadedy, Sergiu Mosanu, Zhengping Zhu, Wen-Mei Hwu, Xinfei Guo:
RECO-HCON: A High-Throughput Reconfigurable Compact ASCON Processor for Trusted IoT. SOCC 2022: 1-6 - [i53]Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu, Wen-Mei W. Hwu, Izzat El Hajj:
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs. CoRR abs/2201.02789 (2022) - [i52]Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seungwon Min, Amna Masood, Jeongmin Park, Jinjun Xiong, C. J. Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William J. Dally, Wen-Mei W. Hwu:
BaM: A Case for Enabling Fine-grain High Throughput GPU-Orchestrated Access to Storage. CoRR abs/2203.04910 (2022) - [i51]Jie Huang, Kerui Zhu, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-Mei Hwu:
DKG: A Descriptive Knowledge Graph for Explaining Relationships between Entities. CoRR abs/2205.10479 (2022) - [i50]Jie Huang, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-Mei Hwu:
Can Language Models Be Specific? How? CoRR abs/2210.05159 (2022) - [i49]Omer Anjum, Alok Kamatar, Toby Liang, Jinjun Xiong, Wen-Mei Hwu:
Submission-Aware Reviewer Profiling for Reviewer Recommender System. CoRR abs/2211.04194 (2022) - [i48]Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-mei W. Hwu:
Parallelizing Maximal Clique Enumeration on GPUs. CoRR abs/2212.01473 (2022) - 2021
- [j78]Seungwon Min, Kun Wu, Sitao Huang, Mert Hidayetoglu
, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei W. Hwu:
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture. Proc. VLDB Endow. 14(11): 2087-2100 (2021) - [j77]Sitao Huang
, Kun Wu, Hyunmin Jeong, Chengyue Wang, Deming Chen
, Wen-Mei Hwu
:
PyLog: An Algorithm-Centric Python-Based FPGA Programming and Synthesis Flow. IEEE Trans. Computers 70(12): 2015-2028 (2021) - [j76]Qin Li, Xiaofan Zhang
, Jinjun Xiong
, Wen-Mei Hwu, Deming Chen
:
Efficient Methods for Mapping Neural Machine Translator on FPGAs. IEEE Trans. Parallel Distributed Syst. 32(7): 1866-1877 (2021) - [c225]Sultan Durrani, Muhammad Saad Chughtai, Mert Hidayetoglu, Rashid Tahir, Abdul Dakkak, Lawrence Rauchwerger, Fareed Zaffar, Wen-Mei W. Hwu:
Accelerating Fourier and Number Theoretic Transforms using Tensor Cores and Warp Shuffles. PACT 2021: 345-355 - [c224]Jie Huang
, Kevin Chang, Jinjun Xiong, Wen-Mei Hwu:
Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach. ACL/IJCNLP (1) 2021: 3641-3651 - [c223]Ashutosh Dhar, Paul Reckamp, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
Graviton: A Reconfigurable Memory-Compute Fabric for Data Intensive Applications. ARC 2021: 254-264 - [c222]Sitao Huang, Aayush Ankit, Plínio Silveira, Rodrigo Antunes, Sai Rahul Chalamalasetti, Izzat El Hajj, Dong Eun Kim, Glaucimar Aguiar, Pedro Bruel, Sergey Serebryakov, Cong Xu, Can Li, Paolo Faraboschi, John Paul Strachan, Deming Chen, Kaushik Roy, Wen-Mei W. Hwu, Dejan S. Milojicic:
Mixed Precision Quantization for ReRAM-based DNN Inference Accelerators. ASP-DAC 2021: 372-377 - [c221]Jiachen Li, Bowen Cheng, Rogério Feris, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Humphrey Shi:
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection. CVPR Workshops 2021: 2378-2387 - [c220]Chengyue Wang, Sitao Huang, Wen-Mei Hwu, Deming Chen:
Extending HLS with High-Level Descriptive Language for Configurable Algorithm-Level Spatial Structure Design. FCCM 2021: 261 - [c219]Sitao Huang, Kun Wu, Hyunmin Jeong, Chengyue Wang, Deming Chen, Wen-Mei Hwu:
PyLog: An Algorithm-Centric Python-Based FPGA Programming and Synthesis Flow. FPGA 2021: 227-228 - [c218]Carl Pearson, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu:
TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes. HPDC 2021: 95-106 - [c217]Mohammad Almasri, Neo Vasudeva, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu:
HyKernel: A Hybrid Selection of One/Two-Phase Kernels for Triangle Counting on GPUs. HPEC 2021: 1-7 - [c216]Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi:
Interpretable Visual Reasoning via Induced Symbolic Space. ICCV 2021: 1858-1867 - [c215]Sultan Durrani, Muhammad Saad Chughtai, Abdul Dakkak, Wen-Mei Hwu, Lawrence Rauchwerger:
FFT blitz: the tensor cores strike back. PPoPP 2021: 488-489 - [c214]Omer Anjum, Mohammad Almasri, Jinjun Xiong, Wen-Mei W. Hwu:
PhraseScope: An Effective and Unsupervised Framework for Mining High Quality Phrases. SDM 2021: 639-647 - [i47]Vikram Sharma Mailthody, James Wei, Nicholas Chen, Mohammad Behnia, Ruihao Yao, Qihao Wang, Vedant Agrawal, Churan He, Lijian Wang, Leihao Chen, Amit Agarwal, Edward Richter, Wen-Mei Hwu, Christopher W. Fletcher, Jinjun Xiong, Andrew Miller, Sanjay Patel:
Safer Illinois and RokWall: Privacy Preserving University Health Apps for COVID-19. CoRR abs/2101.07897 (2021) - [i46]Seungwon Min, Kun Wu, Sitao Huang, Mert Hidayetoglu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-Mei W. Hwu:
PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses. CoRR abs/2101.07956 (2021) - [i45]Seungwon Min, Kun Wu, Sitao Huang, Mert Hidayetoglu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-Mei W. Hwu:
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture. CoRR abs/2103.03330 (2021) - [i44]Mohammad Almasri, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-Mei W. Hwu:
K-Clique Counting on GPUs. CoRR abs/2104.13209 (2021) - [i43]Jiachen Li, Bowen Cheng, Rogério Feris, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Humphrey Shi:
Pseudo-IoU: Improving Label Assignment in Anchor-Free Object Detection. CoRR abs/2104.14082 (2021) - [i42]Jie Huang, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-mei W. Hwu:
Measuring Fine-Grained Domain Relevance of Terms: A Hierarchical Core-Fringe Approach. CoRR abs/2105.13255 (2021) - [i41]Jie Huang
, Kevin Chen-Chuan Chang, Jinjun Xiong, Wen-Mei W. Hwu:
Open Relation Modeling: Learning to Define Relations between Entities. CoRR abs/2108.09241 (2021) - [i40]Yen-Hsiang Chang, Jianhao Pu, Wen-Mei W. Hwu, Jinjun Xiong:
MLHarness: A Scalable Benchmarking System for MLCommons. CoRR abs/2111.05231 (2021) - [i39]Seungwon Min, Kun Wu, Mert Hidayetoglu, Jinjun Xiong, Xiang Song, Wen-mei W. Hwu:
Graph Neural Network Training with Data Tiering. CoRR abs/2111.05894 (2021) - 2020
- [j75]Seungwon Min, Vikram Sharma Mailthody
, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-Mei Hwu:
EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs. Proc. VLDB Endow. 14(2): 114-127 (2020) - [j74]Aayush Ankit
, Izzat El Hajj, Sai Rahul Chalamalasetti, Sapan Agarwal
, Matthew J. Marinella
, Martin Foltin, John Paul Strachan, Dejan S. Milojicic
, Wen-Mei Hwu
, Kaushik Roy
:
PANTHER: A Programmable Architecture for Neural Network Training Harnessing Energy-Efficient ReRAM. IEEE Trans. Computers 69(8): 1128-1142 (2020) - [c213]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei W. Hwu:
The Design and Implementation of a Scalable Deep Learning Benchmarking Platform. CLOUD 2020: 414-425 - [c212]Abdul Dakkak, Tom Wickham-Jones, Wen-Mei Hwu:
The design and implementation of the wolfram language compiler. CGO 2020: 212-228 - [c211]Omer Anjum, Chak Ho Chan, Tanitpong Lawphongpanich, Yucheng Liang, Tianyi Tang, Shuchen Zhang, Wen-Mei Hwu, Jinjun Xiong, Sanjay Patel:
Vertext: An End-to-end AI Powered Conversation Management System for Multi-party Chat Platforms. CSCW Companion 2020: 1-6 - [c210]Zhonghao Wang, Yunchao Wei, Rogério Schmidt Feris, Jinjun Xiong, Wen-Mei W. Hwu, Thomas S. Huang, Honghui Shi:
Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation. CVPR Workshops 2020: 4043-4047 - [c209]Zhonghao Wang, Mo Yu, Yunchao Wei, Rogério Feris, Jinjun Xiong, Wen-Mei Hwu, Thomas S. Huang, Honghui Shi:
Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation. CVPR 2020: 12632-12641 - [c208]Yuhong Li, Cong Hao, Xiaofan Zhang, Xinheng Liu, Yao Chen
, Jinjun Xiong, Wen-mei W. Hwu, Deming Chen:
EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions. DAC 2020: 1-6 - [c207]Jie Huang
, Zilong Wang, Kevin Chang, Wen-Mei Hwu, Jinjun Xiong:
Exploring Semantic Capacity of Terms. EMNLP (1) 2020: 8509-8518 - [c206]Cong Hao, Yao Chen
, Xiaofan Zhang, Yuhong Li, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices. ACM Great Lakes Symposium on VLSI 2020: 283-290 - [c205]Mert Hidayetoglu
, Carl Pearson, Vikram Sharma Mailthody
, Eiman Ebrahimi, Jinjun Xiong, Rakesh Nagi, Wen-Mei Hwu:
At-Scale Sparse Deep Neural Network Inference With Efficient GPU Implementation. HPEC 2020: 1-7 - [c204]Xiaofan Zhang, Hanchen Ye, Junsong Wang, Yonghua Lin, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
DNNExplorer: A Framework for Modeling and Exploring a Novel Paradigm of FPGA-based DNN Accelerator. ICCAD 2020: 61:1-61:9 - [c203]Mohamed El-Hadedy, Martin Margala, Sergiu Mosanu, Danilo Gligoroski, Jinjun Xiong, Wen-Mei Hwu:
Micro - GAGE: A Low-power Compact GAGE Hash Function Processor for IoT Applications. ICECS 2020: 1-4 - [c202]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wei Wei, Lingjie Xu, Wen-Mei Hwu:
XSP: Across-Stack Profiling and Analysis of Machine Learning Models on GPUs. IPDPS 2020: 326-327 - [c201]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei Hwu:
Benanza: Automatic μBenchmark Generation to Compute "Lower-bound" Latency and Inform Optimizations of Deep Learning Models on GPUs. IPDPS 2020: 440-450 - [c200]Carl Pearson, Mert Hidayetoglu
, Mohammad Almasri, Omer Anjum, I-Hsin Chung, Jinjun Xiong, Wen-Mei W. Hwu:
Node-Aware Stencil Communication for Heterogeneous Supercomputers. IPDPS Workshops 2020: 796-805 - [c199]Wen-Mei Hwu:
Advancing Computing Infrastructure for Very Large-Scale Deep Learning at C3SR. IPDPS Workshops 2020: 989 - [c198]Ashutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong, Jian Huang, Wen-Mei W. Hwu, Nam Sung Kim, Deming Chen:
FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level Cache. MICRO 2020: 102-117 - [c197]Xiaofan Zhang, Haoming Lu, Cong Hao, Jiachen Li, Bowen Cheng, Yuhong Li, Kyle Rupnow, Jinjun Xiong, Thomas S. Huang, Honghui Shi, Wen-Mei Hwu, Deming Chen:
SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems. MLSys 2020 - [c196]Abdul Dakkak, Cheng Li, Jinjun Xiong, Wen-mei W. Hwu:
DLSpec: A Deep Learning Task Exchange Specification. OpML 2020 - [c195]Mert Hidayetoglu
, Tekin Bicer, Simon Garcia De Gonzalo, Bin Ren, Vincent De Andrade, Doga Gürsoy, Raj Kettimuthu, Ian T. Foster, Wen-mei W. Hwu:
Petascale XCT: 3D image reconstruction with hierarchical communications on multi-GPU nodes. SC 2020: 37 - [c194]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei Hwu:
DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs. ICPE 2020: 202-209 - [e5]Eduard Ayguadé, Wen-mei W. Hwu, Rosa M. Badia, H. Peter Hofstee:
ICS '20: 2020 International Conference on Supercomputing, Barcelona Spain, June, 2020. ACM 2020, ISBN 978-1-4503-7983-0 [contents] - [i38]Abdul Dakkak, Cheng Li, Jinjun Xiong, Wen-Mei Hwu:
MLModelScope: A Distributed Platform for Model Evaluation and Benchmarking at Scale. CoRR abs/2002.08295 (2020) - [i37]Abdul Dakkak, Cheng Li, Jinjun Xiong, Wen-Mei Hwu:
DLSpec: A Deep Learning Task Exchange Specification. CoRR abs/2002.11262 (2020) - [i36]Zhonghao Wang, Mo Yu, Yunchao Wei, Rogério Schmidt Feris, Jinjun Xiong, Wen-Mei Hwu, Thomas S. Huang, Honghui Shi:
Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation. CoRR abs/2003.08040 (2020) - [i35]Zhonghao Wang, Yunchao Wei, Rogério Feris, Jinjun Xiong, Wen-Mei Hwu, Thomas S. Huang, Honghui Shi:
Alleviating Semantic-level Shift: A Semi-supervised Domain Adaptation Method for Semantic Segmentation. CoRR abs/2004.00794 (2020) - [i34]Yuhong Li, Cong Hao, Xiaofan Zhang, Xinheng Liu, Yao Chen, Jinjun Xiong, Wen-Mei W. Hwu, Deming Chen:
EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions. CoRR abs/2005.02563 (2020) - [i33]Seungwon Min, Vikram Sharma Mailthody, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-Mei W. Hwu:
EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs. CoRR abs/2006.06890 (2020) - [i32]Mert Hidayetoglu, Carl Pearson, Vikram Sharma Mailthody, Eiman Ebrahimi, Jinjun Xiong, Rakesh Nagi, Wen-mei W. Hwu:
Efficient Inference on GPUs for the Sparse Deep Neural Network Graph Challenge 2020. CoRR abs/2007.14152 (2020) - [i31]Zaid Qureshi, Vikram Sharma Mailthody, Seungwon Min, I-Hsin Chung, Jinjun Xiong, Wen-Mei W. Hwu:
Tearing Down the Memory Wall. CoRR abs/2008.10169 (2020) - [i30]Xiaofan Zhang, Hanchen Ye, Junsong Wang, Yonghua Lin, Jinjun Xiong, Wen-Mei W. Hwu, Deming Chen:
DNNExplorer: A Framework for Modeling and Exploring a Novel Paradigm of FPGA-based DNN Accelerator. CoRR abs/2008.12745 (2020) - [i29]Mert Hidayetoglu, Tekin Bicer
, Simon Garcia De Gonzalo, Bin Ren, Vincent De Andrade, Doga Gürsoy, Raj Kettimuthu, Ian T. Foster, Wen-mei W. Hwu:
Petascale XCT: 3D Image Reconstruction with Hierarchical Communications on Multi-GPU Nodes. CoRR abs/2009.07226 (2020) - [i28]Jie Huang, Zilong Wang, Kevin Chen-Chuan Chang, Wen-Mei Hwu, Jinjun Xiong:
Exploring Semantic Capacity of Terms. CoRR abs/2010.01898 (2020) - [i27]Cong Hao, Yao Chen, Xiaofan Zhang, Yuhong Li, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices. CoRR abs/2010.07185 (2020) - [i26]Zhonghao Wang, Mo Yu, Kai Wang, Jinjun Xiong, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi:
Interpretable Visual Reasoning via Induced Symbolic Space. CoRR abs/2011.11603 (2020) - [i25]Carl Pearson, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu:
Fast CUDA-Aware MPI Datatypes without Platform Support. CoRR abs/2012.14363 (2020)
2010 – 2019
- 2019
- [c193]Abdul Dakkak, Cheng Li
, Simon Garcia De Gonzalo, Jinjun Xiong, Wen-Mei Hwu:
TrIMS: Transparent and Isolated Model Sharing for Low Latency Deep Learning Inference in Function-as-a-Service. CLOUD 2019: 372-382 - [c192]Qin Li, Xiaofan Zhang, Jinjun Xiong, Wen-Mei Hwu, Deming Chen:
Implementing neural machine translation with bi-directional GRU and attention mechanism on FPGAs using HLS. ASP-DAC 2019: 693-698 - [c191]Aayush Ankit, Izzat El Hajj, Sai Rahul Chalamalasetti, Geoffrey Ndu, Martin Foltin, R. Stanley Williams
, Paolo Faraboschi, Wen-mei W. Hwu, John Paul Strachan, Kaushik Roy, Dejan S. Milojicic
:
PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference. ASPLOS 2019: 715-731 - [c190]Ahmed H. M. O. Abulila, Vikram Sharma Mailthody
, Zaid Qureshi, Jian Huang, Nam Sung Kim, Jinjun Xiong, Wen-Mei W. Hwu:
FlatFlash: Exploiting the Byte-Accessibility of SSDs within a Unified Memory-Storage Hierarchy. ASPLOS 2019: 971-985 - [c189]Simon Garcia De Gonzalo, Sitao Huang, Juan Gómez-Luna, Simon D. Hammond, Onur Mutlu, Wen-Mei Hwu:
Automatic Generation of Warp-Level Primitives and Atomic Instructions for Fast and Portable Parallel Reduction on GPUs. CGO 2019: 73-84 - [c188]Cong Hao, Xiaofan Zhang, Yuhong Li, Sitao Huang, Jinjun Xiong, Kyle Rupnow, Wen-Mei Hwu, Deming Chen:
FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge. DAC 2019: 206 - [c187]Omer Anjum, Hongyu Gong, Suma Bhat, Wen-Mei Hwu, Jinjun Xiong:
PaRe: A Paper-Reviewer Matching Approach Using a Common Topic Space. EMNLP/IJCNLP (1) 2019: 518-528 - [c186]Seungwon Min, Sitao Huang, Mohamed El-Hadedy, Jinjun Xiong, Deming Chen, Wen-Mei Hwu:
Analysis and Optimization of I/O Cache Coherency Strategies for SoC-FPGA Device. FPL 2019: 301-306 - [c185]Omer Anjum, Simon Garcia De Gonzalo, Mert Hidayetoglu
, Wen-Mei Hwu:
An Efficient GPU Implementation Technique for Higher-Order 3D Stencils. HPCC/SmartCity/DSS 2019: 552-561 - [c184]Mohammad Almasri, Omer Anjum, Carl Pearson, Zaid Qureshi, Vikram S. Mailthody
, Rakesh Nagi, Jinjun Xiong, Wen-Mei W. Hwu:
Update on k-truss Decomposition on GPU. HPEC 2019: 1-7 - [c183]Sitao Huang, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Deming Chen, Wen-Mei W. Hwu:
Accelerating Sparse Deep Neural Networks on FPGAs. HPEC 2019: 1-7 - [c182]Carl Pearson, Mohammad Almasri, Omer Anjum, Vikram S. Mailthody
, Zaid Qureshi, Rakesh Nagi, Jinjun Xiong, Wen-Mei W. Hwu:
Update on Triangle Counting on GPU. HPEC 2019: 1-7 - [c181]Cong Hao, Yao Chen
, Xinheng Liu, Atif Sarwari, Daryl Sew, Ashutosh Dhar, Bryan Wu, Dongdong Fu, Jinjun Xiong, Wen-Mei Hwu, Junli Gu, Deming Chen:
NAIS: Neural Architecture and Implementation Search and its Applications in Autonomous Driving. ICCAD 2019: 1-8 - [c180]Bowen Cheng, Liang-Chieh Chen, Yunchao Wei, Yukun Zhu, Zilong Huang, Jinjun Xiong, Thomas S. Huang, Wen-Mei Hwu, Honghui Shi:
SPGNet: Semantic Prediction Guidance for Scene Parsing. ICCV 2019: 5217-5227 - [c179]Abdul Dakkak, Cheng Li, Jinjun Xiong, Isaac Gelado, Wen-Mei W. Hwu:
Accelerating reduction and scan using tensor core units. ICS 2019: 46-57 - [c178]Ashutosh Dhar, Sitao Huang, Jinjun Xiong, Damir A. Jamsek, Bruno Mesnet, Jian Huang, Nam Sung Kim, Wen-Mei W. Hwu, Deming Chen:
Near-Memory and In-Storage FPGA Acceleration for Emerging Cognitive Computing Workloads. ISVLSI 2019: 68-75 - [c177]Vikram Sharma Mailthody
, Zaid Qureshi, Weixin Liang, Ziyan Feng, Simon Garcia De Gonzalo, Youjie Li, Hubertus Franke, Jinjun Xiong, Jian Huang, Wen-Mei Hwu:
DeepStore: In-Storage Acceleration for Intelligent Queries. MICRO 2019: 224-238 - [c176]Hongyu Gong, Suma Bhat, Lingfei Wu, Jinjun Xiong, Wen-Mei W. Hwu:
Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus. NAACL-HLT (1) 2019: 3168-3180 - [c175]Mert Hidayetoglu
, Tekin Biçer
, Simon Garcia De Gonzalo, Bin Ren, Doga Gürsoy
, Rajkumar Kettimuthu, Ian T. Foster, Wen-mei W. Hwu:
MemXCT: memory-centric X-ray CT reconstruction with massive parallelization. SC 2019: 85:1-85:56 - [c174]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei Hwu:
MLModelScope: Evaluate and Introspect Cognitive Pipelines. SERVICES 2019: 335-338 - [c173]Sitao Huang, Li-Wen Chang, Izzat El Hajj, Simon Garcia De Gonzalo, Juan Gómez-Luna, Sai Rahul Chalamalasetti, Mohamed El-Hadedy, Dejan S. Milojicic
, Onur Mutlu, Deming Chen, Wen-Mei W. Hwu:
Analysis and Modeling of Collaborative Execution Strategies for Heterogeneous CPU-FPGA Architectures. ICPE 2019: 79-90 - [c172]Carl Pearson, Abdul Dakkak, Sarah Hashash, Cheng Li, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu:
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects. ICPE 2019: 209-218 - [i24]Aayush Ankit, Izzat El Hajj, Sai Rahul Chalamalasetti, Geoffrey Ndu, Martin Foltin, R. Stanley Williams, Paolo Faraboschi, Wen-Mei Hwu, John Paul Strachan, Kaushik Roy, Dejan S. Milojicic:
PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference. CoRR abs/1901.10351 (2019) - [i23]Hongyu Gong, Suma Bhat, Lingfei Wu, Jinjun Xiong, Wen-Mei Hwu:
Reinforcement Learning Based Text Style Transfer without Parallel Training Corpus. CoRR abs/1903.10671 (2019) - [i22]Cong Hao, Xiaofan Zhang, Yuhong Li, Sitao Huang, Jinjun Xiong, Kyle Rupnow, Wen-Mei Hwu, Deming Chen:
FPGA/DNN Co-Design: An Efficient Design Methodology for IoT Intelligence on the Edge. CoRR abs/1904.04421 (2019) - [i21]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wen-Mei Hwu:
Challenges and Pitfalls of Reproducing Machine Learning Artifacts. CoRR abs/1904.12437 (2019) - [i20]Xiaofan Zhang, Cong Hao, Yuhong Li, Yao Chen
, Jinjun Xiong, Wen-Mei W. Hwu, Deming Chen:
A Bi-Directional Co-Design Approach to Enable Deep Learning on IoT Devices. CoRR abs/1905.08369 (2019) - [i19]Omer Anjum, Wen-Mei Hwu, Jinjun Xiong:
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA Publications. CoRR abs/1906.09380 (2019) - [i18]Xiaofan Zhang, Cong Hao, Haoming Lu, Jiachen Li, Yuhong Li, Yuchen Fan, Kyle Rupnow, Jinjun Xiong, Thomas S. Huang, Honghui Shi, Wen-Mei Hwu, Deming Chen:
SkyNet: A Champion Model for DAC-SDC on Low Power Object Detection. CoRR abs/1906.10327 (2019) - [i17]Seungwon Min, Sitao Huang, Mohamed El-Hadedy, Jinjun Xiong, Deming Chen, Wen-Mei Hwu:
Analysis and Optimization of I/O Cache Coherency Strategies for SoC-FPGA Device. CoRR abs/1908.01261 (2019) - [i16]Cheng Li, Abdul Dakkak, Jinjun Xiong, Wei Wei, Lingjie Xu, Wen-Mei Hwu:
Across-Stack Profiling and Characterization of Machine Learning Models on GPUs. CoRR abs/1908.06869 (2019) - [i15]