John D. Owens
Person information
- affiliation: University of California, Davis, US
2020 – today
- 2024
- [c101] John D. Owens, Bruce Hoppe: Helping Faculty Teach Software Performance Engineering. IPDPS (Workshops) 2024: 338-341
- [i36] Toluwanimi O. Odemuyiwa, Joel S. Emer, John D. Owens: The EDGE Language: Extended General Einsums for Graph Algorithms. CoRR abs/2404.11591 (2024)
- [i35] Zhongyi Lin, Ning Sun, Pallab Bhattacharya, Xizhou Feng, Louis Feng, John D. Owens: Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms. CoRR abs/2404.12674 (2024)
- 2023
- [c100] Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, Martin Farach-Colton, John D. Owens: Analyzing and Implementing GPU Hash Tables. APOCS 2023: 33-50
- [c99] Toluwanimi O. Odemuyiwa, Hadi Asghari Moghaddam, Michael Pellauer, Kartik Hegde, Po-An Tsai, Neal Clayton Crago, Aamer Jaleel, John D. Owens, Edgar Solomonik, Joel S. Emer, Christopher W. Fletcher: Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling. ASPLOS (3) 2023: 18-32
- [c98] Toluwanimi O. Odemuyiwa, Hadi Asghari Moghaddam, Michael Pellauer, Kartik Hegde, Po-An Tsai, Neal Clayton Crago, Aamer Jaleel, John D. Owens, Edgar Solomonik, Joel S. Emer, Christopher W. Fletcher: Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling (Extended Abstract). HOPC@SPAA 2023: 15-16
- [c97] Afton Geil, Serban D. Porumbescu, John D. Owens: Maximum Clique Enumeration on the GPU. IPDPS Workshops 2023: 234-244
- [c96] Jonathan D. Wapman, Sean Treichler, Serban D. Porumbescu, John D. Owens: Harmonic CUDA: Asynchronous Programming on GPUs. PMAM@PPoPP 2023: 39-49
- [c95] Muhammad Osama, Serban D. Porumbescu, John D. Owens: A Programming Model for GPU Load Balancing. PPoPP 2023: 79-91
- [c94] Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens: Stream-K: Work-Centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU. PPoPP 2023: 429-431
- [i34] Muhammad Osama, Duane Merrill, Cris Cecka, Michael Garland, John D. Owens: Stream-K: Work-centric Parallel Decomposition for Dense Matrix-Matrix Multiplication on the GPU. CoRR abs/2301.03598 (2023)
- [i33] Muhammad Osama, Serban D. Porumbescu, John D. Owens: A Programming Model for GPU Load Balancing. CoRR abs/2301.04792 (2023)
- [i32] Matthew Drescher, Muhammad A. Awad, Serban D. Porumbescu, John D. Owens: BOBA: A Parallel Lightweight Graph Reordering Algorithm with Heavyweight Implications. CoRR abs/2306.10410 (2023)
- [i31] Cameron Shinn, Collin McCarthy, Saurav Muralidharan, Muhammad Osama, John D. Owens: The Sparsity Roofline: Understanding the Hardware Limits of Sparse Neural Networks. CoRR abs/2310.00496 (2023)
- 2022
- [j34] Kerry A. Seitz, Theresa Foley, Serban D. Porumbescu, John D. Owens: Supporting Unified Shader Specialization by Co-opting C++ Features. Proc. ACM Comput. Graph. Interact. Tech. 5(3): 25:1-25:17 (2022)
- [j33] Carl Yang, Aydin Buluç, John D. Owens: GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU. ACM Trans. Math. Softw. 48(1): 1:1-1:51 (2022)
- [c93] Muhammad A. Awad, Serban D. Porumbescu, John D. Owens: A GPU Multiversion B-Tree. PACT 2022: 481-493
- [c92] Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens: Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. HIPC 2022: 48-58
- [c91] Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens: Atos: A Task-Parallel GPU Scheduler for Graph Analytics. ICPP 2022: 50:1-50:11
- [c90] Muhammad Osama, Serban D. Porumbescu, John D. Owens: Essentials of Parallel Graph Analytics. IPDPS Workshops 2022: 314-317
- [c89] Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens: Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. ISPASS 2022: 227-229
- [c88] Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens: Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way. SC 2022: 50:1-50:16
- [p3] Jeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John Gilbert, Shana Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, José E. Moreira, John D. Owens, Carl Yang, Marcin Zalewski, Timothy G. Mattson: Introduction to GraphBLAS. Massive Graph Analytics 2022: 507-524
- [i30] Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens: Building a Performance Model for Deep Learning Recommendation Model Training on GPUs. CoRR abs/2201.07821 (2022)
- [i29] Muhammad Osama, Serban D. Porumbescu, John D. Owens: Essentials of Parallel Graph Analytics. CoRR abs/2212.08200 (2022)
- 2021
- [j32] Ahmed H. Mahmoud, Serban D. Porumbescu, John D. Owens: RXMesh: a GPU mesh data structure. ACM Trans. Graph. 40(4): 104:1-104:16 (2021)
- [c87] Zhongyi Lin, Evangelos Georganas, John D. Owens: Towards Flexible and Compiler-Friendly Layer Fusion for CNNs on Multicore CPUs. Euro-Par 2021: 232-248
- [i28] Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, Martin Farach-Colton, John D. Owens: Better GPU Hash Tables. CoRR abs/2108.07232 (2021)
- [i27] Kerry A. Seitz Jr., Theresa Foley, Serban D. Porumbescu, John D. Owens: Unified Shader Programming in C++. CoRR abs/2109.14682 (2021)
- [i26] Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens: Atos: A Task-Parallel GPU Dynamic Scheduling Framework for Dynamic Irregular Computations. CoRR abs/2112.00132 (2021)
- 2020
- [j31] Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi: VoroCrust: Voronoi Meshing Without Clipping. ACM Trans. Graph. 39(3): 23:1-23:16 (2020)
- [c86] Muhammad A. Awad, Saman Ashkiani, Serban D. Porumbescu, John D. Owens: Dynamic Graphs on the GPU. IPDPS 2020: 739-748
- [c85] Weitang Liu, Xiaoyun Wang, John D. Owens, Yixuan Li: Energy-based Out-of-distribution Detection. NeurIPS 2020
- [i25] Leyuan Wang, John D. Owens: Fast Gunrock Subgraph Matching (GSM) on GPUs. CoRR abs/2003.01527 (2020)
- [i24] Weitang Liu, Xiaoyun Wang, John D. Owens, Yixuan Li: Energy-based Out-of-distribution Detection. CoRR abs/2010.03759 (2020)
2010 – 2019
- 2019
- [j30] Zhongyi Lin, Matthew Yih, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik: Benchmarking Deep Learning Frameworks and Investigating FPGA Deployment for Traffic Sign Classification and Detection. IEEE Trans. Intell. Veh. 4(3): 385-395 (2019)
- [j29] Kerry A. Seitz Jr., Theresa Foley, Serban D. Porumbescu, John D. Owens: Staged metaprogramming for shader system development. ACM Trans. Graph. 38(6): 202:1-202:15 (2019)
- [c84] Xiaoyun Wang, Zhongyi Lin, Carl Yang, John D. Owens: Accelerating DNN Inference with GraphBLAS and the GPU. HPEC 2019: 1-6
- [c83] Leyuan Wang, John D. Owens: Fast BFS-Based Triangle Counting on GPUs. HPEC 2019: 1-6
- [c82] Muhammad Osama, Minh Truong, Carl Yang, Aydin Buluç, John D. Owens: Graph Coloring on the GPU. IPDPS Workshops 2019: 231-240
- [c81] Muhammad A. Awad, Saman Ashkiani, Rob Johnson, Martin Farach-Colton, John D. Owens: Engineering a high-performance GPU B-Tree. PPoPP 2019: 145-157
- [c80] Benjamin A. Brock, Yuxin Chen, Jiakun Yan, John D. Owens, Aydin Buluç, Katherine A. Yelick: RDMA vs. RPC for Implementing Distributed Data Structures. IA3@SC 2019: 17-22
- [i23] Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi: VoroCrust: Voronoi Meshing Without Clipping. CoRR abs/1902.08767 (2019)
- [i22] Carl Yang, Aydin Buluç, John D. Owens: GraphBLAST: A High-Performance Linear Algebra-based Graph Framework on the GPU. CoRR abs/1908.01407 (2019)
- [i21] Leyuan Wang, John D. Owens: Fast BFS-Based Triangle Counting on GPUs. CoRR abs/1909.02127 (2019)
- [i20] Benjamin Brock, Yuxin Chen, Jiakun Yan, John D. Owens, Aydin Buluç, Katherine A. Yelick: RDMA vs. RPC for Implementing Distributed Data Structures. CoRR abs/1910.02158 (2019)
- [i19] Weitang Liu, Lifeng Wei, James Sharpnack, John D. Owens: Unsupervised Object Segmentation with Explicit Localization Module. CoRR abs/1911.09228 (2019)
- 2018
- [j28] John D. Owens: Technical perspective: Graphs, betweenness centrality, and the GPU. Commun. ACM 61(8): 84 (2018)
- [c79] Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi: Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm. SoCG 2018: 1:1-1:16
- [c78] Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi: VoroCrust Illustrated: Theory and Challenges (Multimedia Exposition). SoCG 2018: 77:1-77:4
- [c77] Carl Yang, Aydin Buluç, John D. Owens: Design Principles for Sparse Matrix Multiplication on the GPU. Euro-Par 2018: 672-687
- [c76] Carl Yang, Aydin Buluç, John D. Owens: Implementing Push-Pull Efficiently in GraphBLAS. ICPP 2018: 89:1-89:11
- [c75] Saman Ashkiani, Martin Farach-Colton, John D. Owens: A Dynamic Hash Table for the GPU. IPDPS 2018: 419-429
- [c74] Saman Ashkiani, Shengren Li, Martin Farach-Colton, Nina Amenta, John D. Owens: GPU LSM: A Dynamic Dictionary Data Structure for the GPU. IPDPS 2018: 430-440
- [c73] Afton Geil, Martin Farach-Colton, John D. Owens: Quotient Filters: Approximate Membership Queries on the GPU. IPDPS 2018: 451-462
- [c72] Yuechao Pan, Roger Pearce, John D. Owens: Scalable Breadth-First Search on a GPU Cluster. IPDPS 2018: 1090-1101
- [c71] Matthew Yih, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik: FPGA versus GPU for Speed-Limit-Sign Recognition. ITSC 2018: 843-850
- [c70] Zhongyi Lin, Jeffrey M. Ota, John D. Owens, Pinar Muyan-Özçelik: Benchmarking Deep Learning Frameworks with FPGA-suitable Models on a Traffic Sign Dataset. Intelligent Vehicles Symposium 2018: 1197-1203
- [i18] Yuechao Pan, Roger Pearce, John D. Owens: Scalable Breadth-First Search on a GPU Cluster. CoRR abs/1803.03922 (2018)
- [i17] Ahmed Abdelkader, Chandrajit L. Bajaj, Mohamed S. Ebeida, Ahmed H. Mahmoud, Scott A. Mitchell, John D. Owens, Ahmad A. Rushdi: Sampling Conditions for Conforming Voronoi Meshing by the VoroCrust Algorithm. CoRR abs/1803.06078 (2018)
- [i16] Carl Yang, Aydin Buluç, John D. Owens: Design Principles for Sparse Matrix Multiplication on the GPU. CoRR abs/1803.08601 (2018)
- [i15] Carl Yang, Aydin Buluç, John D. Owens: Implementing Push-Pull Efficiently in GraphBLAS. CoRR abs/1804.03327 (2018)
- [i14] Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens: A Comparative Study on Exact Triangle Counting Algorithms on the GPU. CoRR abs/1804.06926 (2018)
- [i13] Weitang Liu, Emad Barsoum, John D. Owens: Object Localization and Motion Transfer learning with Capsules. CoRR abs/1805.07706 (2018)
- 2017
- [j27] Ahmed Abdelkader, Ahmed H. Mahmoud, Ahmad A. Rushdi, Scott A. Mitchell, John D. Owens, Mohamed S. Ebeida: A Constrained Resampling Strategy for Mesh Improvement. Comput. Graph. Forum 36(5): 189-201 (2017)
- [j26] Pinar Muyan-Özçelik, John D. Owens: Methods for multitasking among real-time embedded compute tasks running on the GPU. Concurr. Comput. Pract. Exp. 29(15) (2017)
- [j25] Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens: GPU Multisplit: An Extended Study of a Parallel Algorithm. ACM Trans. Parallel Comput. 4(1): 2:1-2:44 (2017)
- [j24] Yangzihao Wang, Yuechao Pan, Andrew A. Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Muhammad Osama, Chenshan Yuan, Weitang Liu, Andy T. Riffel, John D. Owens: Gunrock: GPU Graph Analytics. ACM Trans. Parallel Comput. 4(1): 3:1-3:49 (2017)
- [c69] Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens: Multi-GPU Graph Analytics. IPDPS 2017: 479-490
- [c68] Yangzihao Wang, Sean Baxter, John D. Owens: Mini-Gunrock: A Lightweight Graph Analytics Framework on the GPU. IPDPS Workshops 2017: 616-626
- [i12] Yangzihao Wang, Yuechao Pan, Andrew A. Davidson, Yuduo Wu, Carl Yang, Leyuan Wang, Muhammad Osama, Chenshan Yuan, Weitang Liu, Andy T. Riffel, John D. Owens: Gunrock: GPU Graph Analytics. CoRR abs/1701.01170 (2017)
- [i11] Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens: GPU Multisplit. CoRR abs/1701.01189 (2017)
- [i10] Saman Ashkiani, Shengren Li, Martin Farach-Colton, Nina Amenta, John D. Owens: GPU LSM: A Dynamic Dictionary Data Structure for the GPU. CoRR abs/1707.05354 (2017)
- [i9] Saman Ashkiani, Martin Farach-Colton, John D. Owens: A Dynamic Hash Table for the GPU. CoRR abs/1710.11246 (2017)
- 2016
- [j23] Mohamed S. Ebeida, Ahmad A. Rushdi, Muhammad A. Awad, Ahmed H. Mahmoud, Dong-Ming Yan, Shawn A. English, John D. Owens, Chandrajit L. Bajaj, Scott A. Mitchell: Disk Density Tuning of a Maximal Random Packing. Comput. Graph. Forum 35(5): 259-269 (2016)
- [j22] Leyuan Wang, Sean Baxter, John D. Owens: Fast parallel skew and prefix-doubling suffix array construction on the GPU. Concurr. Comput. Pract. Exp. 28(12): 3466-3484 (2016)
- [j21] Jonathan Y. Kemal, Roger L. Davis, John D. Owens: Multidisciplinary simulation acceleration using multiple shared memory graphical processing units. Int. J. High Perform. Comput. Appl. 30(4): 486-508 (2016)
- [c67] Ross K. Gegan, Vishal Ahuja, John D. Owens, Dipak Ghosal: Real-time GPU-based timing channel detection using entropy. CNS 2016: 296-305
- [c66] Leyuan Wang, Yangzihao Wang, Carl Yang, John D. Owens: A Comparative Study on Exact Triangle Counting Algorithms on the GPU. HPGP@HPDC 2016: 1-8
- [c65] Jeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John R. Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, Carl Yang, John D. Owens, Marcin Zalewski, Timothy G. Mattson, José E. Moreira: Mathematical foundations of the GraphBLAS. HPEC 2016: 1-9
- [c64] Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens: Gunrock: a high-performance graph processing library on the GPU. PPoPP 2016: 11:1-11:12
- [c63] Saman Ashkiani, Andrew A. Davidson, Ulrich Meyer, John D. Owens: GPU multisplit. PPoPP 2016: 12:1-12:13
- [c62] Pinar Muyan-Özçelik, John D. Owens: Multitasking Real-time Embedded GPU Computing Tasks. PMAM@PPoPP 2016: 78-87
- [c61] Saman Ashkiani, Nina Amenta, John D. Owens: Parallel Approaches to the String Matching Problem on the GPU. SPAA 2016: 275-285
- [i8] Jeremy Kepner, Peter Aaltonen, David A. Bader, Aydin Buluç, Franz Franchetti, John R. Gilbert, Dylan Hutchison, Manoj Kumar, Andrew Lumsdaine, Henning Meyerhenke, Scott McMillan, José E. Moreira, John D. Owens, Carl Yang, Marcin Zalewski, Timothy G. Mattson: Mathematical Foundations of the GraphBLAS. CoRR abs/1606.05790 (2016)
- 2015
- [j20] Anjul Patney, Stanley Tzeng, Kerry A. Seitz Jr., John D. Owens: Piko: a framework for authoring programmable graphics pipelines. ACM Trans. Graph. 34(4): 147:1-147:13 (2015)
- [c60] Mikhail M. Shashkov, Jason Mak, Shawn Recker, Connie S. Nguyen, John D. Owens, Kenneth I. Joy: Efficient dense reconstruction using geometry and image consistency constraints. AIPR 2015: 1-7
- [c59] Leyuan Wang, Sean Baxter, John D. Owens: Fast Parallel Suffix Array on the GPU. Euro-Par 2015: 573-587
- [c58] Yuduo Wu, Yangzihao Wang, Yuechao Pan, Carl Yang, John D. Owens: Performance Characterization of High-Level Programming Models for GPU Graph Analytics. IISWC 2015: 66-75
- [c57] Carl Yang, Yangzihao Wang, John D. Owens: Fast Sparse Matrix and Sparse Vector Multiplication Algorithm on the GPU. IPDPS Workshops 2015: 841-847
- [c56] Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens: Gunrock: a high-performance graph processing library on the GPU. PPoPP 2015: 265-266
- [c55] Thomas Weber, Michael Wimmer, John D. Owens: Parallel Reyes-style adaptive subdivision with bounded memory usage. I3D 2015: 39-45
- [p2] Mohamed S. Ebeida, Scott A. Mitchell, Anjul Patney, Andrew A. Davidson, Stanley Tzeng, Muhammad A. Awad, Ahmed H. Mahmoud, John D. Owens: Exercises in High-Dimensional Sampling: Maximal Poisson-Disk Sampling and k-d Darts. Topological and Statistical Methods for Complex Data, Tackling Large-Scale, High-Dimensional, and Multivariate Data Spaces 2015: 221-238
- [i7] Yangzihao Wang, Andrew A. Davidson, Yuechao Pan, Yuduo Wu, Andy Riffel, John D. Owens: Gunrock: A High-Performance Graph Processing Library on the GPU. CoRR abs/1501.05387 (2015)
- [i6] Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens: Multi-GPU Graph Analytics. CoRR abs/1504.04804 (2015)
- 2014
- [j19] Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, John D. Owens: k-d Darts: Sampling by k-dimensional flat searches. ACM Trans. Graph. 33(1): 3:1-3:16 (2014)
- [c54] Jason Mak, Mauricio Hess-Flores, Shawn Recker, John D. Owens, Kenneth I. Joy: A Comparative Study of GPU-Accelerated Multi-view Sequential Reconstruction Triangulation Methods for Large-Scale Scenes. ACCV Workshops (1) 2014: 254-269
- [c53] Afton Geil, Yangzihao Wang, John D. Owens: WTF, GPU! computing twitter's who-to-follow on the GPU. COSN 2014: 63-68
- [c52] Andrew A. Davidson, Sean Baxter, Michael Garland, John D. Owens: Work-Efficient Parallel GPU Methods for Single-Source Shortest Paths. IPDPS 2014: 349-359
- [c51] Jason Mak, Mauricio Hess-Flores, Shawn Recker, John D. Owens, Kenneth I. Joy: GPU-accelerated and efficient multi-view triangulation for scene reconstruction. WACV 2014: 61-68
- [i5] Anjul Patney, Stanley Tzeng, Kerry A. Seitz Jr., John D. Owens: Piko: A Design Framework for Programmable Graphics Pipelines. CoRR abs/1404.6293 (2014)
- 2013
- [j18] Mohamed S. Ebeida, Ahmed H. Mahmoud, Muhammad A. Awad, Mohammed A. Mohammed, Scott A. Mitchell, Alexander Rand, John D. Owens: Sifted Disks. Comput. Graph. Forum 32(2): 509-518 (2013)
- [i4] Mohamed S. Ebeida, Anjul Patney, Scott A. Mitchell, Keith R. Dalbey, Andrew A. Davidson, John D. Owens: k-d Darts: Sampling by k-Dimensional Flat Searches. CoRR abs/1302.3917 (2013)
- [i3] Kerry A. Seitz Jr., Alex Kennedy, Owen Ransom, Bassam A. Younis, John D. Owens: A GPU Implementation for Two-Dimensional Shallow Water Modeling. CoRR abs/1309.1230 (2013)
- 2012
- [j17] Mohamed S. Ebeida, Scott A. Mitchell, Anjul Patney, Andrew A. Davidson, John D. Owens: A Simple Algorithm for Maximal Poisson-Disk Sampling in High Dimensions. Comput. Graph. Forum 31(2pt4): 785-794 (2012)
- [j16] Stanley Tzeng, Brandon Lloyd, John D. Owens: A GPU Task-Parallel Model with Dependency Resolution. Computer 45(8): 34-41 (2012)
- [c50] Stanley Tzeng, Anjul Patney, Andrew A. Davidson, Mohamed S. Ebeida, Scott A. Mitchell, John D. Owens: High-Quality Parallel Depth-of-Field Using Line Samples. High Performance Graphics 2012: 23-31
- [c49] Shengren Li, Lance Simons, Jagadeesh Bhaskar Pakaravoor, Fatemeh Abbasinejad, John D. Owens, Nina Amenta: kANN on the GPU with Shifted Sorting. High Performance Graphics 2012: 39-47
- [c48] Yao Zhang, John Recker, Robert Ulichney, Ingeborg Tastl, John D. Owens: Plane-dependent error diffusion on a GPU. Image Processing: Algorithms and Systems / Parallel Processing for Imaging Applications 2012: 829515
- [i2] Stanley Tzeng, John D. Owens: Finding Convex Hulls Using Quickhull on the GPU. CoRR abs/1201.2936 (2012)
- 2011
- [j15] Mohamed S. Ebeida, Scott A. Mitchell, Andrew A. Davidson, Anjul Patney, Patrick M. Knupp, John D. Owens: Efficient and good Delaunay meshes from random points. Comput. Aided Des. 43(11): 1506-1515 (2011)
- [j14] Everett H. Phillips, Yao Zhang, Roger L. Davis, John D. Owens: Acceleration of 2-D Compressible Flow Solvers with Graphics Processing Unit Clusters. J. Aerosp. Comput. Inf. Commun. 8(8): 237-249 (2011)
- [j13] Mohamed S. Ebeida, Andrew A. Davidson, Anjul Patney, Patrick M. Knupp, Scott A. Mitchell, John D. Owens: Efficient maximal poisson-disk sampling. ACM Trans. Graph. 30(4): 49 (2011)
- [c47] Andrew A. Davidson, John D. Owens: Register packing for cyclic reduction: a case study. GPGPU 2011: 4
- [c46] John Jenkins, Isha Arkatkar, John D. Owens, Alok N. Choudhary, Nagiza F. Samatova: Lessons Learned from Exploring the Backtracking Paradigm on the GPU. Euro-Par (2) 2011: 425-437
- [c45] Kshitij Gupta, John D. Owens: Compute & memory optimizations for high-quality speech recognition on low-end GPU processors. HiPC 2011: 1-10
- [c44] Yao Zhang, John D. Owens: A quantitative performance analysis model for GPU architectures. HPCA 2011: 382-393
- [c43] Andrew A. Davidson, Yao Zhang, John D. Owens: An Auto-tuned Method for Solving Large Tridiagonal Systems on the GPU. IPDPS 2011: 956-965
- [c42] Jeff A. Stuart, John D. Owens: Multi-GPU MapReduce on GPU Clusters. IPDPS 2011: 1068-1079
- [c41] Vladimir Glavtchev, Pinar Muyan-Özçelik, Jeffrey M. Ota, John D. Owens: Feature-based speed limit sign detection using a graphics processing unit. Intelligent Vehicles Symposium 2011: 195-200
- [c40] Yao Zhang, John Recker, Robert Ulichney, Giordano B. Beretta, Ingeborg Tastl, I-Jong Lin, John D. Owens: A parallel error diffusion implementation on a GPU. Parallel Processing for Imaging Applications 2011: 78720K
- [c39] Stanley Tzeng, Anjul Patney, John D. Owens: Efficient adaptive tiling for programmable rendering. SI3D 2011: 201
- [e2] John D. Owens, I-Jong Lin, Yu-Jin Zhang, Giordano B. Beretta: Proceedings of the Conference on Parallel Processing for Imaging Applications 2011, San Francisco Airport, CA, USA, January 24-25, 2011. SPIE Proceedings 7872, SPIE 2011, ISBN 978-0-8194-8409-3
- [i1] Jeff A. Stuart, John D. Owens: Efficient Synchronization Primitives for GPUs. CoRR abs/1110.4623 (2011)
- 2010
- [j12] Anjul Patney, Stanley Tzeng, John D. Owens: Fragment-Parallel Composite and Filter. Comput. Graph. Forum 29(4): 1251-1258 (2010)
- [c38] Pinar Muyan-Özçelik, Vladimir Glavtchev, Jeffrey M. Ota, John D. Owens: A Template-Based Approach for Real-Time Speed-Limit-Sign Recognition on an Embedded System Using GPU Computing. DAGM-Symposium 2010: 162-171
- [c37] Stanley Tzeng, Anjul Patney, John D. Owens: Task management for irregular-parallel workloads on the GPU. High Performance Graphics 2010: 29-37
- [c36] Jeff A. Stuart, Michael Cox, John D. Owens: GPU-to-CPU Callbacks. Euro-Par Workshops 2010: 365-372
- [c35] Jeff A. Stuart, Cheng-Kai Chen, Kwan-Liu Ma, John D. Owens: Multi-GPU volume rendering using MapReduce. HPDC 2010: 841-848
- [c34] Andrew A. Davidson, John D. Owens: Toward Techniques for Auto-tuning GPU Algorithms. PARA (2) 2010: 110-119
- [c33] Yao Zhang, Jonathan Cohen, John D. Owens: Fast tridiagonal solvers on the GPU. PPoPP 2010: 127-136
- [p1] Shubhabrata Sengupta, Mark J. Harris, Michael Garland, John D. Owens: Efficient Parallel Scan Algorithms for Manycore GPUs. Scientific Computing with Multicore and Accelerators 2010: 413-442
2000 – 2009
- 2009
- [j11] Brian Budge, Tony Bernardin, Jeff A. Stuart, Shubhabrata Sengupta, Kenneth I. Joy, John D. Owens: Out-of-core Data Management for Path Tracing on Hybrid Resources. Comput. Graph. Forum 28(2): 385-396 (2009)
- [j10] Dan A. Alcantara, Andrei Sharf, Fatemeh Abbasinejad, Shubhabrata Sengupta, Michael Mitzenmacher, John D. Owens, Nina Amenta: Real-time parallel hashing on the GPU. ACM Trans. Graph. 28(5): 154 (2009)
- [c32] Kshitij Gupta, John D. Owens: Three-layer optimizations for fast GMM computations on GPU-like parallel processors. ASRU 2009: 146-151
- [c31] Anjul Patney, Mohamed S. Ebeida, John D. Owens: Parallel view-dependent tessellation of Catmull-Clark subdivision surfaces. High Performance Graphics 2009: 99-108
- [c30] John D. Owens: HCW 2009 keynote talk: GPU computing: Heterogeneous computing for future systems. IPDPS 2009: 1
- [c29] Jeff A. Stuart, John D. Owens: Message passing on data-parallel architectures. IPDPS 2009: 1-12
- [c28] Luke J. Gosink, Kesheng Wu, E. Wes Bethel, John D. Owens, Kenneth I. Joy: Data Parallel Bin-Based Indexing for Answering Queries on Multi-core Architectures. SSDBM 2009: 110-129
- 2008
- [j9] Adam Moerschell, John D. Owens: Distributed Texture Memory in a Multi-GPU Environment. Comput. Graph. Forum 27(1): 130-151 (2008)
- [j8] John D. Owens, Mike Houston, David Luebke, Simon Green, John E. Stone, James C. Phillips: GPU Computing. Proc. IEEE 96(5): 879-899 (2008)
- [j7] Anjul Patney, John D. Owens: Real-time Reyes-style adaptive surface subdivision. ACM Trans. Graph. 27(5): 143 (2008)
- [c27] Pinar Muyan-Özçelik, John D. Owens, Junyi Xia, Sanjiv S. Samant: Fast Deformable Registration on the GPU: A CUDA Implementation of Demons. ICCSA Workshops 2008: 223-233
- [c26] Mark Silberstein, Assaf Schuster, Dan Geiger, Anjul Patney, John D. Owens: Efficient computation of sum-products on GPUs through software-managed cache. ICS 2008: 309-318
- [c25] Aaron E. Lefohn, Mike Houston, Chas Boyd, Kayvon Fatahalian, Tom Forsyth, David Luebke, John D. Owens: Beyond programmable shading: fundamentals. SIGGRAPH Classes 2008: 9:1-9:21
- [c24] John D. Owens: Parallel programming models overview. SIGGRAPH Classes 2008: 13:1-13:15
- [e1] David P. Luebke, John D. Owens: Proceedings of the EUROGRAPHICS/ACM SIGGRAPH Conference on Graphics Hardware 2008, Sarajevo, Bosnia and Herzegovina, 2008. Eurographics Association 2008, ISBN 978-3-905674-09-5
- 2007
- [j6] John D. Owens, William J. Dally, Ron Ho, Doddaballapur Narasimha-Murthy Jayasimha, Stephen W. Keckler, Li-Shiuan Peh: Research Challenges for On-Chip Interconnection Networks. IEEE Micro 27(5): 96-108 (2007)
- [j5] Aaron E. Lefohn, Shubhabrata Sengupta, John D. Owens: Resolution-matched shadow maps. ACM Trans. Graph. 26(4): 20 (2007)
- [c23] Shubhabrata Sengupta, Mark J. Harris, Yao Zhang, John D. Owens: Scan primitives for GPU computing. Graphics Hardware 2007: 97-106
- [c22] John D. Owens: GPU architecture overview. SIGGRAPH Courses 2007: 2
- [c21] John D. Owens: Data-parallel algorithms and data structures. SIGGRAPH Courses 2007: 3
- 2006
- [j4] Aaron E. Lefohn, Shubhabrata Sengupta, Joe Kniss, Robert Strzodka, John D. Owens: Glift: Generic, efficient, random-access GPU data structures. ACM Trans. Graph. 25(1): 60-99 (2006)
- [j3] Sung W. Park, Lars Linsen, Oliver Kreylos, John D. Owens, Bernd Hamann: Discrete Sibson Interpolation. IEEE Trans. Vis. Comput. Graph. 12(2): 243-253 (2006)
- [c20] Leo Szumel, John D. Owens: The Virtual Pheromone Communication Primitive. DCOSS 2006: 135-149
- [c19] Adam Moerschell, John D. Owens: Distributed Texture Memory in a Multi-GPU Environment. Graphics Hardware 2006: 31-38
- [c18] David P. Luebke, Mark J. Harris, Naga K. Govindaraju, Aaron E. Lefohn, Mike Houston, John D. Owens, Mark Segal, Matthew Papakipos, Ian Buck: S07 - GPGPU: general-purpose computation on graphics hardware. SC 2006: 208
- 2005
- [c17] John D. Owens, David Luebke, Naga K. Govindaraju, Mark J. Harris, Jens H. Krüger, Aaron E. Lefohn, Timothy J. Purcell: A Survey of General-Purpose Computation on Graphics Hardware. Eurographics (State of the Art Reports) 2005: 21-51
- [c16] John D. Owens: Streaming architectures and technology trends. SIGGRAPH Courses 2005: 9
- [c15] Aaron E. Lefohn, Shubhabrata Sengupta, Joe Kniss, Robert Strzodka, John D. Owens: Dynamic adaptive shadow maps on graphics hardware. SIGGRAPH Sketches 2005: 13
- [c14] Joe Kniss, Aaron E. Lefohn, Robert Strzodka, Shubhabrata Sengupta, John D. Owens: Octree textures on graphics hardware. SIGGRAPH Sketches 2005: 16
- [c13] Aaron E. Lefohn, Ian Buck, Patrick S. McCormick, John D. Owens, Timothy J. Purcell, Robert Strzodka: General Purpose Computation on Graphics Hardware. IEEE Visualization 2005: 121
- 2004
- [c12] Andrew Riffel, Aaron E. Lefohn, Kiril Vidimce, Mark Leone, John D. Owens: Mio: fast multipass partitioning via priority-based instruction scheduling. Graphics Hardware 2004: 35-44
- 2003
- [j2] Ujval J. Kapasi, Scott Rixner, William J. Dally, Brucek Khailany, Jung Ho Ahn, Peter R. Mattson, John D. Owens: Programmable Stream Processors. Computer 36(8): 54-62 (2003)
- [c11] Brucek Khailany, William J. Dally, Scott Rixner, Ujval J. Kapasi, John D. Owens, Brian Towles: Exploring the VLSI Scalability of Stream Processors. HPCA 2003: 153-164
- 2002
- [c10] John D. Owens, Brucek Khailany, Brian Towles, William J. Dally: Comparing Reyes and OpenGL on a Stream Architecture. Graphics Hardware 2002: 47-56
- [c9] Ujval J. Kapasi, William J. Dally, Scott Rixner, John D. Owens, Brucek Khailany: The Imagine Stream Processor. ICCD 2002: 282-288
- [c8] John D. Owens, Scott Rixner, Ujval J. Kapasi, Peter R. Mattson, Brian Towles, Ben Serebrin, William J. Dally: Media Processing Applications on the Imagine Stream Processor. ICCD 2002: 295-302
- [c7] Ben Serebrin, John D. Owens, Chen H. Chen, Stephen P. Crago, Ujval J. Kapasi, Peter R. Mattson, Jinyung Namkoong, Scott Rixner, William J. Dally: A Stream Processor Development Platform. ICCD 2002: 303-
- 2001
- [j1] Brucek Khailany, William J. Dally, Ujval J. Kapasi, Peter R. Mattson, Jinyung Namkoong, John D. Owens, Brian Towles, Andrew Chang, Scott Rixner: Imagine: Media Processing with Streams. IEEE Micro 21(2): 35-46 (2001)
- 2000
- [c6] Peter R. Mattson, William J. Dally, Scott Rixner, Ujval J. Kapasi, John D. Owens: Communication Scheduling. ASPLOS 2000: 82-92
- [c5] John D. Owens, William J. Dally, Ujval J. Kapasi, Scott Rixner, Peter R. Mattson, Ben Mowery: Polygon Rendering on a Stream Architecture. Workshop on Graphics Hardware 2000: 23-32
- [c4] Scott Rixner, William J. Dally, Brucek Khailany, Peter R. Mattson, Ujval J. Kapasi, John D. Owens: Register Organization for Media Processing. HPCA 2000: 375-386
- [c3] Scott Rixner, William J. Dally, Ujval J. Kapasi, Peter R. Mattson, John D. Owens: Memory access scheduling. ISCA 2000: 128-138
- [c2] Ujval J. Kapasi, William J. Dally, Scott Rixner, Peter R. Mattson, John D. Owens, Brucek Khailany: Efficient conditional operations for data-parallel architectures. MICRO 2000: 159-170
1990 – 1999
- 1998
- [c1] Scott Rixner, William J. Dally, Ujval J. Kapasi, Brucek Khailany, Abelardo López-Lagunas, Peter R. Mattson, John D. Owens: A Bandwidth-efficient Architecture for Media Processing. MICRO 1998: 3-13
last updated on 2024-10-24 21:32 CEST by the dblp team
all metadata released as open data under CC0 1.0 license