default search action
Pradeep Dubey
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [c58]Nicolas P. D. Sawaya, Daniel Marti-Dafcik, Yang Ho, Daniel P. Tabor, David Esteban Bernal Neira, Alicia B. Magann, Shavindra P. Premaratne, Pradeep Dubey, Anne Y. Matsuura, Nathan Bishop, Wibe A. de Jong, Simon Benjamin, Ojas D. Parekh, Norm M. Tubman, Katherine Klymko, Daan Camps:
HamLib: A Library of Hamiltonians for Benchmarking Quantum Algorithms and Hardware. QCE 2023: 389-390 - [i25]Abhisek Kundu, Naveen K. Mellempudi, Dharma Teja Vooturi, Bharat Kaul, Pradeep Dubey:
AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks. CoRR abs/2304.06941 (2023) - [i24]Bita Darvish Rouhani, Ritchie Zhao, Ankit More, Mathew Hall, Alireza Khodamoradi, Summer Deng, Dhruv Choudhary, Marius Cornea, Eric Dellinger, Kristof Denolf, Dusan Stosic, Venmugil Elango, Maximilian Golub, Alexander Heinecke, Phil James-Roxby, Dharmesh Jani, Gaurav Kolhe, Martin Langhammer, Ada Li, Levi Melnick, Maral Mesmakhosroshahi, Andres Rodriguez, Michael Schulte, Rasoul Shafipour, Lei Shao, Michael Y. Siu, Pradeep Dubey, Paulius Micikevicius, Maxim Naumov, Colin Verilli, Ralph Wittig, Doug Burger, Eric S. Chung:
Microscaling Data Formats for Deep Learning. CoRR abs/2310.10537 (2023) - 2022
- [i23]Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart F. Oberman, Mohammad Shoeybi, Michael Y. Siu, Hao Wu:
FP8 Formats for Deep Learning. CoRR abs/2209.05433 (2022) - 2020
- [c57]Yi-Hsiang Lai, Hongbo Rong, Size Zheng, Weihao Zhang, Xiuping Cui, Yunshan Jia, Jie Wang, Brendan Sullivan, Zhiru Zhang, Yun Liang, Youhui Zhang, Jason Cong, Nithin George, Jose Alvarez, Christopher J. Hughes, Pradeep Dubey:
SuSy: A Programming Model for Productive Construction of High-Performance Systolic Arrays on FPGAs. ICCAD 2020: 73:1-73:9 - [i22]Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Paul Petersen, Jesmin Jahan Tithi, Tim Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich:
Context-Aware Parse Trees. CoRR abs/2003.11118 (2020) - [i21]Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Nesime Tatbul, Jesmin Jahan Tithi, Paul Petersen, Timothy G. Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich:
MISIM: An End-to-End Neural Code Similarity System. CoRR abs/2006.05265 (2020) - [i20]Hongbo Rong, Xiaochen Hao, Yun Liang, Lidong Xu, Hong H. Jiang, Pradeep Dubey:
Systolic Computing on GPUs for Productive Performance. CoRR abs/2010.15884 (2020)
2010 – 2019
- 2019
- [j42]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Shared and Distributed Memory. IEEE Trans. Parallel Distributed Syst. 30(9): 2090-2100 (2019) - [c56]Nitish Kumar Srivastava, Hongbo Rong, Prithayan Barua, Guanyu Feng, Huanqi Cao, Zhiru Zhang, David H. Albonesi, Vivek Sarkar, Wenguang Chen, Paul Petersen, Geoff Lowney, Adam Herr, Christopher J. Hughes, Timothy G. Mattson, Pradeep Dubey:
T2S-Tensor: Productively Generating High-Performance Spatial Hardware for Dense Tensor Computations. FCCM 2019: 181-189 - [i19]Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Eric S. Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros G. Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim M. Hazelwood, Furong Huang, Martin Jaggi, Kevin G. Jamieson, Michael I. Jordan, Gauri Joshi, Rania Khalaf, Jason Knight, Jakub Konecný, Tim Kraska, Arun Kumar, Anastasios Kyrillidis, Jing Li, Samuel Madden, H. Brendan McMahan, Erik Meijer, Ioannis Mitliagkas, Rajat Monga, Derek Gordon Murray, Dimitris S. Papailiopoulos, Gennady Pekhimenko, Theodoros Rekatsinas, Afshin Rostamizadeh, Christopher Ré, Christopher De Sa, Hanie Sedghi, Siddhartha Sen, Virginia Smith, Alex Smola, Dawn Song, Evan Randall Sparks, Ion Stoica, Vivienne Sze, Madeleine Udell, Joaquin Vanschoren, Shivaram Venkataraman, Rashmi Vinayak, Markus Weimer, Andrew Gordon Wilson, Eric P. Xing, Matei Zaharia, Ce Zhang, Ameet Talwalkar:
SysML: The New Frontier of Machine Learning Systems. CoRR abs/1904.03257 (2019) - [i18]Dhiraj D. Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey:
A Study of BFLOAT16 for Deep Learning Training. CoRR abs/1905.12322 (2019) - [i17]Abhisek Kundu, Sudarshan Srinivasan, Eric C. Qin, Dhiraj D. Kalamkar, Naveen K. Mellempudi, Dipankar Das, Kunal Banerjee, Bharat Kaul, Pradeep Dubey:
K-TanH: Hardware Efficient Activations For Deep Learning. CoRR abs/1909.07729 (2019) - 2018
- [j41]Pradeep Dubey, Siddhartha Sahi, Martin Shubik:
Money as minimal complexity. Games Econ. Behav. 108: 432-451 (2018) - [j40]Pradeep Dubey, Siddhartha Sahi, Martin Shubik:
Graphical exchange mechanisms. Games Econ. Behav. 108: 452-465 (2018) - [c55]Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj D. Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesús Corbal, Nikita Shustrov, Roman Dubtsov, Evarist Fomenko, Vadim O. Pirogov:
Mixed Precision Training of Convolutional Neural Networks using Integer Operations. ICLR (Poster) 2018 - [i16]Srinivas Sridharan, Karthikeyan Vaidyanathan, Dhiraj D. Kalamkar, Dipankar Das, Mikhail E. Smorkalov, Mikhail Shiryaev, Dheevatsa Mudigere, Naveen Mellempudi, Sasikanth Avancha, Bharat Kaul, Pradeep Dubey:
On Scale-out Deep Learning Training for Cloud and HPC. CoRR abs/1801.08030 (2018) - [i15]Dipankar Das, Naveen Mellempudi, Dheevatsa Mudigere, Dhiraj D. Kalamkar, Sasikanth Avancha, Kunal Banerjee, Srinivas Sridharan, Karthik Vaidyanathan, Bharat Kaul, Evangelos Georganas, Alexander Heinecke, Pradeep Dubey, Jesús Corbal, Nikita Shustrov, Roman Dubtsov, Evarist Fomenko, Vadim O. Pirogov:
Mixed Precision Training of Convolutional Neural Networks using Integer Operations. CoRR abs/1802.00930 (2018) - 2017
- [c54]Jongsoo Park, Sheng R. Li, Wei Wen, Ping Tak Peter Tang, Hai Li, Yiran Chen, Pradeep Dubey:
Faster CNNs with Direct Sparse Convolutions and Guided Pruning. ICLR (Poster) 2017 - [c53]Swagath Venkataramani, Ashish Ranjan, Subarno Banerjee, Dipankar Das, Sasikanth Avancha, Ashok Jagannathan, Ajaya Durg, Dheemanth Nagaraj, Bharat Kaul, Pradeep Dubey, Anand Raghunathan:
ScaleDeep: A Scalable Compute Architecture for Learning and Evaluating Deep Networks. ISCA 2017: 13-26 - [c52]Pradeep Dubey:
The Quest for The Ultimate Learning Machine. ISPD 2017: 3 - [c51]Thorsten Kurth, Jian Zhang, Nadathur Satish, Evan Racah, Ioannis Mitliagkas, Md. Mostofa Ali Patwary, Tareq M. Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey:
Deep learning at 15PF: supervised and semi-supervised classification for scientific data. SC 2017: 7 - [c50]Brian Friesen, Md. Mostofa Ali Patwary, Brian Austin, Nadathur Satish, Zachary Slepian, Narayanan Sundaram, Deborah Bard, Daniel J. Eisenstein, Jack Deslippe, Pradeep Dubey, Prabhat:
Galactos: computing the anisotropic 3-point correlation function for 2 billion galaxies. SC 2017: 20 - [i14]Naveen Mellempudi, Abhisek Kundu, Dheevatsa Mudigere, Dipankar Das, Bharat Kaul, Pradeep Dubey:
Ternary Neural Networks with Fine-Grained Quantization. CoRR abs/1705.01462 (2017) - [i13]Abhisek Kundu, Kunal Banerjee, Naveen Mellempudi, Dheevatsa Mudigere, Dipankar Das, Bharat Kaul, Pradeep Dubey:
Ternary Residual Networks. CoRR abs/1707.04679 (2017) - [i12]Thorsten Kurth, Jian Zhang, Nadathur Satish, Ioannis Mitliagkas, Evan Racah, Md. Mostofa Ali Patwary, Tareq M. Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey:
Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data. CoRR abs/1708.05256 (2017) - [i11]Brian Friesen, Md. Mostofa Ali Patwary, Brian Austin, Nadathur Satish, Zachary Slepian, Narayanan Sundaram, Deborah Bard, Daniel J. Eisenstein, Jack Deslippe, Pradeep Dubey, Prabhat:
Galactos: Computing the Anisotropic 3-Point Correlation Function for 2 Billion Galaxies. CoRR abs/1709.00086 (2017) - 2016
- [j39]Pradeep Dubey, Siddhartha Sahi:
Eliciting performance: deterministic versus proportional prizes. Int. J. Game Theory 45(1-2): 239-267 (2016) - [j38]Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Md. Mostofa Ali Patwary, Vadim O. Pirogov, Pradeep Dubey, Xing Liu, Carlos Rosales, Cyril Mazauric, Christopher S. Daley:
Optimizations in a high-performance conjugate gradient benchmark for IA-based multi- and many-core processors. Int. J. High Perform. Comput. Appl. 30(1): 11-27 (2016) - [j37]Edmond Chow, Xing Liu, Sanchit Misra, Marat Dukhan, Mikhail Smelyanskiy, Jeff R. Hammond, Yunfei Du, Xiangke Liao, Pradeep Dubey:
Scaling up Hartree-Fock calculations on Tianhe-2. Int. J. High Perform. Comput. Appl. 30(1): 85-102 (2016) - [j36]Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey:
Achieving One Billion Key-Value Requests per Second on a Single Server. IEEE Micro 36(3): 94-104 (2016) - [j35]Arif M. Khan, Alex Pothen, Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Fredrik Manne, Mahantesh Halappanavar, Pradeep Dubey:
Efficient Approximation Algorithms for Weighted b-Matching. SIAM J. Sci. Comput. 38(5) (2016) - [j34]Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey:
Full-Stack Architecting to Achieve a Billion-Requests-Per-Second Throughput on a Single Key-Value Store Server Platform. ACM Trans. Comput. Syst. 34(2): 5:1-5:30 (2016) - [c49]Michael J. Anderson, Narayanan Sundaram, Nadathur Satish, Md. Mostofa Ali Patwary, Theodore L. Willke, Pradeep Dubey:
GraphPad: Optimized Graph Primitives for Parallel and Distributed Platforms. IPDPS 2016: 313-322 - [c48]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter J. Sadowski, Evan Racah, Surendra Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey:
PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures. IPDPS 2016: 494-503 - [c47]Arif M. Khan, Alex Pothen, Md. Mostofa Ali Patwary, Mahantesh Halappanavar, Nadathur Rajagopalan Satish, Narayanan Sundaram, Pradeep Dubey:
Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation. SC 2016: 773-783 - [c46]Alexander Heinecke, Alexander Breuer, Michael Bader, Pradeep Dubey:
High Order Seismic Simulations on the Intel Xeon Phi Processor (Knights Landing). ISC 2016: 343-362 - [c45]Shihao Ji, S. V. N. Vishwanathan, Nadathur Satish, Michael J. Anderson, Pradeep Dubey:
BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies. ICLR 2016 - [i10]Dipankar Das, Sasikanth Avancha, Dheevatsa Mudigere, Karthikeyan Vaidyanathan, Srinivas Sridharan, Dhiraj D. Kalamkar, Bharat Kaul, Pradeep Dubey:
Distributed Deep Learning Using Synchronous Stochastic Gradient Descent. CoRR abs/1602.06709 (2016) - [i9]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Shared and Distributed Memory. CoRR abs/1604.04661 (2016) - [i8]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter J. Sadowski, Evan Racah, Surendra Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey:
PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures. CoRR abs/1607.08220 (2016) - [i7]Jongsoo Park, Sheng R. Li, Wei Wen, Hai Li, Yiran Chen, Pradeep Dubey:
Holistic SparseCNN: Forging the Trident of Accuracy, Speed, and Size. CoRR abs/1608.01409 (2016) - [i6]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Multi-Core and Many-Core Architectures. CoRR abs/1611.06172 (2016) - 2015
- [j33]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Hideki Saito, Rakesh Krishnaiyer, Mikhail Smelyanskiy, Milind Girkar, Pradeep Dubey:
Can traditional programming bridge the ninja performance gap for parallel computing applications? Commun. ACM 58(5): 77-86 (2015) - [j32]R. Glenn Brook, Alexander Heinecke, Anthony B. Costa, Paul Peltz Jr., Vincent C. Betro, Troy Baer, Michael Bader, Pradeep Dubey:
Beacon: Deployment and Application of Intel Xeon Phi Coprocessorsfor Scientific Computing. Comput. Sci. Eng. 17(2): 65-72 (2015) - [j31]Narayanan Sundaram, Nadathur Satish, Md. Mostofa Ali Patwary, Subramanya Dulloor, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey:
GraphMat: High performance graph analytics made productive. Proc. VLDB Endow. 8(11): 1214-1225 (2015) - [c44]Dheevatsa Mudigere, Srinivas Sridharan, Anand M. Deshpande, Jongsoo Park, Alexander Heinecke, Mikhail Smelyanskiy, Bharat Kaul, Pradeep Dubey, Dinesh K. Kaushik, David E. Keyes:
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems. IPDPS 2015: 723-732 - [c43]Sheng Li, Hyeontaek Lim, Victor W. Lee, Jung Ho Ahn, Anuj Kalia, Michael Kaminsky, David G. Andersen, Seongil O, Sukhan Lee, Pradeep Dubey:
Architecting to achieve a billion requests per second throughput on a single key-value store server platform. ISCA 2015: 476-488 - [c42]Md. Mostofa Ali Patwary, Surendra Byna, Nadathur Rajagopalan Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn, Michael J. Anderson, Yushu Yao, Prabhat, Pradeep Dubey:
BD-CATS: big data clustering at trillion particle scale. SC 2015: 6:1-6:12 - [c41]Dominique LaSalle, Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Pradeep Dubey, George Karypis:
Improving graph partitioning for modern graphs and architectures. IA3@SC 2015: 14:1-14:4 - [c40]Jongsoo Park, Mikhail Smelyanskiy, Ulrike Meier Yang, Dheevatsa Mudigere, Pradeep Dubey:
High-performance algebraic multigrid solver optimized for multi-core based distributed parallel systems. SC 2015: 54:1-54:12 - [c39]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jongsoo Park, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Sergey G. Pudov, Vadim O. Pirogov, Pradeep Dubey:
Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms. ISC 2015: 48-57 - [i5]Narayanan Sundaram, Nadathur Rajagopalan Satish, Md. Mostofa Ali Patwary, Subramanya Dulloor, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey:
GraphMat: High performance graph analytics made productive. CoRR abs/1503.07241 (2015) - [i4]Pradeep Dubey:
Decentralization of a Machine: Some Definitions. CoRR abs/1511.06384 (2015) - [i3]Pradeep Dubey, Siddhartha Sahi, Martin Shubik:
Money as Minimal Complexity. CoRR abs/1512.02317 (2015) - [i2]Pradeep Dubey, Siddhartha Sahi, Martin Shubik:
Graphical Exchange Mechanisms. CoRR abs/1512.04637 (2015) - 2014
- [c38]Karthikeyan Vaidyanathan, Kiran Pamnany, Dhiraj D. Kalamkar, Alexander Heinecke, Mikhail Smelyanskiy, Jongsoo Park, Daehyun Kim, Aniruddha G. Shet, Bharat Kaul, Bálint Joó, Pradeep Dubey:
Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters. IPDPS 2014: 1083-1092 - [c37]Kenneth Czechowski, Victor W. Lee, Ed Grochowski, Ronny Ronen, Ronak Singhal, Richard W. Vuduc, Pradeep Dubey:
Improving the energy efficiency of Big Cores. ISCA 2014: 493-504 - [c36]Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, Alice-Agnes Gabriel, Christian Pelties, Arndt Bode, William Barth, Xiangke Liao, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Pradeep Dubey:
Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers. SC 2014: 3-14 - [c35]Simon Heybrock, Bálint Joó, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Tilo Wettig, Pradeep Dubey:
Lattice QCD with Domain Decomposition on Intel® Xeon Phi Co-Processors. SC 2014: 69-80 - [c34]Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey:
Pardicle: Parallel Approximate Density-Based Clustering. SC 2014: 560-571 - [c33]Jongsoo Park, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Dhiraj D. Kalamkar, Xing Liu, Md. Mostofa Ali Patwary, Yutong Lu, Pradeep Dubey:
Efficient Shared-Memory Implementation of High-Performance Conjugate Gradient Benchmark and its Application to Unstructured Matrices. SC 2014: 945-955 - [c32]Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, Muhammad Amber Hassaan, Shubho Sengupta, Zhaoming Yin, Pradeep Dubey:
Navigating the maze of graph analytics frameworks using massive graph datasets. SIGMOD Conference 2014: 979-990 - [c31]Jongsoo Park, Mikhail Smelyanskiy, Narayanan Sundaram, Pradeep Dubey:
Sparsifying Synchronization for High-Performance Shared-Memory Sparse Triangular Solver. ISC 2014: 124-140 - 2013
- [j30]Narayanan Sundaram, Aizana Turmukhametova, Nadathur Satish, Todd Mostak, Piotr Indyk, Samuel Madden, Pradeep Dubey:
Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing. Proc. VLDB Endow. 6(14): 1930-1941 (2013) - [j29]Michael Stonebraker, Sam Madden, Pradeep Dubey:
Intel "big data" science and technology center vision and execution plan. SIGMOD Rec. 42(1): 44-49 (2013) - [c30]Xing Liu, Mikhail Smelyanskiy, Edmond Chow, Pradeep Dubey:
Efficient sparse matrix-vector multiplication on x86-based many-core processors. ICS 2013: 273-282 - [c29]Alexander Heinecke, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Alexander Kobotov, Roman Dubtsov, Greg Henry, Aniruddha G. Shet, George Chrysos, Pradeep Dubey:
Design and Implementation of the Linpack Benchmark for Single and Multi-node Systems Based on Intel® Xeon Phi Coprocessor. IPDPS 2013: 126-137 - [c28]Jongsoo Park, Ganesh Bikshandi, Karthikeyan Vaidyanathan, Ping Tak Peter Tang, Pradeep Dubey, Daehyun Kim:
Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors. SC 2013: 34:1-34:12 - [c27]Bálint Joó, Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Mikhail Smelyanskiy, Kiran Pamnany, Victor W. Lee, Pradeep Dubey, William A. Watson III:
Lattice QCD on Intel® Xeon PhiTM Coprocessors. ISC 2013: 40-54 - 2012
- [j28]Abhinav Golas, Rahul Narain, Jason Sewall, Pavel Krajcevski, Pradeep Dubey, Ming C. Lin:
Large-scale fluid simulation using velocity-vorticity domain decomposition. ACM Trans. Graph. 31(6): 148:1-148:9 (2012) - [c26]Jatin Chhugani, Nadathur Satish, Changkyu Kim, Jason Sewall, Pradeep Dubey:
Fast and Efficient Graph Traversal Algorithm for CPUs: Maximizing Single-Node Efficiency. IPDPS 2012: 378-389 - [c25]Dhiraj D. Kalamkar, Joshua D. Trzasko, Srinivas Sridharan, Mikhail Smelyanskiy, Daehyun Kim, Armando Manduca, Yunhong Shu, Matt A. Bernstein, Bharat Kaul, Pradeep Dubey:
High Performance Non-uniform FFT on Modern X86-based Multi-core Systems. IPDPS 2012: 449-460 - [c24]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Hideki Saito, Rakesh Krishnaiyer, Mikhail Smelyanskiy, Milind Girkar, Pradeep Dubey:
Can traditional programming bridge the Ninja performance gap for parallel computing applications? ISCA 2012: 440-451 - [c23]Victor C. Valgenti, Jatin Chhugani, Yan Sun, Nadathur Satish, Min Sik Kim, Changkyu Kim, Pradeep Dubey:
GPP-Grep: High-Speed Regular Expression Processing Engine on General Purpose Processors. RAID 2012: 334-353 - [c22]Jatin Chhugani, Changkyu Kim, Hemant Shukla, Jongsoo Park, Pradeep Dubey, John Shalf, Horst D. Simon:
Billion-particle SIMD-friendly two-point correlation on large-scale HPC cluster systems. SC 2012: 1 - [c21]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Pradeep Dubey:
Large-scale energy-efficient graph traversal: a path to efficient data-intensive supercomputing. SC 2012: 14 - [c20]Samuel Williams, Dhiraj D. Kalamkar, Amik Singh, Anand M. Deshpande, Brian van Straalen, Mikhail Smelyanskiy, Ann S. Almgren, Pradeep Dubey, John Shalf, Leonid Oliker:
Optimization of geometric multigrid for emerging multi- and manycore processors. SC 2012: 96 - [c19]Mikhail Smelyanskiy, Jason Sewall, Dhiraj D. Kalamkar, Nadathur Satish, Pradeep Dubey, Nikita Astafiev, Ilya Burylov, Andrey Nikolaev, Sergey Maidanov, Shuo Li, Sunil Kulkarni, Charles H. Finan, Ekaterina Gonina:
Analysis and Optimization of Financial Analytics Benchmark on Modern Multi- and Many-core IA-Based Architectures. SC Companion 2012: 1154-1162 - [c18]Changkyu Kim, Jongsoo Park, Nadathur Satish, Hongrae Lee, Pradeep Dubey, Jatin Chhugani:
CloudRAMSort: fast and efficient large-scale distributed RAM sort on shared-nothing cluster. SIGMOD Conference 2012: 841-850 - [p1]Pradeep Dubey:
Emerging Applications. Fundamentals of Multicore Software Development 2012: 1-19 - 2011
- [j27]Michael Deisher, Mikhail Smelyanskiy, Brian Nickerson, Victor W. Lee, Michael Chuvelev, Pradeep Dubey:
Designing and dynamically load balancing hybrid LU for multi/many-core. Comput. Sci. Res. Dev. 26(3-4): 211-220 (2011) - [j26]Daehyun Kim, Joshua Trzasko, Mikhail Smelyanskiy, Clifton Haider, Pradeep Dubey, Armando Manduca:
High-Performance 3D Compressive Sensing MRI Reconstruction Using Many-Core Architectures. Int. J. Biomed. Imaging 2011: 473128:1-473128:11 (2011) - [j25]Jason Sewall, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Pradeep Dubey:
PALM: Parallel Architecture-Friendly Latch-Free Modifications to B+ Trees on Many-Core Processors. Proc. VLDB Endow. 4(11): 795-806 (2011) - [j24]Jens Krüger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier:
Fast Updates on Read-Optimized Databases Using Multi-Core CPUs. Proc. VLDB Endow. 5(1): 61-72 (2011) - [j23]Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, Pradeep Dubey:
Designing fast architecture-sensitive tree search on modern multicore/many-core processors. ACM Trans. Database Syst. 36(4): 22:1-22:34 (2011) - [c17]Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Jee W. Choi, Bálint Joó, Jatin Chhugani, Michael A. Clark, Pradeep Dubey:
High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach. SC 2011: 69:1-69:11 - [c16]Jason Sewall, David Wilkie, Ming C. Lin, Pradeep Dubey:
Interactive hybrid simulation of large-scale traffic. SIGGRAPH Talks 2011: 6 - [c15]Pradeep Dubey, Pat Hanrahan, Ronald Fedkiw, Michael Lentine, Craig A. Schroeder:
PhysBAM: physically based simulation. SIGGRAPH Courses 2011: 10:1-10:22 - [i1]Jens Krüger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier:
Fast Updates on Read-Optimized Databases Using Multi-Core CPUs. CoRR abs/1109.6885 (2011) - 2010
- [j22]Pradeep Dubey, Eric Maskin, Yair Tauman:
A celebration of Robert Aumann's achievements on the occasion of his 80th birthday. Games Econ. Behav. 69(1): 1 (2010) - [j21]Pradeep Dubey, John Geanakoplos:
Grading exams: 100, 99, 98, ... or A, B, C? Games Econ. Behav. 69(1): 72-94 (2010) - [j20]John Geanakoplos, Pradeep Dubey:
Credit cards and inflation. Games Econ. Behav. 70(2): 325-353 (2010) - [c14]Victor W. Lee, Changkyu Kim, Jatin Chhugani, Michael Deisher, Daehyun Kim, Anthony D. Nguyen, Nadathur Satish, Mikhail Smelyanskiy, Srinivas Chennupaty, Per Hammarlund, Ronak Singhal, Pradeep Dubey:
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. ISCA 2010: 451-460 - [c13]Anthony D. Nguyen, Nadathur Satish, Jatin Chhugani, Changkyu Kim, Pradeep Dubey:
3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs. SC 2010: 1-13 - [c12]Stephen J. Guy, Jatin Chhugani, Sean Curtis, Pradeep Dubey, Ming C. Lin, Dinesh Manocha:
PLEdestrians: A Least-Effort Approach to Crowd Simulation. Symposium on Computer Animation 2010: 119-128 - [c11]Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, Pradeep Dubey:
FAST: fast architecture sensitive tree search on modern CPUs and GPUs. SIGMOD Conference 2010: 339-350 - [c10]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Anthony D. Nguyen, Victor W. Lee, Daehyun Kim, Pradeep Dubey:
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort. SIGMOD Conference 2010: 351-362
2000 – 2009
- 2009
- [j19]Pradeep Dubey, Dieter Sondermann:
Perfect competition in an oligopoly (including bilateral monopoly). Games Econ. Behav. 65(1): 124-141 (2009) - [j18]Larry Seiler, Doug Carmean, Eric Sprangle, Tom Forsyth, Pradeep Dubey, Stephen Junkins, Adam T. Lake, Robert Cavin, Roger Espasa, Ed Grochowski, Toni Juan, Michael Abrash, Jeremy Sugerman, Pat Hanrahan:
Larrabee: A Many-Core x86 Architecture for Visual Computing. IEEE Micro 29(1): 10-21 (2009) - [j17]Changkyu Kim, Eric Sedlar, Jatin Chhugani, Tim Kaldewey, Anthony D. Nguyen, Andrea Di Blas, Victor W. Lee, Nadathur Satish, Pradeep Dubey:
Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs. Proc. VLDB Endow. 2(2): 1378-1389 (2009) - [j16]Mikhail Smelyanskiy, David R. Holmes III, Jatin Chhugani, Alan Larson, Doug Carmean, Dennis P. Hanson, Pradeep Dubey, Kurt Augustine, Daehyun Kim, Alan Kyker, Victor W. Lee, Anthony D. Nguyen, Larry Seiler, Richard A. Robb:
Mapping High-Fidelity Volume Rendering for Medical Imaging to CPU, GPU and Many-Core Architectures. IEEE Trans. Vis. Comput. Graph. 15(6): 1563-1570 (2009) - [c9]Ming C. Lin, Stephen J. Guy, Rahul Narain, Jason Sewall, Sachin Patil, Jatin Chhugani, Abhinav Golas, Jur P. van den Berg, Sean Curtis, David Wilkie, Paul Merrell, Changkyu Kim, Nadathur Satish, Pradeep Dubey, Dinesh Manocha