Nadathur Satish
2020 – today
- 2021
- [j12]Zhaoxia Deng, Jongsoo Park, Ping Tak Peter Tang, Haixin Liu, Jie Yang, Hector Yuen, Jianyu Huang, Daya Shanker Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Nadathur Satish, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy:
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale. IEEE Micro 41(5): 93-100 (2021)
- [i12]Zhaoxia Deng, Jongsoo Park, Ping Tak Peter Tang, Haixin Liu, Jie Yang, Hector Yuen, Jianyu Huang, Daya Shanker Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Nadathur Satish, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy:
Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale. CoRR abs/2105.12676 (2021)
- [i11]Michael J. Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Nadathur Satish, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu, Hector Yuen, Ying Zhang, Aravind Anbudurai, Vandana Balan, Harsha Bojja, Joe Boyd, Matthew Breitbach, Claudio Caldato, Anna Calvo, Garret Catron, Sneh Chandwani, Panos Christeas, Brad Cottel, Brian Coutinho, Arun Dalli, Abhishek Dhanotia, Oniel Duncan, Roman Dzhabarov, Simon Elmir, Chunli Fu, Wenyin Fu, Michael Fulthorp, Adi Gangidi, Nick Gibson, Sean Gordon, Beatriz Padilla Hernandez, Daniel Ho, Yu-Cheng Huang, Olof Johansson, Shishir Juluri, et al.:
First-Generation Inference Accelerator Deployment at Facebook. CoRR abs/2107.04140 (2021)
2010 – 2019
- 2019
- [j11]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Shared and Distributed Memory. IEEE Trans. Parallel Distributed Syst. 30(9): 2090-2100 (2019)
- 2018
- [i10]Nadav Rotem, Jordan Fix, Saleem Abdulrasool, Summer Deng, Roman Dzhabarov, James Hegeman, Roman Levenstein, Bert Maher, Nadathur Satish, Jakob Olesen, Jongsoo Park, Artem Rakhov, Misha Smelyanskiy:
Glow: Graph Lowering Compiler Techniques for Neural Networks. CoRR abs/1805.00907 (2018)
- [i9]Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Shanker Khudia, James Law, Parth Malani, Andrey Malevich, Nadathur Satish, Juan Miguel Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim M. Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao, Nadav Rotem, Sungjoo Yoo, Mikhail Smelyanskiy:
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications. CoRR abs/1811.09886 (2018)
- 2017
- [j10]Michael J. Anderson, Shaden Smith, Narayanan Sundaram, Mihai Capota, Zheguang Zhao, Subramanya Dulloor, Nadathur Satish, Theodore L. Willke:
Bridging the Gap between HPC and Big Data frameworks. Proc. VLDB Endow. 10(8): 901-912 (2017)
- [c40]Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Onur Mutlu, Srinivas Devadas:
Banshee: bandwidth-efficient DRAM caching via software/hardware cooperation. MICRO 2017: 1-14
- [c39]Thorsten Kurth, Jian Zhang, Nadathur Satish, Evan Racah, Ioannis Mitliagkas, Md. Mostofa Ali Patwary, Tareq M. Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey:
Deep learning at 15PF: supervised and semi-supervised classification for scientific data. SC 2017: 7
- [c38]Brian Friesen, Md. Mostofa Ali Patwary, Brian Austin, Nadathur Satish, Zachary Slepian, Narayanan Sundaram, Deborah Bard, Daniel J. Eisenstein, Jack Deslippe, Pradeep Dubey, Prabhat:
Galactos: computing the anisotropic 3-point correlation function for 2 billion galaxies. SC 2017: 20
- [i8]Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Onur Mutlu, Srinivas Devadas:
Banshee: Bandwidth-Efficient DRAM Caching Via Software/Hardware Cooperation. CoRR abs/1704.02677 (2017)
- [i7]Thorsten Kurth, Jian Zhang, Nadathur Satish, Ioannis Mitliagkas, Evan Racah, Md. Mostofa Ali Patwary, Tareq M. Malas, Narayanan Sundaram, Wahid Bhimji, Mikhail Smorkalov, Jack Deslippe, Mikhail Shiryaev, Srinivas Sridharan, Prabhat, Pradeep Dubey:
Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data. CoRR abs/1708.05256 (2017)
- [i6]Brian Friesen, Md. Mostofa Ali Patwary, Brian Austin, Nadathur Satish, Zachary Slepian, Narayanan Sundaram, Deborah Bard, Daniel J. Eisenstein, Jack Deslippe, Pradeep Dubey, Prabhat:
Galactos: Computing the Anisotropic 3-Point Correlation Function for 2 Billion Galaxies. CoRR abs/1709.00086 (2017)
- 2016
- [j9]Arif M. Khan, Alex Pothen, Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Fredrik Manne, Mahantesh Halappanavar, Pradeep Dubey:
Efficient Approximation Algorithms for Weighted b-Matching. SIAM J. Sci. Comput. 38(5) (2016)
- [c37]Subramanya Dulloor, Amitabha Roy, Zheguang Zhao, Narayanan Sundaram, Nadathur Satish, Rajesh Sankaran, Jeff Jackson, Karsten Schwan:
Data tiering in heterogeneous memory systems. EuroSys 2016: 15:1-15:16
- [c36]Michael J. Anderson, Narayanan Sundaram, Nadathur Satish, Md. Mostofa Ali Patwary, Theodore L. Willke, Pradeep Dubey:
GraphPad: Optimized Graph Primitives for Parallel and Distributed Platforms. IPDPS 2016: 313-322
- [c35]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter J. Sadowski, Evan Racah, Surendra Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey:
PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures. IPDPS 2016: 494-503
- [c34]Scott Sallinen, Nadathur Satish, Mikhail Smelyanskiy, Samantika S. Sury, Christopher Ré:
High Performance Parallel Stochastic Gradient Descent in Shared Memory. IPDPS 2016: 873-882
- [c33]Tae Jun Ham, Lisa Wu, Narayanan Sundaram, Nadathur Satish, Margaret Martonosi:
Graphicionado: A high-performance and energy-efficient accelerator for graph analytics. MICRO 2016: 56:1-56:13
- [c32]Arif M. Khan, Alex Pothen, Md. Mostofa Ali Patwary, Mahantesh Halappanavar, Nadathur Rajagopalan Satish, Narayanan Sundaram, Pradeep Dubey:
Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation. SC 2016: 773-783
- [c31]Shihao Ji, S. V. N. Vishwanathan, Nadathur Satish, Michael J. Anderson, Pradeep Dubey:
BlackOut: Speeding up Recurrent Neural Network Language Models With Very Large Vocabularies. ICLR 2016
- [i5]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Shared and Distributed Memory. CoRR abs/1604.04661 (2016)
- [i4]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter J. Sadowski, Evan Racah, Surendra Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey:
PANDA: Extreme Scale Parallel K-Nearest Neighbor on Distributed Architectures. CoRR abs/1607.08220 (2016)
- [i3]Shihao Ji, Nadathur Satish, Sheng Li, Pradeep Dubey:
Parallelizing Word2Vec in Multi-Core and Many-Core Architectures. CoRR abs/1611.06172 (2016)
- 2015
- [j8]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Hideki Saito, Rakesh Krishnaiyer, Mikhail Smelyanskiy, Milind Girkar, Pradeep Dubey:
Can traditional programming bridge the ninja performance gap for parallel computing applications? Commun. ACM 58(5): 77-86 (2015)
- [j7]Narayanan Sundaram, Nadathur Satish, Md. Mostofa Ali Patwary, Subramanya Dulloor, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey:
GraphMat: High performance graph analytics made productive. Proc. VLDB Endow. 8(11): 1214-1225 (2015)
- [c30]Jasper Snoek, Oren Rippel, Kevin Swersky, Ryan Kiros, Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Prabhat, Ryan P. Adams:
Scalable Bayesian Optimization Using Deep Neural Networks. ICML 2015: 2171-2180
- [c29]Xiangyao Yu, Christopher J. Hughes, Nadathur Satish, Srinivas Devadas:
IMP: indirect memory prefetcher. MICRO 2015: 178-190
- [c28]Md. Mostofa Ali Patwary, Surendra Byna, Nadathur Rajagopalan Satish, Narayanan Sundaram, Zarija Lukic, Vadim Roytershteyn, Michael J. Anderson, Yushu Yao, Prabhat, Pradeep Dubey:
BD-CATS: big data clustering at trillion particle scale. SC 2015: 6:1-6:12
- [c27]Dominique LaSalle, Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Pradeep Dubey, George Karypis:
Improving graph partitioning for modern graphs and architectures. IA3@SC 2015: 14:1-14:4
- [c26]Yida Wang, Michael J. Anderson, Jonathan D. Cohen, Alexander Heinecke, Kai Li, Nadathur Satish, Narayanan Sundaram, Nicholas B. Turk-Browne, Theodore L. Willke:
Full correlation matrix analysis of fMRI data on Intel® Xeon Phi™ coprocessors. SC 2015: 23:1-23:12
- [c25]Jasmina Malicevic, Subramanya Dulloor, Narayanan Sundaram, Nadathur Satish, Jeff Jackson, Willy Zwaenepoel:
Exploiting NVM in large-scale graph analytics. INFLOW@SOSP 2015: 2:1-2:9
- [c24]Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jongsoo Park, Michael J. Anderson, Satya Gautam Vadlamudi, Dipankar Das, Sergey G. Pudov, Vadim O. Pirogov, Pradeep Dubey:
Parallel Efficient Sparse Matrix-Matrix Multiplication on Multicore Platforms. ISC 2015: 48-57
- [i2]Narayanan Sundaram, Nadathur Rajagopalan Satish, Md. Mostofa Ali Patwary, Subramanya Dulloor, Satya Gautam Vadlamudi, Dipankar Das, Pradeep Dubey:
GraphMat: High performance graph analytics made productive. CoRR abs/1503.07241 (2015)
- 2014
- [c23]Md. Mostofa Ali Patwary, Nadathur Satish, Narayanan Sundaram, Fredrik Manne, Salman Habib, Pradeep Dubey:
Pardicle: Parallel Approximate Density-Based Clustering. SC 2014: 560-571
- [c22]Rebecca Taft, Manasi Vartak, Nadathur Rajagopalan Satish, Narayanan Sundaram, Samuel Madden, Michael Stonebraker:
GenBase: a complex analytics genomics benchmark. SIGMOD Conference 2014: 177-188
- [c21]Nadathur Satish, Narayanan Sundaram, Md. Mostofa Ali Patwary, Jiwon Seo, Jongsoo Park, Muhammad Amber Hassaan, Shubho Sengupta, Zhaoming Yin, Pradeep Dubey:
Navigating the maze of graph analytics frameworks using massive graph datasets. SIGMOD Conference 2014: 979-990
- 2013
- [j6]Narayanan Sundaram, Aizana Turmukhametova, Nadathur Satish, Todd Mostak, Piotr Indyk, Samuel Madden, Pradeep Dubey:
Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing. Proc. VLDB Endow. 6(14): 1930-1941 (2013)
- 2012
- [j5]Venkatraman Govindaraju, Chen-Han Ho, Tony Nowatzki, Jatin Chhugani, Nadathur Satish, Karthikeyan Sankaralingam, Changkyu Kim:
DySER: Unifying Functionality and Parallelism Specialization for Energy-Efficient Computing. IEEE Micro 32(5): 38-51 (2012)
- [c20]Jatin Chhugani, Nadathur Satish, Changkyu Kim, Jason Sewall, Pradeep Dubey:
Fast and Efficient Graph Traversal Algorithm for CPUs: Maximizing Single-Node Efficiency. IPDPS 2012: 378-389
- [c19]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Hideki Saito, Rakesh Krishnaiyer, Mikhail Smelyanskiy, Milind Girkar, Pradeep Dubey:
Can traditional programming bridge the Ninja performance gap for parallel computing applications? ISCA 2012: 440-451
- [c18]Victor C. Valgenti, Jatin Chhugani, Yan Sun, Nadathur Satish, Min Sik Kim, Changkyu Kim, Pradeep Dubey:
GPP-Grep: High-Speed Regular Expression Processing Engine on General Purpose Processors. RAID 2012: 334-353
- [c17]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Pradeep Dubey:
Large-scale energy-efficient graph traversal: a path to efficient data-intensive supercomputing. SC 2012: 14
- [c16]Mikhail Smelyanskiy, Jason Sewall, Dhiraj D. Kalamkar, Nadathur Satish, Pradeep Dubey, Nikita Astafiev, Ilya Burylov, Andrey Nikolaev, Sergey Maidanov, Shuo Li, Sunil Kulkarni, Charles H. Finan, Ekaterina Gonina:
Analysis and Optimization of Financial Analytics Benchmark on Modern Multi- and Many-core IA-Based Architectures. SC Companion 2012: 1154-1162
- [c15]Changkyu Kim, Jongsoo Park, Nadathur Satish, Hongrae Lee, Pradeep Dubey, Jatin Chhugani:
CloudRAMSort: fast and efficient large-scale distributed RAM sort on shared-nothing cluster. SIGMOD Conference 2012: 841-850
- 2011
- [j4]Jason Sewall, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Pradeep Dubey:
PALM: Parallel Architecture-Friendly Latch-Free Modifications to B+ Trees on Many-Core Processors. Proc. VLDB Endow. 4(11): 795-806 (2011)
- [j3]Jens Krüger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier:
Fast Updates on Read-Optimized Databases Using Multi-Core CPUs. Proc. VLDB Endow. 5(1): 61-72 (2011)
- [j2]Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, Pradeep Dubey:
Designing fast architecture-sensitive tree search on modern multicore/many-core processors. ACM Trans. Database Syst. 36(4): 22:1-22:34 (2011)
- [i1]Jens Krüger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier:
Fast Updates on Read-Optimized Databases Using Multi-Core CPUs. CoRR abs/1109.6885 (2011)
- 2010
- [c14]Victor W. Lee, Changkyu Kim, Jatin Chhugani, Michael Deisher, Daehyun Kim, Anthony D. Nguyen, Nadathur Satish, Mikhail Smelyanskiy, Srinivas Chennupaty, Per Hammarlund, Ronak Singhal, Pradeep Dubey:
Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU. ISCA 2010: 451-460
- [c13]Anthony D. Nguyen, Nadathur Satish, Jatin Chhugani, Changkyu Kim, Pradeep Dubey:
3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs. SC 2010: 1-13
- [c12]Changkyu Kim, Jatin Chhugani, Nadathur Satish, Eric Sedlar, Anthony D. Nguyen, Tim Kaldewey, Victor W. Lee, Scott A. Brandt, Pradeep Dubey:
FAST: fast architecture sensitive tree search on modern CPUs and GPUs. SIGMOD Conference 2010: 339-350
- [c11]Nadathur Satish, Changkyu Kim, Jatin Chhugani, Anthony D. Nguyen, Victor W. Lee, Daehyun Kim, Pradeep Dubey:
Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort. SIGMOD Conference 2010: 351-362
2000 – 2009
- 2009
- [j1]Changkyu Kim, Eric Sedlar, Jatin Chhugani, Tim Kaldewey, Anthony D. Nguyen, Andrea Di Blas, Victor W. Lee, Nadathur Satish, Pradeep Dubey:
Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs. Proc. VLDB Endow. 2(2): 1378-1389 (2009)
- [c10]Nadathur Satish, Narayanan Sundaram, Kurt Keutzer:
Optimizing the use of GPU memory in applications with large data sets. HiPC 2009: 408-418
- [c9]Nadathur Satish, Mark J. Harris, Michael Garland:
Designing efficient sorting algorithms for manycore GPUs. IPDPS 2009: 1-10
- [c8]Ming C. Lin, Stephen J. Guy, Rahul Narain, Jason Sewall, Sachin Patil, Jatin Chhugani, Abhinav Golas, Jur P. van den Berg, Sean Curtis, David Wilkie, Paul Merrell, Changkyu Kim, Nadathur Satish, Pradeep Dubey, Dinesh Manocha:
Interactive Modeling, Simulation and Control of Large-Scale Crowds and Traffic. MIG 2009: 94-103
- [c7]Stephen J. Guy, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Ming C. Lin, Dinesh Manocha, Pradeep Dubey:
ClearPath: highly parallel collision avoidance for multi-agent simulation. Symposium on Computer Animation 2009: 177-187
- 2008
- [c6]Nadathur Satish, Kaushik Ravindran, Kurt Keutzer:
Scheduling task dependence graphs with variable task execution times onto heterogeneous multiprocessors. EMSOFT 2008: 149-158
- 2007
- [c5]Nadathur Satish, Kaushik Ravindran, Kurt Keutzer:
A decomposition-based constraint optimization approach for statically scheduling task graphs with communication delays to multiprocessors. DATE 2007: 57-62
- [c4]Jike Chong, Nadathur Satish, Bryan Catanzaro, Kaushik Ravindran, Kurt Keutzer:
Efficient Parallelization of H.264 Decoding with Macro Block Level Scheduling. ICME 2007: 1874-1877
- 2005
- [c3]Yujia Jin, Nadathur Satish, Kaushik Ravindran, Kurt Keutzer:
An automated exploration framework for FPGA-based soft multiprocessor systems. CODES+ISSS 2005: 273-278
- [c2]Yujia Jin, William Plishker, Kaushik Ravindran, Nadathur Satish, Kurt Keutzer:
Soft multiprocessor systems for network applications (abstract only). FPGA 2005: 271
- [c1]Kaushik Ravindran, Nadathur Satish, Yujia Jin, Kurt Keutzer:
An FPGA-based Soft Multiprocessor System for IPv4 Packet Forwarding. FPL 2005: 487-492
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license