


Остановите войну!
for scientists:


default search action
Kunle Olukotun
Oyekunle A. Olukotun
Person information

- affiliation: Stanford University, USA
- award (2018): Harry H. Goode Memorial Award
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j28]Manya Bansal
, Olivia Hsu
, Kunle Olukotun
, Fredrik Kjolstad
:
Mosaic: An Interoperable Compiler for Tensor Algebra. Proc. ACM Program. Lang. 7(PLDI): 394-419 (2023) - [c137]Tushar Swamy, Annus Zulfiqar
, Luigi Nardi, Muhammad Shahbaz, Kunle Olukotun:
Homunculus: Auto-Generating Efficient Data-Plane ML Pipelines for Datacenter Networks. ASPLOS (3) 2023: 329-342 - [c136]Olivia Hsu, Maxwell Strange, Ritvik Sharma, Jaeyeon Won, Kunle Olukotun, Joel S. Emer, Mark A. Horowitz, Fredrik Kjølstad:
The Sparse Abstract Machine. ASPLOS (3) 2023: 710-726 - [c135]Tian Zhao, Alexander Rucker, Kunle Olukotun:
Sigma: Compiling Einstein Summations to Locality-Aware Dataflow. ASPLOS (2) 2023: 718-732 - [i26]Alexander Rucker, Shiv Sundram, Coleman Smith, Matthew Vilim, Raghu Prabhakar, Fredrik Kjolstad, Kunle Olukotun:
Revet: A Language and Compiler for Dataflow Threads. CoRR abs/2302.06124 (2023) - 2022
- [j27]Luiz André Barroso, Tanzeem Choudhury, Manish Gupta, Oyekunle A. Olukotun, Raluca Ada Popa, Dawn Xiaodong Song, David A. Patterson:
Global perspectives of diversity, equity, and inclusion. Commun. ACM 65(12): 30-31 (2022) - [c134]Tushar Swamy, Alexander Rucker, Muhammad Shahbaz
, Ishan Gaur, Kunle Olukotun
:
Taurus: a data plane architecture for per-packet ML. ASPLOS 2022: 1099-1114 - [c133]Sho Ko, Alexander Rucker, Yaqi Zhang, Paul Mure
, Kunle Olukotun
:
Accelerating SLIDE: Exploiting Sparsity on Accelerator Architectures. IPDPS Workshops 2022: 663-670 - [i25]Matthew Feldman, Tian Zhao, Kunle Olukotun:
Efficient Memory Partitioning in Software Defined Hardware. CoRR abs/2202.01261 (2022) - [i24]Tushar Swamy, Annus Zulfiqar, Luigi Nardi, Muhammad Shahbaz, Kunle Olukotun:
Homunculus: Auto-Generating Efficient Data-Plane ML Pipelines for Datacenter Networks. CoRR abs/2206.05592 (2022) - [i23]Olivia Hsu, Maxwell Strange, Jaeyeon Won, Ritvik Sharma, Kunle Olukotun, Joel S. Emer, Mark Horowitz, Fredrik Kjolstad:
The Sparse Abstract Machine. CoRR abs/2208.14610 (2022) - [i22]Olivia Hsu, Alexander Rucker, Tian Zhao, Kunle Olukotun, Fredrik Kjolstad:
Stardust: Compiling Sparse Tensor Algebra to a Reconfigurable Dataflow Architecture. CoRR abs/2211.03251 (2022) - [i21]Erik Hellsten, Artur L. F. Souza, Johannes Lenfers, Rubens Lacouture, Olivia Hsu, Adel Ejjeh, Fredrik Kjolstad, Michel Steuwer, Kunle Olukotun, Luigi Nardi:
BaCO: A Fast and Portable Bayesian Compiler Optimization Framework. CoRR abs/2212.11142 (2022) - 2021
- [j26]Alexander Rucker
, Muhammad Shahbaz
, Kunle Olukotun
:
Chopping off the Tail: Bounded Non-Determinism for Real-Time Accelerators. IEEE Comput. Archit. Lett. 20(2): 110-113 (2021) - [j25]Rawn Henry, Olivia Hsu
, Rohan Yadav
, Stephen Chou, Kunle Olukotun
, Saman P. Amarasinghe, Fredrik Kjolstad
:
Compilation of sparse array programming models. Proc. ACM Program. Lang. 5(OOPSLA): 1-29 (2021) - [c132]Kunle Olukotun:
"Let the Data Flow!". CIDR 2021 - [c131]Nathan Zhang, Matthew Feldman, Kunle Olukotun
:
High performance lattice regression on FPGAs via a high level hardware description language. FPT 2021: 1-10 - [c130]Matthew Vilim, Alexander Rucker, Kunle Olukotun
:
Aurochs: An Architecture for Dataflow Threads. ISCA 2021: 402-415 - [c129]Yaqi Zhang, Nathan Zhang, Tian Zhao, Matt Vilim, Muhammad Shahbaz
, Kunle Olukotun
:
SARA: Scaling a Reconfigurable Dataflow Accelerator. ISCA 2021: 1041-1054 - [c128]Alexander Rucker, Matthew Vilim, Tian Zhao, Yaqi Zhang, Raghu Prabhakar, Kunle Olukotun:
Capstan: A Vector RDA for Sparsity. MICRO 2021: 1022-1035 - [c127]Artur L. F. Souza, Luigi Nardi, Leonardo B. Oliveira, Kunle Olukotun
, Marius Lindauer, Frank Hutter:
Bayesian Optimization with a Prior for the Optimum. ECML/PKDD (3) 2021: 265-296 - [i20]Alexander Rucker, Matthew Vilim, Tian Zhao, Yaqi Zhang, Raghu Prabhakar, Kunle Olukotun:
Capstan: A Vector RDA for Sparsity. CoRR abs/2104.12760 (2021) - 2020
- [c126]Matthew Vilim, Alexander Rucker, Yaqi Zhang, Sophia Liu, Kunle Olukotun:
Gorgon: Accelerating Machine Learning from Relational Data. ISCA 2020: 309-321 - [i19]Tushar Swamy, Alexander Rucker, Muhammad Shahbaz, Kunle Olukotun:
Taurus: An Intelligent Data Plane. CoRR abs/2002.08987 (2020) - [i18]Artur L. F. Souza, Luigi Nardi, Leonardo B. Oliveira, Kunle Olukotun, Marius Lindauer, Frank Hutter:
Prior-guided Bayesian Optimization. CoRR abs/2006.14608 (2020)
2010 – 2019
- 2019
- [j24]Cody Coleman, Daniel Kang
, Deepak Narayanan, Luigi Nardi, Tian Zhao, Jian Zhang, Peter Bailis, Kunle Olukotun, Christopher Ré, Matei Zaharia:
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark. ACM SIGOPS Oper. Syst. Rev. 53(1): 14-25 (2019) - [c125]Alexander Rucker, Muhammad Shahbaz, Tushar Swamy, Kunle Olukotun
:
Elastic RSS: Co-Scheduling Packets and Cores Using Programmable NICs. APNet 2019: 71-77 - [c124]Stefan Hadjis, Kunle Olukotun
:
TensorFlow to Cloud FPGAs: Tradeoffs for Accelerating Deep Neural Networks. FPL 2019: 360-366 - [c123]Rekha Singhal, Nathan Zhang, Luigi Nardi, Muhammad Shahbaz, Kunle Olukotun
:
Polystore++: Accelerated Polystore System for Heterogeneous Workloads. ICDCS 2019: 1641-1651 - [c122]Yaqi Zhang, Alexander Rucker, Matthew Vilim, Raghu Prabhakar, William Hwang, Kunle Olukotun
:
Scalable interconnects for reconfigurable spatial architectures. ISCA 2019: 615-628 - [c121]Luigi Nardi, David Koeplinger, Kunle Olukotun:
Practical Design Space Exploration. MASCOTS 2019: 347-358 - [c120]Luigi Nardi, Artur L. F. Souza, David Koeplinger, Kunle Olukotun:
HyperMapper: a Practical Design Space Exploration Framework. MASCOTS 2019: 425-426 - [c119]Tian Zhao, Yaqi Zhang, Kunle Olukotun:
Serving Recurrent Neural Networks Efficiently with a Spatial Accelerator. MLSys 2019 - [c118]Rekha Singhal, Yaqi Zhang, Jeffrey D. Ullman, Raghu Prabhakar, Kunle Olukotun:
Efficient Multiway Hash Join on Reconfigurable Hardware. TPCTC 2019: 19-38 - [i17]Artur L. F. Souza, Leonardo B. Oliveira, Sabine Hollatz, Matthew Feldman, Kunle Olukotun, James M. Holton, Aina E. Cohen, Luigi Nardi:
DeepFreak: Learning Crystallography Diffraction Patterns with Automated Machine Learning. CoRR abs/1904.11834 (2019) - [i16]Rekha Singhal, Nathan Zhang, Luigi Nardi, Muhammad Shahbaz, Kunle Olukotun:
Polystore++: Accelerated Polystore System for Heterogeneous Workloads. CoRR abs/1905.10336 (2019) - [i15]Kunle Olukotun, Raghu Prabhakar, Rekha Singhal, Jeffrey D. Ullman, Yaqi Zhang:
Efficient Multiway Hash Join on Reconfigurable Hardware. CoRR abs/1905.13376 (2019) - [i14]Tian Zhao, Yaqi Zhang, Kunle Olukotun:
Serving Recurrent Neural Networks Efficiently with a Spatial Accelerator. CoRR abs/1909.13654 (2019) - 2018
- [j23]Raghu Prabhakar, Yaqi Zhang
, David Koeplinger, Matthew Feldman, Tian Zhao, Stefan Hadjis, Ardavan Pedram, Christos Kozyrakis, Kunle Olukotun:
Plasticine: A Reconfigurable Accelerator for Parallel Patterns. IEEE Micro 38(3): 20-31 (2018) - [c117]Christopher R. Aberger, Andrew Lamb, Kunle Olukotun
, Christopher Ré:
LevelHeaded: A Unified Engine for Business Intelligence and Linear Algebra Querying. ICDE 2018: 449-460 - [c116]Grégory M. Essertel, Ruby Y. Tahboub, James M. Decker, Kevin J. Brown, Kunle Olukotun, Tiark Rompf:
Flare: Optimizing Apache Spark with Native Compilation for Scale-Up Architectures and Medium-Size Data. OSDI 2018: 799-815 - [c115]David Koeplinger, Matthew Feldman, Raghu Prabhakar, Yaqi Zhang, Stefan Hadjis, Ruben Fiszel, Tian Zhao, Luigi Nardi, Ardavan Pedram, Christos Kozyrakis, Kunle Olukotun:
Spatial: a language and compiler for application accelerators. PLDI 2018: 296-311 - [c114]Jian Zhang, Max Lam, Stephanie Wang, Paroma Varma, Luigi Nardi, Kunle Olukotun
, Christopher Ré:
Exploring the Utility of Developer Exhaust. DEEM@SIGMOD 2018: 7:1-7:7 - [i13]Christopher De Sa, Megan Leszczynski, Jian Zhang, Alana Marzoev, Christopher R. Aberger, Kunle Olukotun, Christopher Ré:
High-Accuracy Low-Precision Training. CoRR abs/1803.03383 (2018) - [i12]Cody Coleman, Daniel Kang, Deepak Narayanan, Luigi Nardi, Tian Zhao, Jian Zhang, Peter Bailis, Kunle Olukotun, Christopher Ré, Matei Zaharia:
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark. CoRR abs/1806.01427 (2018) - [i11]Luigi Nardi, David Koeplinger, Kunle Olukotun:
Practical Design Space Exploration. CoRR abs/1810.05236 (2018) - 2017
- [j22]Christopher R. Aberger, Andrew Lamb, Kunle Olukotun, Christopher Ré:
Mind the Gap: Bridging Multi-Domain Query Workloads with EmptyHeaded. Proc. VLDB Endow. 10(12): 1849-1852 (2017) - [j21]Christopher R. Aberger, Andrew Lamb, Susan Tu, Andres Nötzli
, Kunle Olukotun
, Christopher Ré:
EmptyHeaded: A Relational Engine for Graph Processing. ACM Trans. Database Syst. 42(4): 20:1-20:44 (2017) - [c113]Christopher De Sa, Kunle Olukotun, Christopher Ré:
Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. IJCAI 2017: 4811-4815 - [c112]Raghu Prabhakar, Yaqi Zhang
, David Koeplinger, Matthew Feldman, Tian Zhao, Stefan Hadjis, Ardavan Pedram, Christos Kozyrakis, Kunle Olukotun
:
Plasticine: A Reconfigurable Architecture For Parallel Paterns. ISCA 2017: 389-402 - [c111]Christopher De Sa, Matthew Feldman, Christopher Ré, Kunle Olukotun:
Understanding and Optimizing Asynchronous Low-Precision Stochastic Gradient Descent. ISCA 2017: 561-574 - [i10]Grégory M. Essertel, Ruby Y. Tahboub, James M. Decker, Kevin J. Brown, Kunle Olukotun, Tiark Rompf:
Flare: Native Compilation for Heterogeneous Workloads in Apache Spark. CoRR abs/1703.08219 (2017) - [i9]Peter Bailis, Kunle Olukotun, Christopher Ré, Matei Zaharia:
Infrastructure for Usable Machine Learning: The Stanford DAWN Project. CoRR abs/1705.07538 (2017) - [i8]Christopher R. Aberger, Andrew Lamb, Kunle Olukotun, Christopher Ré:
LevelHeaded: Making Worst-Case Optimal Joins Work in the Common Case. CoRR abs/1708.07859 (2017) - 2016
- [c110]Kunle Olukotun:
Scaling Data Analytics with Moore's Law. PACT 2016: 313 - [c109]Raghu Prabhakar, David Koeplinger, Kevin J. Brown, HyoukJoong Lee, Christopher De Sa, Christos Kozyrakis, Kunle Olukotun:
Generating Configurable Hardware from Parallel Patterns. ASPLOS 2016: 651-665 - [c108]Kevin J. Brown, HyoukJoong Lee, Tiark Rompf, Arvind K. Sujeeth, Christopher De Sa, Christopher R. Aberger, Kunle Olukotun:
Have abstraction and eat performance, too: optimized heterogeneous computing with parallel patterns. CGO 2016: 194-205 - [c107]Tayo Oguntebi, Kunle Olukotun:
GraphOps: A Dataflow Library for Graph Analytics Acceleration. FPGA 2016: 111-117 - [c106]Christopher R. Aberger, Susan Tu, Kunle Olukotun, Christopher Ré:
Old techniques for new join algorithms: A case study in RDF processing. ICDE Workshops 2016: 97-102 - [c105]Christopher De Sa, Christopher Ré, Kunle Olukotun:
Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. ICML 2016: 1567-1576 - [c104]David Koeplinger, Raghu Prabhakar, Yaqi Zhang
, Christina Delimitrou, Christos Kozyrakis, Kunle Olukotun:
Automatic Generation of Efficient Accelerators for Reconfigurable Hardware. ISCA 2016: 115-127 - [c103]Christopher R. Aberger, Susan Tu, Kunle Olukotun
, Christopher Ré:
EmptyHeaded: A Relational Engine for Graph Processing. SIGMOD Conference 2016: 431-446 - [i7]Christopher R. Aberger, Susan Tu, Kunle Olukotun, Christopher Ré:
Old Techniques for New Join Algorithms: A Case Study in RDF Processing. CoRR abs/1602.03557 (2016) - [i6]Christopher De Sa, Kunle Olukotun, Christopher Ré:
Ensuring Rapid Mixing and Low Bias for Asynchronous Gibbs Sampling. CoRR abs/1602.07415 (2016) - 2015
- [j20]Mohamed M. Sabry, Mingyu Gao, Gage Hills, Chi-Shuen Lee, Greg Pitner, Max M. Shulaker, Tony F. Wu, Mehdi Asheghi, Jeffrey Bokor, Franz Franchetti, Kenneth E. Goodson, Christos Kozyrakis, Igor L. Markov, Kunle Olukotun, Larry T. Pileggi, Eric Pop, Jan M. Rabaey, Christopher Ré, H.-S. Philip Wong, Subhasish Mitra
:
Energy-Efficient Abundant-Data Computing: The N3XT 1, 000x. Computer 48(12): 24-33 (2015) - [c102]Lawrence C. McAfee, Kunle Olukotun:
EMEURO: a framework for generating multi-purpose accelerators via deep learning. CGO 2015: 125-135 - [c101]Nithin George, HyoukJoong Lee, David Novo, Muhsen Owaida, David Andrews
, Kunle Olukotun, Paolo Ienne:
Automatic support for multi-module parallelism from computational patterns. FPL 2015: 1-8 - [c100]Christopher De Sa, Christopher Ré, Kunle Olukotun:
Global Convergence of Stochastic Gradient Descent for Some Non-convex Matrix Problems. ICML 2015: 2332-2341 - [c99]Christopher De Sa, Ce Zhang, Kunle Olukotun, Christopher Ré:
Taming the Wild: A Unified Analysis of Hogwild-Style Algorithms. NIPS 2015: 2674-2682 - [c98]Christopher De Sa, Ce Zhang, Kunle Olukotun, Christopher Ré:
Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width. NIPS 2015: 3097-3105 - [c97]Tiark Rompf, Kevin J. Brown, HyoukJoong Lee, Arvind K. Sujeeth, Manohar Jonnalagedda, Nada Amin, Georg Ofenbeck, Alen Stojanov, Yannis Klonatos, Mohammad Dashti, Christoph Koch, Markus Püschel, Kunle Olukotun:
Go Meta! A Case for Generative Programming and DSLs in Performance Critical Systems. SNAPL 2015: 238-261 - [e3]Kunle Olukotun, Aaron Smith, Robert Hundt, Jason Mars:
Proceedings of the 13th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2015, San Francisco, CA, USA, February 07 - 11, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-8161-8 [contents] - [i5]Christopher R. Aberger, Andres Nötzli, Kunle Olukotun, Christopher Ré:
EmptyHeaded: Boolean Algebra Based Graph Processing. CoRR abs/1503.02368 (2015) - [i4]Christopher De Sa, Ce Zhang, Kunle Olukotun, Christopher Ré:
Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms. CoRR abs/1506.06438 (2015) - [i3]Christopher De Sa, Ce Zhang, Kunle Olukotun, Christopher Ré:
Rapidly Mixing Gibbs Sampling for a Class of Factor Graphs Using Hierarchy Width. CoRR abs/1510.00756 (2015) - [i2]Raghu Prabhakar, David Koeplinger, Kevin J. Brown, HyoukJoong Lee, Christopher De Sa, Christos Kozyrakis, Kunle Olukotun:
Generating Configurable Hardware from Parallel Patterns. CoRR abs/1511.06968 (2015) - 2014
- [j19]Alba de Melo
, Jean-Luc Gaudiot, Luiz De Rose, Kunle Olukotun, Albert Y. Zomaya
:
Guest Editorial. Int. J. Parallel Program. 42(1): 1-3 (2014) - [j18]Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Tiark Rompf, Hassan Chafi, Martin Odersky, Kunle Olukotun
:
Delite: A Compiler Architecture for Performance-Oriented Embedded Domain-Specific Languages. ACM Trans. Embed. Comput. Syst. 13(4s): 134:1-134:25 (2014) - [c96]Sungpack Hong, Semih Salihoglu, Jennifer Widom, Kunle Olukotun:
Simplifying Scalable Graph Processing with a Domain-Specific Language. CGO 2014: 208 - [c95]Jared Casper, Kunle Olukotun:
Hardware acceleration of database operations. FPGA 2014: 151-160 - [c94]Nithin George, HyoukJoong Lee, David Novo, Tiark Rompf, Kevin J. Brown, Arvind K. Sujeeth, Martin Odersky, Kunle Olukotun, Paolo Ienne:
Hardware system synthesis from Domain-Specific Languages. FPL 2014: 1-8 - [c93]Kunle Olukotun, Lance Hammond, Mark Willey:
Author's retrospective for: improving the performance of speculatively parallel applications on the hydra CMP. ICS 25th Anniversary 2014: 51-53 - [c92]HyoukJoong Lee, Kevin J. Brown, Arvind K. Sujeeth, Tiark Rompf, Kunle Olukotun:
Locality-Aware Mapping of Nested Parallel Patterns on GPUs. MICRO 2014: 63-74 - [c91]Tiark Rompf, Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Hassan Chafi, Kunle Olukotun
:
Surgical precision JIT compilers. PLDI 2014: 41-52 - [c90]Kunle Olukotun
:
Beyond parallel programming with domain specific languages. PPoPP 2014: 179-180 - [i1]Christopher De Sa, Kunle Olukotun, Christopher Ré:
Global Convergence of Stochastic Gradient Descent for Some Nonconvex Matrix Problems. CoRR abs/1411.1134 (2014) - 2013
- [c89]Arvind K. Sujeeth, Tiark Rompf, Kevin J. Brown, HyoukJoong Lee, Hassan Chafi, Victoria Popic, Michael Wu, Aleksandar Prokopec
, Vojin Jovanovic, Martin Odersky, Kunle Olukotun:
Composition and Reuse with Compiled Domain-Specific Languages. ECOOP 2013: 52-78 - [c88]Arvind K. Sujeeth, Austin Gibbons, Kevin J. Brown, HyoukJoong Lee, Tiark Rompf, Martin Odersky, Kunle Olukotun
:
Forge: generating a high performance DSL implementation from a declarative specification. GPCE 2013: 145-154 - [c87]Tiark Rompf, Arvind K. Sujeeth, Nada Amin, Kevin J. Brown, Vojin Jovanovic, HyoukJoong Lee, Manohar Jonnalagedda, Kunle Olukotun
, Martin Odersky:
Optimizing data structures in high-level programs: new directions for extensible compilers based on staging. POPL 2013: 497-510 - [c86]Sungpack Hong, Nicole C. Rodia, Kunle Olukotun:
On fast parallel detection of strongly connected components (SCC) in small-world graphs. SC 2013: 92:1-92:11 - 2012
- [c85]Sungpack Hong, Hassan Chafi, Eric Sedlar, Kunle Olukotun:
Green-Marl: a DSL for easy and efficient graph analysis. ASPLOS 2012: 349-362 - [c84]Sungpack Hong, Tayo Oguntebi, Jared Casper, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun:
A case of system-level hardware/software co-design and co-verification of a commodity multi-processor system with custom hardware. CODES+ISSS 2012: 513-520 - [c83]Kunle Olukotun
:
High performance embedded domain specific languages. ICFP 2012: 139-140 - [c82]Lawrence C. McAfee, Kunle Olukotun:
Utilizing Static Analysis and Code Generation to Accelerate Neural Networks. ICML 2012 - 2011
- [j17]HyoukJoong Lee, Kevin J. Brown, Arvind K. Sujeeth, Hassan Chafi, Tiark Rompf, Martin Odersky, Kunle Olukotun:
Implementing Domain-Specific Languages for Heterogeneous Parallel Computing. IEEE Micro 31(5): 42-53 (2011) - [c81]Sungpack Hong, Tayo Oguntebi, Kunle Olukotun:
Efficient Parallel Graph Exploration on Multi-Core CPU and GPU. PACT 2011: 78-88 - [c80]Kevin J. Brown, Arvind K. Sujeeth, HyoukJoong Lee, Tiark Rompf, Hassan Chafi, Martin Odersky, Kunle Olukotun:
A Heterogeneous Parallel Framework for Domain-Specific Languages. PACT 2011: 89-100 - [c79]Jared Casper, Tayo Oguntebi, Sungpack Hong, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun
:
Hardware acceleration of transactional memory on commodity systems. ASPLOS 2011: 27-38 - [c78]Ben Hertzberg, Kunle Olukotun:
Runtime automatic speculative parallelization. CGO 2011: 64-73 - [c77]Arvind K. Sujeeth, HyoukJoong Lee, Kevin J. Brown, Tiark Rompf, Hassan Chafi, Michael Wu, Anand R. Atreya, Martin Odersky, Kunle Olukotun:
OptiML: An Implicitly Parallel Domain-Specific Language for Machine Learning. ICML 2011: 609-616 - [c76]Per Stenström, Doug Burger, Wen-mei W. Hwu, Vipin Kumar, Kunle Olukotun, David A. Padua, Burton Smith:
Panel Statement. IPDPS 2011: 877 - [c75]Hassan Chafi, Arvind K. Sujeeth, Kevin J. Brown, HyoukJoong Lee, Anand R. Atreya, Kunle Olukotun:
A domain-specific approach to heterogeneous parallelism. PPoPP 2011: 35-46 - [c74]Sungpack Hong, Sang Kyun Kim, Tayo Oguntebi, Kunle Olukotun:
Accelerating CUDA graph algorithms at maximum warp. PPoPP 2011: 267-276 - [c73]Tiark Rompf, Arvind K. Sujeeth, HyoukJoong Lee, Kevin J. Brown, Hassan Chafi, Martin Odersky, Kunle Olukotun:
Building-Blocks for Performance Oriented DSLs. DSL 2011: 93-117 - 2010
- [j16]Bryan Catanzaro, Armando Fox, Kurt Keutzer, David A. Patterson, Bor-Yiing Su, Marc Snir, Kunle Olukotun, Pat Hanrahan, Hassan Chafi:
Ubiquitous Parallel Computing from Berkeley, Illinois, and Stanford. IEEE Micro 30(2): 41-55 (2010) - [c72]Xiaobo Sharon Hu
, Richard C. Murphy, Sudip S. Dosanjh, Kunle Olukotun, Stephen Poole:
Hardware/software co-design for high performance computing: challenges and opportunities. CODES+ISSS 2010: 63-64 - [c71]Sang Kyun Kim, Peter Leonard McMahon, Kunle Olukotun:
A Large-Scale Architecture for Restricted Boltzmann Machines. FCCM 2010: 201-208 - [c70]Tayo Oguntebi, Sungpack Hong, Jared Casper, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun:
FARM: A Prototyping Environment for Tightly-Coupled, Heterogeneous Architectures. FCCM 2010: 221-228 - [c69]Josep Torrellas, Bill Gropp, Vivek Sarkar, Jaime H. Moreno, Kunle Olukotun:
Extreme scale computing: Challenges and opportunities. HPCA 2010: 1 - [c68]Woongki Baek, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun:
Implementing and Evaluating a Model Checker for Transactional Memory Systems. ICECCS 2010: 117-126 - [c67]Woongki Baek, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun:
Making nested parallel transactions practical using lightweight hardware support. ICS 2010: 61-71 - [c66]Sungpack Hong, Tayo Oguntebi, Jared Casper, Nathan Grasso Bronson, Christos Kozyrakis, Kunle Olukotun:
Eigenbench: A simple exploration tool for orthogonal TM characteristics. IISWC 2010: 1-11 - [c65]