Остановите войну!
for scientists:
default search action
Madan Musuvathi
- > Home > Persons > Madan Musuvathi
Publications
- 2024
- [c90]Abhinav Jangda, Saeed Maleki, Maryam Mehri Dehnavi, Madan Musuvathi, Olli Saarikivi:
A Framework for Fine-Grained Synchronization of Dependent GPU Kernels. CGO 2024: 93-105 - 2023
- [c89]Meghan Cowan, Saeed Maleki, Madanlal Musuvathi, Olli Saarikivi, Yifan Xiong:
MSCCLang: Microsoft Collective Communication Language. ASPLOS (2) 2023: 502-514 - [c85]Aashaka Shah, Vijay Chidambaram, Meghan Cowan, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi:
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches. NSDI 2023: 593-612 - [i15]Abhinav Jangda, Saeed Maleki, Maryam Mehri Dehnavi, Madan Musuvathi, Olli Saarikivi:
A Framework for Fine-Grained Synchronization of Dependent GPU Kernels. CoRR abs/2305.13450 (2023) - 2022
- [c84]Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi:
Breaking the computation and communication abstraction barrier in distributed machine learning workloads. ASPLOS 2022: 402-416 - [i13]Meghan Cowan, Saeed Maleki, Madanlal Musuvathi, Olli Saarikivi, Yifan Xiong:
MSCCL: Microsoft Collective Communication Library. CoRR abs/2201.11840 (2022) - 2021
- [c80]Gurbinder Gill, Roshan Dathathri, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Olli Saarikivi:
Distributed Training of Embeddings using Graph Analytics. IPDPS 2021: 973-983 - [c79]Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Olli Saarikivi, Tianju Xu, Vadim Eksarevskiy, Jaliya Ekanayake, Emad Barsoum:
Scaling Distributed Training with Adaptive Summation. MLSys 2021 - [c78]Zixian Cai, Zhengyang Liu, Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi:
Synthesizing optimal collective algorithms. PPoPP 2021: 62-75 - [i10]Abhinav Jangda, Jun Huang, Guodong Liu, Amir Hossein Nodehi Sabet, Saeed Maleki, Youshan Miao, Madanlal Musuvathi, Todd Mytkowicz, Olli Saarikivi:
CoCoNet: Co-Optimizing Computation and Communication for Distributed Machine Learning. CoRR abs/2105.05720 (2021) - [i9]Aashaka Shah, Vijay Chidambaram, Meghan Cowan, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi, Rachee Singh:
Synthesizing Collective Communication Algorithms for Heterogeneous Networks with TACCL. CoRR abs/2111.04867 (2021) - 2020
- [i8]Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Olli Saarikivi, Tianju Xu, Vadim Eksarevskiy, Jaliya Ekanayake, Emad Barsoum:
Scaling Distributed Training with Adaptive Summation. CoRR abs/2006.02924 (2020) - [i7]Zixian Cai, Zhengyang Liu, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Jacob Nelson, Olli Saarikivi:
Synthesizing Optimal Collective Algorithms. CoRR abs/2008.08708 (2020) - 2019
- [c73]Roshan Dathathri, Olli Saarikivi, Hao Chen, Kim Laine, Kristin E. Lauter, Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
CHET: an optimizing compiler for fully-homomorphic neural-network inferencing. PLDI 2019: 142-156 - [i5]Gurbinder Gill, Roshan Dathathri, Saeed Maleki, Madan Musuvathi, Todd Mytkowicz, Olli Saarikivi:
Distributed Word2Vec using Graph Analytics Frameworks. CoRR abs/1909.03359 (2019) - 2018
- [c68]Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
Semantics-Preserving Parallelization of Stochastic Gradient Descent. IPDPS 2018: 224-233 - [i3]Roshan Dathathri, Olli Saarikivi, Hao Chen, Kim Laine, Kristin E. Lauter, Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
CHET: Compiler and Runtime for Homomorphic Evaluation of Tensor Programs. CoRR abs/1810.00845 (2018) - 2017
- [i2]Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
Parallel Stochastic Gradient Descent with Sound Combiners. CoRR abs/1705.08030 (2017) - 2016
- [j7]Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
Efficient parallelization using rank convergence in dynamic programming algorithms. Commun. ACM 59(10): 85-92 (2016) - [j6]Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
Low-Rank Methods for Parallelizing Dynamic Programming Algorithms. ACM Trans. Parallel Comput. 2(4): 26:1-26:32 (2016) - [c63]Charith Mendis, Jasha Droppo, Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz, Geoffrey Zweig:
Parallelizing WFST speech decoders. ICASSP 2016: 5325-5329 - 2014
- [c51]Saeed Maleki, Madanlal Musuvathi, Todd Mytkowicz:
Parallelizing dynamic programming through rank convergence. PPoPP 2014: 219-232
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-04-25 23:38 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint