Nikunj Saunshi

2020 – today

2025
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/PanigrahiSLMRKK25
Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar:
Efficient Stagewise Pretraining via Progressive Subnetworks. ICLR 2025
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SaunshiDLKR25
Nikunj Saunshi, Nishanth Dikkala, Zhiyuan Li, Sanjiv Kumar, Sashank J. Reddi:
Reasoning with Latent Thoughts: On the Power of Looped Transformers. ICLR 2025
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-15665
Dylan J. Cutler, Arun Kandoor, Nishanth Dikkala, Nikunj Saunshi, Xin Wang, Rina Panigrahy:
StagFormer: Time Staggering Transformer Decoding for Running Layers In Parallel. CoRR abs/2501.15665 (2025)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-11517
Tian Jin, Ellie Y. Cheng, Zachary Ankner, Nikunj Saunshi, Blake M. Elias, Amir Yazdanbakhsh, Jonathan Ragan-Kelley, Suvinay Subramanian, Michael Carbin:
Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding. CoRR abs/2502.11517 (2025)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-17416
Nikunj Saunshi, Nishanth Dikkala, Zhiyuan Li, Sanjiv Kumar, Sashank J. Reddi:
Reasoning with Latent Thoughts: On the Power of Looped Transformers. CoRR abs/2502.17416 (2025)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-06261
Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit S. Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu, Toby Boyd, Brad Hekman, Aaron Parisi, Chaoyi Zhang, Kornraphop Kawintiranon, Tania Bedrax-Weiss, Oliver Wang, Ya Xu, Ollie Purkiss, Uri Mendlovic, Ilaï Deutel, Nam Nguyen, Adam Langley, Flip Korn, Lucia Rossazza, Alexandre Ramé, Sagar Waghmare, Helen Miller, Nathan Byrd, Ashrith Sheshan, Raia Hadsell, Sangnie Bhardwaj, Pawel Janus, Tero Rissa, Dan Horgan, Sharon Silver, Ayzaan Wahid, Sergey Brin, Yves Raimond, Klemen Kloboves, Cindy Wang, Nitesh Bharadwaj Gundavarapu, Ilia Shumailov, Bo Wang, Mantas Pajarskas, Joe Heyward, Martin Nikoltchev, Maciej Kula, Hao Zhou, Zachary Garrett, Sushant Kafle, Sercan Arik, Ankita Goel, Mingyao Yang, Jiho Park, Koji Kojima, Parsa Mahmoudieh, Koray Kavukcuoglu, Grace Chen, Doug Fritz, Anton Bulyenov, Sudeshna Roy, Dimitris Paparas, Hadar Shemtov, Bo-Juen Chen, Robin Strudel, David Reitter, Aurko Roy, Andrey Vlasov, Changwan Ryu, Chas Leichner, Haichuan Yang, Zelda Mariet, Denis Vnukov, Tim Sohn, Amy Stuart, Wei Liang, Minmin Chen, Praynaa Rawlani, Christy Koh, JD Co-Reyes, Guangda Lai, Praseem Banzal, Dimitrios Vytiniotis, Jieru Mei, Mu Cai:
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities. CoRR abs/2507.06261 (2025)
2024
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/icml/GatmirySRJK24
Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar:
Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning? ICML 2024
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/nips/SaunshiKKMRK24
Nikunj Saunshi, Stefani Karp, Shankar Krishnan, Sobhan Miryoosefi, Sashank Jakkam Reddi, Sanjiv Kumar:
On the Inductive Bias of Stacking Towards Improving Reasoning. NeurIPS 2024
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-05913
Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank J. Reddi, Satyen Kale, Sanjiv Kumar:
Efficient Stagewise Pretraining via Progressive Subnetworks. CoRR abs/2402.05913 (2024)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-02469
Stefani Karp, Nikunj Saunshi, Sobhan Miryoosefi, Sashank J. Reddi, Sanjiv Kumar:
Landscape-Aware Growing: The Power of a Little LAG. CoRR abs/2406.02469 (2024)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-19044
Nikunj Saunshi, Stefani Karp, Shankar Krishnan, Sobhan Miryoosefi, Sashank J. Reddi, Sanjiv Kumar:
On the Inductive Bias of Stacking Towards Improving Reasoning. CoRR abs/2409.19044 (2024)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-08292
Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar:
Can Looped Transformers Learn to Implement Multi-step Gradient Descent for In-context Learning? CoRR abs/2410.08292 (2024)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-18779
Ankit Singh Rawat, Veeranjaneyulu Sadhanala, Afshin Rostamizadeh, Ayan Chakrabarti, Wittawat Jitkrittum, Vladimir Feinberg, Seungyeon Kim, Hrayr Harutyunyan, Nikunj Saunshi, Zachary Nado, Rakesh Shivanna, Sashank J. Reddi, Aditya Krishna Menon, Rohan Anil, Sanjiv Kumar:
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs. CoRR abs/2410.18779 (2024)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-21698
Khashayar Gatmiry, Nikunj Saunshi, Sashank J. Reddi, Stefanie Jegelka, Sanjiv Kumar:
On the Role of Depth and Looping for In-Context Learning with Task Diversity. CoRR abs/2410.21698 (2024)
2023
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/acl/GaurS23
Vedant Gaur, Nikunj Saunshi:
Reasoning in Large Language Models Through Symbolic Math Word Problems. ACL (Findings) 2023: 5889-5903
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SaunshiGBA23
Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora:
Understanding Influence Functions and Datamodels via Harmonic Analysis. ICLR 2023
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/icml/PanigrahiSZA23
Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora:
Task-Specific Skill Localization in Fine-tuned Language Models. ICML 2023: 27011-27033
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-06600
Abhishek Panigrahi, Nikunj Saunshi, Haoyu Zhao, Sanjeev Arora:
Task-Specific Skill Localization in Fine-tuned Language Models. CoRR abs/2302.06600 (2023)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-01906
Vedant Gaur, Nikunj Saunshi:
Reasoning in Large Language Models Through Symbolic Math Word Problems. CoRR abs/2308.01906 (2023)
2022
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0074GSA22
Yi Zhang, Arushi Gupta, Nikunj Saunshi, Sanjeev Arora:
On Predicting Generalization using GANs. ICLR 2022
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SaunshiAGMZAKK22
Nikunj Saunshi, Jordan T. Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham M. Kakade, Akshay Krishnamurthy:
Understanding Contrastive Learning Requires Incorporating Inductive Biases. ICML 2022: 19250-19286
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/nips/GuptaSYLA22
Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora:
New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound. NeurIPS 2022
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-14037
Nikunj Saunshi, Jordan T. Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham M. Kakade, Akshay Krishnamurthy:
Understanding Contrastive Learning Requires Incorporating Inductive Biases. CoRR abs/2202.14037 (2022)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-01072
Nikunj Saunshi, Arushi Gupta, Mark Braverman, Sanjeev Arora:
Understanding Influence Functions and Datamodels via Harmonic Analysis. CoRR abs/2210.01072 (2022)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02912
Arushi Gupta, Nikunj Saunshi, Dingli Yu, Kaifeng Lyu, Sanjeev Arora:
New Definitions and Evaluations for Saliency Methods: Staying Intrinsic, Complete and Sound. CoRR abs/2211.02912 (2022)
2021
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/SaunshiMA21
Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora:
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks. ICLR 2021
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SaunshiGH21
Nikunj Saunshi, Arushi Gupta, Wei Hu:
A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning. ICML 2021: 9333-9343
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeeLSZ21
Jason D. Lee, Qi Lei, Nikunj Saunshi, Jiacheng Zhuo:
Predicting What You Already Know Helps: Provable Self-Supervised Learning. NeurIPS 2021: 309-323
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-15615
Nikunj Saunshi, Arushi Gupta, Wei Hu:
A Representation Learning Perspective on the Importance of Train-Validation Splitting in Meta-Learning. CoRR abs/2106.15615 (2021)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-14212
Yi Zhang, Arushi Gupta, Nikunj Saunshi, Sanjeev Arora:
On Predicting Generalization using GANs. CoRR abs/2111.14212 (2021)
2020
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/icml/AroraDKLS20
Sanjeev Arora, Simon S. Du, Sham M. Kakade, Yuping Luo, Nikunj Saunshi:
Provable Representation Learning for Imitation Learning via Bi-level Optimization. ICML 2020: 367-376
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SaunshiZKA20
Nikunj Saunshi, Yi Zhang, Mikhail Khodak, Sanjeev Arora:
A Sample Complexity Separation between Non-Convex and Convex Meta-Learning. ICML 2020: 8512-8521
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-10544
Sanjeev Arora, Simon S. Du, Sham M. Kakade, Yuping Luo, Nikunj Saunshi:
Provable Representation Learning for Imitation Learning via Bi-level Optimization. CoRR abs/2002.10544 (2020)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-11172
Nikunj Saunshi, Yi Zhang, Mikhail Khodak, Sanjeev Arora:
A Sample Complexity Separation between Non-Convex and Convex Meta-Learning. CoRR abs/2002.11172 (2020)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-01064
Jason D. Lee, Qi Lei, Nikunj Saunshi, Jiacheng Zhuo:
Predicting What You Already Know Helps: Provable Self-Supervised Learning. CoRR abs/2008.01064 (2020)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-03648
Nikunj Saunshi, Sadhika Malladi, Sanjeev Arora:
A Mathematical Exploration of Why Language Models Help Solve Downstream Tasks. CoRR abs/2010.03648 (2020)

2010 – 2019

2019
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/icml/SaunshiPAKK19
Nikunj Saunshi, Orestis Plevrakis, Sanjeev Arora, Mikhail Khodak, Hrishikesh Khandeparkar:
A Theoretical Analysis of Contrastive Unsupervised Representation Learning. ICML 2019: 5628-5637
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1902-09229
Sanjeev Arora, Hrishikesh Khandeparkar, Mikhail Khodak, Orestis Plevrakis, Nikunj Saunshi:
A Theoretical Analysis of Contrastive Unsupervised Representation Learning. CoRR abs/1902.09229 (2019)
2018
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/acl/AroraLMKSS18
Mikhail Khodak, Nikunj Saunshi, Yingyu Liang, Tengyu Ma, Brandon Stewart, Sanjeev Arora:
A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors. ACL (1) 2018: 12-22
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/AroraKSV18
Sanjeev Arora, Mikhail Khodak, Nikunj Saunshi, Kiran Vodrahalli:
A Compressed Sensing View of Unsupervised Text Embeddings, Bag-of-n-Grams, and LSTMs. ICLR (Poster) 2018
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/lrec/KhodakSV18
Mikhail Khodak, Nikunj Saunshi, Kiran Vodrahalli:
A Large Self-Annotated Corpus for Sarcasm. LREC 2018
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-05388
Mikhail Khodak, Nikunj Saunshi, Yingyu Liang, Tengyu Ma, Brandon Stewart, Sanjeev Arora:
A La Carte Embedding: Cheap but Effective Induction of Semantic Feature Vectors. CoRR abs/1805.05388 (2018)
2017
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/KhodakSV17
Mikhail Khodak, Nikunj Saunshi, Kiran Vodrahalli:
A Large Self-Annotated Corpus for Sarcasm. CoRR abs/1704.05579 (2017)
