
Kanishka Rao
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2020
- [c23]Kanishka Rao, Chris Harris, Alex Irpan, Sergey Levine, Julian Ibarz, Mohi Khansari:
RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real. CVPR 2020: 11154-11163 - [i15]Kanishka Rao, Chris Harris, Alex Irpan, Sergey Levine, Julian Ibarz, Mohi Khansari:
RL-CycleGAN: Reinforcement Learning Aware Simulation-To-Real. CoRR abs/2006.09001 (2020) - [i14]Daniel Ho, Kanishka Rao, Zhuo Xu, Eric Jang, Mohi Khansari, Yunfei Bai:
RetinaGAN: An Object-aware Approach to Sim-to-Real Transfer. CoRR abs/2011.03148 (2020)
2010 – 2019
- 2019
- [c22]Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385 - [c21]Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Misha Denil, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. INTERSPEECH 2019: 4135-4139 - [c20]Alexander Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine:
Off-Policy Evaluation via Off-Policy Classification. NeurIPS 2019: 5438-5449 - [i13]Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019) - [i12]Alex Irpan, Kanishka Rao, Konstantinos Bousmalis, Chris Harris, Julian Ibarz, Sergey Levine:
Off-Policy Evaluation via Off-Policy Classification. CoRR abs/1906.01624 (2019) - [i11]Swaroop Ramaswamy, Rajiv Mathews, Kanishka Rao, Françoise Beaufays:
Federated Learning for Emoji Prediction in a Mobile Keyboard. CoRR abs/1906.04329 (2019) - 2018
- [c19]Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yanghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model. ICASSP 2018: 4749-4753 - [c18]Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski
, Michiel Bacchiani:
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models. ICASSP 2018: 4774-4778 - [c17]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908 - [c16]Tom Bagby, Kanishka Rao, Khe Chai Sim:
Efficient Implementation of Recurrent Neural Network Transducer in Tensorflow. SLT 2018: 506-512 - [i10]Kanishka Rao, Hasim Sak, Rohit Prabhavalkar:
Exploring Architectures, Data and Units For Streaming End-to-End Speech Recognition with RNN-Transducer. CoRR abs/1801.00841 (2018) - [i9]Brendan Shillingford, Yannis M. Assael, Matthew W. Hoffman, Thomas Paine, Cían Hughes, Utsav Prabhu, Hank Liao, Hasim Sak, Kanishka Rao, Lorrayne Bennett, Marie Mulville, Ben Coppin, Ben Laurie, Andrew W. Senior, Nando de Freitas:
Large-Scale Visual Speech Recognition. CoRR abs/1807.05162 (2018) - [i8]Andrew Hard, Kanishka Rao, Rajiv Mathews, Françoise Beaufays, Sean Augenstein, Hubert Eichner, Chloé Kiddon, Daniel Ramage:
Federated Learning for Mobile Keyboard Prediction. CoRR abs/1811.03604 (2018) - [i7]Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018) - 2017
- [c15]Kanishka Rao, Hasim Sak, Rohit Prabhavalkar:
Exploring architectures, data and units for streaming end-to-end speech recognition with RNN-transducer. ASRU 2017: 193-199 - [c14]Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming small-footprint keyword spotting using sequence-to-sequence models. ASRU 2017: 474-481 - [c13]Kanishka Rao, Hasim Sak:
Multi-accent speech recognition with hierarchical grapheme based models. ICASSP 2017: 4815-4819 - [c12]Rohit Prabhavalkar, Kanishka Rao, Tara N. Sainath, Bo Li, Leif Johnson, Navdeep Jaitly:
A Comparison of Sequence-to-Sequence Models for Speech Recognition. INTERSPEECH 2017: 939-943 - [c11]Hasim Sak, Matt Shannon, Kanishka Rao, Françoise Beaufays:
Recurrent Neural Aligner: An Encoder-Decoder Neural Network Model for Sequence to Sequence Mapping. INTERSPEECH 2017: 1298-1302 - [c10]Antoine Bruguier, Danushen Gnanapragasam, Leif Johnson, Kanishka Rao, Françoise Beaufays:
Pronunciation Learning with RNN-Transducers. INTERSPEECH 2017: 2556-2560 - [c9]Rohit Prabhavalkar, Tara N. Sainath, Bo Li, Kanishka Rao, Navdeep Jaitly:
An Analysis of "Attention" in Sequence-to-Sequence Models. INTERSPEECH 2017: 3702-3706 - [i6]Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw:
Streaming Small-Footprint Keyword Spotting using Sequence-to-Sequence Models. CoRR abs/1710.09617 (2017) - [i5]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017) - [i4]Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model. CoRR abs/1712.01541 (2017) - [i3]Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Katya Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-art Speech Recognition With Sequence-to-Sequence Models. CoRR abs/1712.01769 (2017) - 2016
- [c8]Kanishka Rao, Andrew W. Senior, Hasim Sak:
Flat start training of CD-CTC-SMBR LSTM RNN acoustic models. ICASSP 2016: 5405-5409 - [c7]Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Françoise Beaufays, Carolina Parada:
Personalized speech recognition on mobile devices. ICASSP 2016: 5955-5959 - [c6]Daan van Esch, Mason Chua, Kanishka Rao:
Predicting Pronunciations with Syllabification and Stress with Recurrent Neural Networks. INTERSPEECH 2016: 2841-2845 - [i2]Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, Kanishka Rao, David Rybach, Ouais Alsharif, Hasim Sak, Alexander Gruenstein, Françoise Beaufays, Carolina Parada:
Personalized Speech recognition on mobile devices. CoRR abs/1603.03185 (2016) - 2015
- [c5]Andrew W. Senior, Hasim Sak, Felix de Chaumont Quitry, Tara N. Sainath, Kanishka Rao:
Acoustic modelling with CD-CTC-SMBR LSTM RNNS. ASRU 2015: 604-609 - [c4]Kanishka Rao, Fuchun Peng, Hasim Sak, Françoise Beaufays:
Grapheme-to-phoneme conversion using Long Short-Term Memory recurrent neural networks. ICASSP 2015: 4225-4229 - [c3]Hasim Sak, Andrew W. Senior, Kanishka Rao, Ozan Irsoy, Alex Graves, Françoise Beaufays, Johan Schalkwyk:
Learning acoustic frame labeling for speech recognition with recurrent neural networks. ICASSP 2015: 4280-4284 - [c2]Kanishka Rao, Fuchun Peng, Françoise Beaufays:
Automatic pronunciation verification for speech recognition. ICASSP 2015: 5162-5166 - [c1]Hasim Sak, Andrew W. Senior, Kanishka Rao, Françoise Beaufays:
Fast and accurate recurrent neural network acoustic models for speech recognition. INTERSPEECH 2015: 1468-1472 - [i1]Hasim Sak, Andrew W. Senior, Kanishka Rao, Françoise Beaufays:
Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition. CoRR abs/1507.06947 (2015)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
load content from web.archive.org
Privacy notice: By enabling the option above, your browser will contact the API of web.archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
Tweets on dblp homepage
Show tweets from on the dblp homepage.
Privacy notice: By enabling the option above, your browser will contact twitter.com and twimg.com to load tweets curated by our Twitter account. At the same time, Twitter will persistently store several cookies with your web browser. While we did signal Twitter to not track our users by setting the "dnt" flag, we do not have any control over how Twitter uses your data. So please proceed with care and consider checking the Twitter privacy policy.
last updated on 2021-02-26 23:18 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint