default search action
Kevin J. Shih
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [j4]Guilin Liu, Aysegul Dundar, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Xiaodong Yang, Andrew Tao, Bryan Catanzaro:
Partial Convolution for Padding, Inpainting, and Image Synthesis. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6096-6110 (2023) - [c17]Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro:
Vani: Very-Lightweight Accent-Controllable TTS for Native And Non-Native Speakers With Identity Preservation. ICASSP 2023: 1-2 - [c16]Rafael Valle, João Felipe Santos, Kevin J. Shih, Rohan Badlani, Bryan Catanzaro:
High-Acoustic Fidelity Text To Speech Synthesis With Fine-Grained Control Of Speech Attributes. ICASSP 2023: 1-5 - [c15]Nannan Li, Kevin J. Shih, Bryan A. Plummer:
Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures. ICCV 2023: 7092-7103 - [c14]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
RAD-MMM: Multilingual Multiaccented Multispeaker Text To Speech. INTERSPEECH 2023: 626-630 - [c13]Sungwon Kim, Kevin J. Shih, Rohan Badlani, João Felipe Santos, Evelina Bakhturina, Mikyas Desta, Rafael Valle, Sungroh Yoon, Bryan Catanzaro:
P-Flow: A Fast and Data-Efficient Zero-Shot TTS through Speech Prompting. NeurIPS 2023 - [i21]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
Multilingual Multiaccented Multispeaker TTS with RADTTS. CoRR abs/2301.10335 (2023) - [i20]Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro:
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation. CoRR abs/2303.07578 (2023) - 2022
- [j3]Bryan A. Plummer, Kevin J. Shih, Yichen Li, Ke Xu, Svetlana Lazebnik, Stan Sclaroff, Kate Saenko:
Revisiting Image-Language Networks for Open-Ended Phrase Detection. IEEE Trans. Pattern Anal. Mach. Intell. 44(4): 2155-2167 (2022) - [j2]Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. IEEE Trans. Pattern Anal. Mach. Intell. 44(7): 3883-3894 (2022) - [c12]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment to Rule Them All. ICASSP 2022: 6092-6096 - [i19]Kevin J. Shih, Rafael Valle, Rohan Badlani, João Felipe Santos, Bryan Catanzaro:
Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows. CoRR abs/2203.01786 (2022) - [i18]Nannan Li, Kevin J. Shih, Bryan A. Plummer:
Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures. CoRR abs/2210.01887 (2022) - 2021
- [c11]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. ICLR 2021 - [i17]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment To Rule Them All. CoRR abs/2108.10447 (2021) - 2020
- [i16]Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. CoRR abs/2001.09518 (2020) - [i15]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. CoRR abs/2005.05957 (2020)
2010 – 2019
- 2019
- [c10]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CVPR 2019: 8856-8865 - [c9]Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Parsing. CVPR 2019: 11535-11543 - [c8]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. ICCV 2019: 892-900 - [i14]Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Generation. CoRR abs/1903.02728 (2019) - [i13]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. CoRR abs/1906.05928 (2019) - [i12]Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Video Interpolation and Prediction with Unsupervised Landmarks. CoRR abs/1909.02749 (2019) - 2018
- [c7]Yonatan Bisk, Kevin J. Shih, Yejin Choi, Daniel Marcu:
Learning Interpretable Spatial Operations in a Rich 3D Blocks World. AAAI 2018: 5028-5036 - [c6]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. ECCV (11) 2018: 89-105 - [c5]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDC-Net: Video Prediction Using Spatially-Displaced Convolution. ECCV (7) 2018: 747-763 - [i11]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. CoRR abs/1804.07723 (2018) - [i10]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge. CoRR abs/1811.00662 (2018) - [i9]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDCNet: Video Prediction Using Spatially-Displaced Convolution. CoRR abs/1811.00684 (2018) - [i8]Bryan A. Plummer, Kevin J. Shih, Yichen Li, Ke Xu, Svetlana Lazebnik, Stan Sclaroff, Kate Saenko:
Open-vocabulary Phrase Detection. CoRR abs/1811.07212 (2018) - [i7]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
An Interpretable Model for Scene Graph Generation. CoRR abs/1811.09543 (2018) - [i6]Guilin Liu, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Andrew Tao, Bryan Catanzaro:
Partial Convolution based Padding. CoRR abs/1811.11718 (2018) - [i5]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CoRR abs/1812.01593 (2018) - 2017
- [b1]Kevin J. Shih:
Learning visual tasks with selective attention. University of Illinois Urbana-Champaign, USA, 2017 - [c4]Tanmay Gupta, Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks. ICCV 2017: 4223-4232 - [i4]Tanmay Gupta, Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks. CoRR abs/1704.00260 (2017) - [i3]Yonatan Bisk, Kevin J. Shih, Yejin Choi, Daniel Marcu:
Learning Interpretable Spatial Operations in a Rich 3D Blocks World. CoRR abs/1712.03463 (2017) - 2016
- [c3]Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Where to Look: Focus Regions for Visual Question Answering. CVPR 2016: 4613-4621 - 2015
- [j1]Kevin J. Shih, Ian Endres, Derek Hoiem:
Learning Discriminative Collections of Part Detectors for Object Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 37(8): 1571-1584 (2015) - [c2]Kevin J. Shih, Arun Mallya, Saurabh Singh, Derek Hoiem:
Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization. BMVC 2015: 128.1-128.12 - [i2]Kevin J. Shih, Arun Mallya, Saurabh Singh, Derek Hoiem:
Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization. CoRR abs/1507.06332 (2015) - [i1]Kevin J. Shih, Saurabh Singh, Derek Hoiem:
Where To Look: Focus Regions for Visual Question Answering. CoRR abs/1511.07394 (2015) - 2013
- [c1]Ian Endres, Kevin J. Shih, Johnston Jiaa, Derek Hoiem:
Learning Collections of Part Models for Object Recognition. CVPR 2013: 939-946
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-07-29 22:25 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint