Michael Picheny

Name: dblp XML data dump
Creator: Schloss Dagstuhl - Leibniz Center for Informatics
Published: 1993
License: https://creativecommons.org/publicdomain/zero/1.0/
Keywords: dblp, XML, computer science, scholarly publications, metadata

Joint publications with Kartik Audhkhasi

Publications

2021
[c118]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RouditchenkoBHC21
Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588
2020
[c114]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangK0KAKHP20
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny:
Leveraging Unpaired Text Data for Training End-To-End Speech-to-Intent Systems. ICASSP 2020: 7984-7988
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-09199
Andrew Rouditchenko, Angie W. Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogério Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. CoRR abs/2006.09199 (2020)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04284
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny:
Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems. CoRR abs/2010.04284 (2020)
2019
[c113]
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonTAKPT19
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury, Michael Picheny, Samuel Thomas:
Simplified LSTMs for Speech Recognition. ASRU 2019: 547-553
[c111]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/BoggustAJHTFGZ019
Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogério Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James R. Glass:
Grounding Spoken Words in Unlabeled Video. CVPR Workshops 2019: 29-32
[c110]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SettleALP19
Shane Settle, Kartik Audhkhasi, Karen Livescu, Michael Picheny:
Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition. ICASSP 2019: 5641-5645
[c105]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PichenyTKACS19
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. INTERSPEECH 2019: 326-330
[c102]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiSTKP19
Kartik Audhkhasi, George Saon, Zoltán Tüske, Brian Kingsbury, Michael Picheny:
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition. INTERSPEECH 2019: 2618-2622
[c100]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThomasATHP19
Samuel Thomas, Kartik Audhkhasi, Zoltán Tüske, Yinghui Huang, Michael Picheny:
Detection and Recovery of OOVs for Improved English Broadcast News Captioning. INTERSPEECH 2019: 2973-2977
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1903-12306
Shane Settle, Kartik Audhkhasi, Karen Livescu, Michael Picheny:
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition. CoRR abs/1903.12306 (2019)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-03455
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. CoRR abs/1908.03455 (2019)
2018
[c99]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AudhkhasiKRSP18
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building Competitive Direct Acoustics-to-Word Models for English Conversational Speech Recognition. ICASSP 2018: 4759-4763
2017
[j21]
  persistent URL:
  - https://dblp.org/rec/journals/ibmrd/AudhkhasiRSSRCP17
Kartik Audhkhasi, Andrew Rosenberg, George Saon, Abhinav Sethy, Bhuvana Ramabhadran, Stanley F. Chen, Michael Picheny:
Recent progress in deep end-to-end models for spoken language processing. IBM J. Res. Dev. 61(4-5): 2:1-2:10 (2017)
[c96]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RosenbergASRP17
Andrew Rosenberg, Kartik Audhkhasi, Abhinav Sethy, Bhuvana Ramabhadran, Michael Picheny:
End-to-end speech recognition and keyword search on low-resource languages. ICASSP 2017: 5280-5284
[c95]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonKSATDCRPLRH17
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. INTERSPEECH 2017: 132-136
[c94]
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiRSPN17
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. INTERSPEECH 2017: 959-963
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/SaonKSATDCRPLRH17
George Saon, Gakuto Kurata, Tom Sercu, Kartik Audhkhasi, Samuel Thomas, Dimitrios Dimitriadis, Xiaodong Cui, Bhuvana Ramabhadran, Michael Picheny, Lynn-Li Lim, Bergul Roomi, Phil Hall:
English Conversational Telephone Speech Recognition by Humans and Machines. CoRR abs/1703.02136 (2017)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/AudhkhasiRSPN17
Kartik Audhkhasi, Bhuvana Ramabhadran, George Saon, Michael Picheny, David Nahamoo:
Direct Acoustics-to-Word Models for English Conversational Speech Recognition. CoRR abs/1703.07754 (2017)
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1712-03133
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny:
Building competitive direct acoustics-to-word models for English conversational speech recognition. CoRR abs/1712.03133 (2017)
2015
[c91]
  persistent URL:
  - https://dblp.org/rec/conf/asru/CuiKRSACKMNPTGS15
Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266
