


Остановите войну!
for scientists:
Jim Glass
James R. Glass
Person information

- affiliation: Massachusetts Institute of Technology (MIT), CSAIL, Cambridge, MA, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2022
- [j35]Kevin P. Schneider
, Jim Glass
, Cecilia Klauber
, Thomas Ben Ollis
, Matthew J. Reno
, Michael Burck
, Lelic Muhidin, Anamika Dubey
, Wei Du
, Thanh Long Vu
, Jing Xie
, David Nordy
, William Dawson, Javier Hernandez-Alvidrez, Anjan Bose
, Dan Ton, Guohui Yuan
:
A Framework for Coordinated Self-Assembly of Networked Microgrids Using Consensus Algorithms. IEEE Access 10: 3864-3878 (2022) - [c305]Alexander H. Liu, SouYoung Jin, Cheng-I Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. ACL (1) 2022: 3013-3035 - [c304]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. ACL (Findings) 2022: 3291-3306 - [i117]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. CoRR abs/2203.01146 (2022) - [i116]Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James R. Glass:
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification. CoRR abs/2203.06760 (2022) - [i115]Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. CoRR abs/2204.02524 (2022) - [i114]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. CoRR abs/2204.10298 (2022) - [i113]Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. CoRR abs/2205.03432 (2022) - [i112]Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. CoRR abs/2205.03433 (2022) - 2021
- [j34]Lin Zhu
, Chengwen Zhang
, He Yin
, Dingrui Li, Yu Su
, Ishita Ray
, Jiaojiao Dong
, Fred Wang, Leon M. Tolbert
, Yilu Liu
, Yiwei Ma, Bruce Rogers, Jim Glass
, Lilian Bruce, Samuel Delay, Peter Gregory, Mario Garcia-Sanz, Mirjana Marden:
A Smart and Flexible Microgrid With a Low-Cost Scalable Open-Source Controller. IEEE Access 9: 162214-162230 (2021) - [j33]Yuan Gong
, Yu-An Chung
, James R. Glass:
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3292-3306 (2021) - [c303]Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song, James R. Glass:
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units. ACL/IJCNLP (1) 2021: 5284-5300 - [c302]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions. CVPR 2021: 14871-14881 - [c301]Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng:
Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models. EACL 2021: 1121-1133 - [c300]Tianxing He, Jingzhao Zhang, Zhiming Zhou, James R. Glass:
Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? EMNLP (1) 2021: 5087-5102 - [c299]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. ICASSP 2021: 3040-3044 - [c298]Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. ICASSP 2021: 7468-7472 - [c297]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. ICCV 2021: 7992-8001 - [c296]Yuan Gong
, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. Interspeech 2021: 571-575 - [c295]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588 - [c294]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. Interspeech 2021: 3006-3010 - [c293]Hongyin Luo, James R. Glass, Garima Lalwani, Yi Zhang, Shang-Wen Li:
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection. Interspeech 2021: 3241-3245 - [c292]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. Interspeech 2021: 3650-3654 - [c291]Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. Interspeech 2021: 3730-3734 - [c290]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, Jim Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. NeurIPS 2021: 21256-21272 - [c289]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. RANLP 2021: 1597-1605 - [i111]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. CoRR abs/2101.09773 (2021) - [i110]Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. CoRR abs/2102.01243 (2021) - [i109]Hongyin Luo, Shang-Wen Li, Seunghak Yu, James R. Glass:
Cooperative Learning of Zero-Shot Machine Reading Comprehension. CoRR abs/2103.07449 (2021) - [i108]Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. CoRR abs/2104.01778 (2021) - [i107]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Schmidt Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. CoRR abs/2104.12671 (2021) - [i106]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. CoRR abs/2105.04489 (2021) - [i105]Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. CoRR abs/2106.05438 (2021) - [i104]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. CoRR abs/2106.05933 (2021) - [i103]Yung-Sung Chuang, Mingye Gao, Hongyin Luo, James R. Glass, Hung-Yi Lee, Yun-Nung Chen, Shang-Wen Li:
Mitigating Biases in Toxic Language Detection through Invariant Rationalization. CoRR abs/2106.07240 (2021) - [i102]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. CoRR abs/2108.12802 (2021) - [i101]Tianxing He, Kyunghyun Cho, James R. Glass:
An Empirical Study on Few-shot Knowledge Probing for Pretrained Language Models. CoRR abs/2109.02772 (2021) - [i100]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. CoRR abs/2110.01147 (2021) - [i99]Sameer Khurana, Antoine Laurent, James R. Glass:
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0. CoRR abs/2110.03560 (2021) - [i98]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. CoRR abs/2110.07575 (2021) - [i97]Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021) - [i96]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. CoRR abs/2111.04823 (2021) - [i95]Kevin Duarte, Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Samuel Thomas, Alexander H. Liu, David Harwath, James R. Glass, Hilde Kuehne, Mubarak Shah:
Routing with Self-Attention for Multimodal Capsule Networks. CoRR abs/2112.00775 (2021) - [i94]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CoRR abs/2112.04446 (2021) - 2020
- [j32]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. Comput. Linguistics 46(1): 1-52 (2020) - [j31]David Harwath
, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James R. Glass:
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. Int. J. Comput. Vis. 128(3): 620-641 (2020) - [c288]Tianxing He, James R. Glass:
Negative Training for Neural Dialogue Response Generation. ACL 2020: 2044-2058 - [c287]Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. ACL 2020: 2353-2358 - [c286]Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James R. Glass, Preslav Nakov:
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context. ACL 2020: 3364-3374 - [c285]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. ACL 2020: 4638-4655 - [c284]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. ClinicalNLP@EMNLP 2020: 136-145 - [c283]Ramy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov:
We Can Detect Your Bias: Predicting the Political Ideology of News Articles. EMNLP (1) 2020: 4982-4991 - [c282]Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. ICASSP 2020: 3497-3501 - [c281]Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass:
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms. ICASSP 2020: 4352-4356 - [c280]François Grondin, Hao Tang, James R. Glass:
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT. ICASSP 2020: 4856-4860 - [c279]Jennifer Drexler, James R. Glass:
Learning a Subword Inventory Jointly with End-to-End Automatic Speech Recognition. ICASSP 2020: 6439-6443 - [c278]Suwon Shon, Ahmed Ali, Younes Samih, Hamdy Mubarak, James R. Glass:
ADI17: A Fine-Grained Arabic Dialect Identification Dataset. ICASSP 2020: 8244-8248 - [c277]David Harwath, Wei-Ning Hsu, James R. Glass:
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. ICLR 2020 - [c276]Moin Nadeem, Tianxing He, Kyunghyun Cho, James R. Glass:
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation. AACL/IJCNLP 2020: 334-346 - [c275]Michael Gump, Wei-Ning Hsu, James R. Glass:
Unsupervised Methods for Evaluating Speech Representations. INTERSPEECH 2020: 170-174 - [c274]Shammur A. Chowdhury, Ahmed Ali, Suwon Shon, James R. Glass:
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information? INTERSPEECH 2020: 462-466 - [c273]Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass:
Pair Expansion for Learning Multilingual Semantic Embeddings Using Disjoint Visually-Grounded Speech Audio Datasets. INTERSPEECH 2020: 1486-1490 - [c272]Suwon Shon, James R. Glass:
Multimodal Association for Speaker Verification. INTERSPEECH 2020: 2247-2251 - [c271]Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. INTERSPEECH 2020: 3760-3764 - [c270]Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski
, Adrian Lancucki, Ricard Marxer, James R. Glass:
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning. INTERSPEECH 2020: 3790-3794 - [c269]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. INTERSPEECH 2020: 3895-3899 - [i93]François Grondin, Hao Tang, James R. Glass:
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT. CoRR abs/2002.01440 (2020) - [i92]Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. CoRR abs/2004.05274 (2020) - [i91]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. CoRR abs/2005.01172 (2020) - [i90]Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James R. Glass, Preslav Nakov:
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context. CoRR abs/2005.04518 (2020) - [i89]Yu-An Chung, Hao Tang, James R. Glass:
Vector-Quantized Autoregressive Predictive Coding. CoRR abs/2005.08392 (2020) - [i88]Hongyin Luo, Shang-Wen Li, James R. Glass:
Prototypical Q Networks for Automatic Conversational Diagnosis and Few-Shot New Disease Adaption. CoRR abs/2005.11153 (2020) - [i87]Sameer Khurana, Antoine Laurent, Wei-Ning Hsu, Jan Chorowski, Adrian Lancucki, Ricard Marxer, James R. Glass:
A Convolutional Deep Markov Model for Unsupervised Speech Representation Learning. CoRR abs/2006.02547 (2020) - [i86]Sameer Khurana, Antoine Laurent, James R. Glass:
CSTNet: Contrastive Speech Translation Network for Self-Supervised Speech Representation Learning. CoRR abs/2006.02814 (2020) - [i85]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Rogério Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. CoRR abs/2006.09199 (2020) - [i84]Seunghak Yu, Tianxing He, James R. Glass:
Constructing a Knowledge Graph from Unstructured Documents without External Alignment. CoRR abs/2008.08995 (2020) - [i83]Moin Nadeem, Tianxing He, Kyunghyun Cho, James R. Glass:
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation. CoRR abs/2009.07243 (2020) - [i82]Ramy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov:
We Can Detect Your Bias: Predicting the Political Ideology of News Articles. CoRR abs/2010.05338 (2020) - [i81]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. CoRR abs/2010.11481 (2020) - [i80]Cheng-I Lai, Yung-Sung Chuang, Hung-yi Lee, Shang-wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. CoRR abs/2010.13826 (2020) - [i79]Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. CoRR abs/2011.00406 (2020) - [i78]Wei-Ning Hsu, David Harwath, Christopher Song, James R. Glass:
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units. CoRR abs/2012.15454 (2020)
2010 – 2019
- 2019
- [j30]Salvatore Romeo, Giovanni Da San Martino
, Yonatan Belinkov, Alberto Barrón-Cedeño, Mohamed Eldesouki
, Kareem Darwish, Hamdy Mubarak, James R. Glass, Alessandro Moschitti:
Language processing and learning models for community question answering in Arabic. Inf. Process. Manag. 56(2): 274-290 (2019) - [j29]Pepa Atanasova, Preslav Nakov
, Lluís Màrquez, Alberto Barrón-Cedeño, Georgi Karadzhov, Tsvetomila Mihaylova, Mitra Mohtarami, James R. Glass:
Automatic Fact-Checking Using Context and Discourse Information. ACM J. Data Inf. Qual. 11(3): 12:1-12:27 (2019) - [j28]Yonatan Belinkov, James R. Glass:
Analysis Methods in Neural Language Processing: A Survey. Trans. Assoc. Comput. Linguistics 7: 49-72 (2019) - [j27]Achintya Kumar Sarkar
, Zheng-Hua Tan
, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1267-1279 (2019) - [j26]Mandy Korpusik
, James R. Glass:
Deep Learning for Database Mapping and Asking Clarification Questions in Dialogue Systems. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1321-1334 (2019) - [c268]Fahim Dalvi, Nadir Durrani, Hassan Sajjad, Yonatan Belinkov, Anthony Bau, James R. Glass:
What Is One Grain of Sand in the Desert? Analyzing Individual Neurons in Deep NLP Models. AAAI 2019: 6309-6317 - [c267]Fahim Dalvi, Avery Nortonsmith, Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, James R. Glass:
NeuroX: A Toolkit for Analyzing Individual Neurons in Neural Networks. AAAI 2019: 9851-9852 - [c266]Hongyin Luo, Lan Jiang, Yonatan Belinkov, Jim Glass:
Improving Neural Language Models by Segmenting, Attending, and Predicting the Future. ACL (1) 2019: 1483-1493 - [c265]Jennifer Drexler, James R. Glass:
Explicit Alignment of Text and Speech Encodings for Attention-Based End-to-End Speech Recognition. ASRU 2019: 913-919 - [c264]Ahmed Ali, Suwon Shon, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, James R. Glass, Steve Renals
, Khalid Choukri:
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech. ASRU 2019: 1026-1033 - [c263]Angie W. Boggust, Kartik Audhkhasi, Dhiraj Joshi, David Harwath, Samuel Thomas, Rogério Schmidt Feris, Danny Gutfreund, Yang Zhang, Antonio Torralba, Michael Picheny, James R. Glass:
Grounding Spoken Words in Unlabeled Video. CVPR Workshops 2019: 29-32 - [c262]Didac Suris, Adrià Recasens, David Bau
, David Harwath, James R. Glass, Antonio Torralba:
Learning Words by Drawing Images. CVPR 2019: 2029-2038 - [c261]François Grondin, Iwona Sobieraj, Mark D. Plumbley, James R. Glass:
Sound Event Localization and Detection Using CRNN on Pairs of Microphones. DCASE 2019: 84-88 - [c260]Yifan Zhang, Giovanni Da San Martino, Alberto Barrón-Cedeño, Salvatore Romeo, Jisun An, Haewoon Kwak
, Todor Staykovski, Israa Jaradat, Georgi Karadzhov, Ramy Baly, Kareem Darwish, James R. Glass, Preslav Nakov:
Tanbih: Get To Know What You Are Reading. EMNLP/IJCNLP (3) 2019: 223-228 - [c259]Mitra Mohtarami, James R. Glass, Preslav Nakov:
Contrastive Language Adaptation for Cross-Lingual Stance Detection. EMNLP/IJCNLP (1) 2019: 4441-4451 - [c258]David Harwath, James R. Glass:
Towards Visually Grounded Sub-word Speech Unit Discovery. ICASSP 2019: 3017-3021 - [c257]Suwon Shon, Tae-Hyun Oh, James R. Glass:
Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion. ICASSP 2019: 3995-3999 - [c256]François Grondin, James R. Glass:
SVD-PHAT: A Fast Sound Source Localization Method. ICASSP 2019: 4140-4144 - [c255]Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Yu-An Chung, Yuxuan Wang, Yonghui Wu, James R. Glass:
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization. ICASSP 2019: 5901-5905 - [c254]Suwon Shon, Ahmed Ali, James R. Glass:
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain. ICASSP 2019: 5951-5955 - [c253]Jennifer Drexler, James R. Glass:
Subword Regularization and Beam Search Decoding for End-to-end Automatic Speech Recognition. ICASSP 2019: 6266-6270 - [c252]Sameer Khurana, Shafiq Rayhan Joty, Ahmed Ali, James R. Glass:
A Factorial Deep Markov Model for Unsupervised Disentangled Representation Learning from Speech. ICASSP 2019: 6540-6544 - [c251]Yu-An Chung, Wei-Hung Weng, Schrasing Tong, James R. Glass:
Towards Unsupervised Speech-to-text Translation. ICASSP 2019: 7170-7174 - [c250]Mandy Korpusik, James R. Glass:
Dialogue State Tracking with Convolutional Semantic Taggers. ICASSP 2019: 7220-7224 - [c249]Anthony Bau, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Identifying and Controlling Important Neurons in Neural Machine Translation. ICLR (Poster) 2019 - [c248]Tianxing He, James R. Glass:
Detecting Egregious Responses in Neural Sequence-to-sequence Models. ICLR (Poster) 2019 - [c247]Yonatan Belinkov, Ahmed Ali, James R. Glass:
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition. INTERSPEECH 2019: 81-85 - [c246]Yu-An Chung, Wei-Ning Hsu, Hao Tang, James R. Glass:
An Unsupervised Autoregressive Model for Speech Representation Learning. INTERSPEECH 2019: 146-150 - [c245]Emmanuel Azuh, David Harwath, James R. Glass:
Towards Bilingual Lexicon Discovery From Visually Grounded Speech Audio. INTERSPEECH 2019: 276-280 - [c244]Suwon Shon, Najim Dehak
, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. INTERSPEECH 2019: 356-360 - [c243]Hongyin Luo, Mitra Mohtarami, James R. Glass, Karthik Krishnamurthy, Brigitte Richardson:
Integrating Video Retrieval and Moment Detection in a Unified Corpus for Video Question Answering. INTERSPEECH 2019: 599-603 - [c242]Mandy Korpusik, Zoe Liu, James R. Glass:
A Comparison of Deep Learning Methods for Language Understanding. INTERSPEECH 2019: 849-853 - [c241]Logan Ford, Hao Tang, François Grondin, James R. Glass:
A Deep Residual Network for Large-Scale Acoustic Scene Analysis. INTERSPEECH 2019: 2568-2572 - [c240]François Grondin, James R. Glass:
Multiple Sound Source Localization with SVD-PHAT. INTERSPEECH 2019: 2698-2702 - [c239]Suwon Shon, Hao Tang, James R. Glass:
VoiceID Loss: Speech Enhancement for Speaker Verification. INTERSPEECH 2019: 2888-2892