default search action
Bhiksha Raj
Bhiksha Ramakrishnan
Person information
- affiliation: Carnegie Mellon University, Pittsburgh, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j39]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-Oriented Manipulation of Speaker Representations. IEEE Access 12: 82949-82971 (2024) - [j38]Fan Yang, Muqiao Yang, Xiang Li, Yuxuan Wu, Zhiyuan Zhao, Bhiksha Raj, Rita Singh:
A closer look at reinforcement learning-based automatic speech recognition. Comput. Speech Lang. 87: 101641 (2024) - [c260]Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj:
Continual Contrastive Spoken Language Understanding. ACL (Findings) 2024: 3727-3741 - [c259]Roshan Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? ACL (1) 2024: 14779-14797 - [c258]Yizhou Zhao, Tuanfeng Yang Wang, Bhiksha Raj, Min Xu, Jimei Yang, Chun-Hao Paul Huang:
Synergistic Global-Space Camera and Human Reconstruction from Videos. CVPR 2024: 1216-1226 - [c257]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition. CVPR 2024: 3402-3413 - [c256]Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj:
R2-Bench: Benchmarking the Robustness of Referring Perception Models Under Perturbations. ECCV (9) 2024: 211-230 - [c255]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. ICASSP 2024: 371-375 - [c254]Muhammad A. Shah, Bhiksha Raj:
Fixed Inter-Neuron Covariability Induces Adversarial Robustness. ICASSP 2024: 7005-7009 - [c253]Muqiao Yang, Umberto Cappellazzo, Xiang Li, Bhiksha Raj:
Improving Continual Learning of Acoustic Scene Classification via Mutual Information Optimization. ICASSP 2024: 7105-7109 - [c252]Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models. ICASSP 2024: 7125-7129 - [c251]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of Negative Sampling in Weak Label Learning. ICASSP 2024: 7530-7534 - [c250]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties for Emotion Representation. ICASSP 2024: 11936-11940 - [c249]Jee-Weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models. ICASSP 2024: 12071-12075 - [c248]Chien-Yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-Yi Lee:
Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech. ICASSP 2024: 12136-12140 - [c247]Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks. ICLR 2024 - [c246]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. ICML 2024 - [c245]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. ICML 2024 - [c244]Jimin Sohn, Haeji Jung, Zhiwen Yan, Vibha Masti, Xiang Li, Bhiksha Raj:
Fashion Image Retrieval with Occlusion. ICPR (21) 2024: 31-46 - [c243]Roshan Sharma, Ruchira Sharma, Hira Dhamyal, Rita Singh, Bhiksha Raj:
R-BASS : Relevance-aided Block-wise Adaptation for Speech Summarization. NAACL-HLT (Findings) 2024: 848-857 - [c242]Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao:
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. NAACL-HLT 2024: 1346-1362 - [i161]Jee-weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: towards generalizable speech summarization using synthetic labels from large language model. CoRR abs/2401.06806 (2024) - [i160]Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang:
PAM: Prompting Audio-Language Models for Audio Quality Assessment. CoRR abs/2402.00282 (2024) - [i159]Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang:
On Catastrophic Inheritance of Large Foundation Models. CoRR abs/2402.01909 (2024) - [i158]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. CoRR abs/2402.01922 (2024) - [i157]Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang:
Customizable Perturbation Synthesis for Robust SLAM Benchmarking. CoRR abs/2402.08125 (2024) - [i156]Soham Deshmukh, Rita Singh, Bhiksha Raj:
Domain Adaptation for Contrastive Audio-Language Models. CoRR abs/2402.09585 (2024) - [i155]Muqiao Yang, Xiang Li, Umberto Cappellazzo, Shinji Watanabe, Bhiksha Raj:
Evaluating and Improving Continual Learning in Spoken Language Understanding. CoRR abs/2402.10427 (2024) - [i154]Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao:
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. CoRR abs/2402.11452 (2024) - [i153]Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj:
R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations. CoRR abs/2403.04924 (2024) - [i152]Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Learning with Noisy Foundation Models. CoRR abs/2403.06869 (2024) - [i151]Francisco Teixeira, Karla Pizzi, Raphaël Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features. CoRR abs/2405.01207 (2024) - [i150]Yizhou Zhao, Tuanfeng Y. Wang, Bhiksha Raj, Min Xu, Jimei Yang, Chun-Hao Paul Huang:
Synergistic Global-space Camera and Human Reconstruction from Videos. CoRR abs/2405.14855 (2024) - [i149]Hao Chen, Yujin Han, Diganta Misra, Xiang Li, Kai Hu, Difan Zou, Masashi Sugiyama, Jindong Wang, Bhiksha Raj:
Slight Corruption in Pre-training Data Makes Better Diffusion Models. CoRR abs/2405.20494 (2024) - [i148]Thanh-Dat Truong, Utsav Prabhu, Dongyi Wang, Bhiksha Raj, Susan Gauch, Jeyamkondan Subbiah, Khoa Luu:
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding. CoRR abs/2406.01429 (2024) - [i147]Thanh-Dat Truong, Xin Li, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models. CoRR abs/2406.01432 (2024) - [i146]Xiang Li, Kai Qiu, Hao Chen, Jason Kuen, Zhe Lin, Rita Singh, Bhiksha Raj:
ControlVAR: Exploring Controllable Visual Autoregressive Modeling. CoRR abs/2406.09750 (2024) - [i145]Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang:
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking. CoRR abs/2406.16850 (2024) - [i144]Yuxuan Wu, Ziyu Wang, Bhiksha Raj, Gus Xia:
Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints. CoRR abs/2407.03824 (2024) - [i143]Hazim T. Bukhari, Soham Deshmukh, Hira Dhamyal, Bhiksha Raj, Rita Singh:
SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios. CoRR abs/2407.15300 (2024) - [i142]Soham Deshmukh, Shuo Han, Hazim T. Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj:
Audio Entailment: Assessing Deductive Reasoning for Audio Understanding. CoRR abs/2407.18062 (2024) - [i141]Roshan S. Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? CoRR abs/2408.07277 (2024) - [i140]Kai Qiu, Xiang Li, Hao Chen, Jie Sun, Jinglu Wang, Zhe Lin, Marios Savvides, Bhiksha Raj:
Efficient Autoregressive Audio Modeling via Next-Scale Prediction. CoRR abs/2408.09027 (2024) - [i139]Massa Baali, Abdulhamid Aldoobi, Hira Dhamyal, Rita Singh, Bhiksha Raj:
PDAF: A Phonetic Debiasing Attention Framework For Speaker Verification. CoRR abs/2409.05799 (2024) - [i138]Kuang Yuan, Shuo Han, Swarun Kumar, Bhiksha Raj:
DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing. CoRR abs/2409.06137 (2024) - [i137]Jiatong Shi, Jinchuan Tian, Yihan Wu, Jee-weon Jung, Jia Qi Yip, Yoshiki Masuyama, William Chen, Yuning Wu, Yuxun Tang, Massa Baali, Dareen Alharthi, Dong Zhang, Ruifan Deng, Tejes Srivastava, Haibin Wu, Alexander H. Liu, Bhiksha Raj, Qin Jin, Ruihua Song, Shinji Watanabe:
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech. CoRR abs/2409.15897 (2024) - [i136]Muhammad A. Shah, Bhiksha Raj:
Revisiting Acoustic Features for Robust ASR. CoRR abs/2409.16399 (2024) - [i135]Xiang Li, Kai Qiu, Hao Chen, Jason Kuen, Jiuxiang Gu, Bhiksha Raj, Zhe Lin:
ImageFolder: Autoregressive Image Generation with Folded Tokens. CoRR abs/2410.01756 (2024) - [i134]Ksheeraja Raghavan, Samiran Gode, Ankit Shah, Surabhi Raghavan, Wolfram Burgard, Bhiksha Raj, Rita Singh:
Did You Hear That? Introducing AADG: A Framework for Generating Benchmark Data in Audio Anomaly Detection. CoRR abs/2410.03904 (2024) - [i133]Ibrahim Aldarmaki, Thamar Solorio, Bhiksha Raj, Hanan Aldarmaki:
RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement. CoRR abs/2410.05019 (2024) - [i132]Satvik Dixit, Massa Baali, Rita Singh, Bhiksha Raj:
Improving Speaker Representations Using Contrastive Losses on Multi-scale Features. CoRR abs/2410.05037 (2024) - [i131]Abdul Waheed, Hanin Atwany, Bhiksha Raj, Rita Singh:
What Do Speech Foundation Models Not Learn About Speech? CoRR abs/2410.12948 (2024) - [i130]Hao Chen, Abdul Waheed, Xiang Li, Yidong Wang, Jindong Wang, Bhiksha Raj, Marah I Abdin:
On the Diversity of Synthetic Data and its Impact on Training Large Language Models. CoRR abs/2410.15226 (2024) - [i129]Ravi Teja N. V. S. Chappa, Page Daniel Dobbs, Bhiksha Raj, Khoa Luu:
FLAASH: Flow-Attention Adaptive Semantic Hierarchical Fusion for Multi-Modal Tobacco Content Analysis. CoRR abs/2410.19896 (2024) - [i128]Satvik Dixit, Soham Deshmukh, Bhiksha Raj:
MACE: Leveraging Audio for Evaluating Audio Captioning Systems. CoRR abs/2411.00321 (2024) - [i127]Yichen Wang, Jie Wang, Fulin Wang, Xiang Li, Hao Yin, Bhiksha Raj:
Perturbation Ontology based Graph Attention Networks. CoRR abs/2411.18520 (2024) - 2023
- [j37]Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo:
Understanding political polarization using language models: A dataset and method. AI Mag. 44(3): 248-254 (2023) - [j36]Viet-Khoa Vo-Ho, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le:
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation. Int. J. Comput. Vis. 131(1): 302-323 (2023) - [j35]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2458-2474 (2023) - [c241]Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj:
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance. AAAI 2023: 1424-1432 - [c240]Kashu Yamazaki, Khoa Vo, Quang Sang Truong, Bhiksha Raj, Ngan Le:
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. AAAI 2023: 3081-3090 - [c239]Roshan S. Sharma, William Chen, Takatomo Kano, Ruchira Sharma, Siddhant Arora, Shinji Watanabe, Atsunori Ogawa, Marc Delcroix, Rita Singh, Bhiksha Raj:
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems. ASRU 2023: 1-8 - [c238]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CVPR 2023: 19988-19997 - [c237]Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Fan Yang, Yizhou Zhao, Rita Singh, Bhiksha Raj:
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text. EMNLP 2023: 2283-2296 - [c236]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
Token Prediction as Implicit Classification to Identify LLM-Generated Text. EMNLP 2023: 13112-13120 - [c235]Ankit Shah, Larry Tang, Po Hao Chou, Yi Yu Zheng, Ziqian Ge, Bhiksha Raj:
An Approach to Ontological Learning from Weak Labels. ICASSP 2023: 1-5 - [c234]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-Preserving Automatic Speaker Diarization. ICASSP 2023: 1-5 - [c233]Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
Paaploss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement. ICASSP 2023: 1-5 - [c232]Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. ICASSP 2023: 1-5 - [c231]Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf:
Pairwise Similarity Learning is SimPLE. ICCV 2023: 5285-5295 - [c230]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Bhiksha Raj, Yan Lu:
Robust Referring Video Object Segmentation with Cyclic Structural Consensus. ICCV 2023: 22179-22188 - [c229]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning. ICLR 2023 - [c228]Yidong Wang, Hao Chen, Qiang Heng, Wenxin Hou, Yue Fan, Zhen Wu, Jindong Wang, Marios Savvides, Takahiro Shinozaki, Bhiksha Raj, Bernt Schiele, Xing Xie:
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. ICLR 2023 - [c227]Raphaël Olivier, Bhiksha Raj:
How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy. ICML 2023: 26583-26598 - [c226]Roshan Sharma, Siddhant Arora, Kenneth Zheng, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. INTERSPEECH 2023: 1454-1458 - [c225]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. INTERSPEECH 2023: 2578-2582 - [c224]Raphaël Olivier, Bhiksha Raj:
There is more than one kind of robustness: Fooling Whisper with adversarial examples. INTERSPEECH 2023: 4394-4398 - [c223]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. ACM Multimedia 2023: 2458-2467 - [c222]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Rita Singh, Bhiksha Raj:
PaintSeg: Painting Pixels for Training-free Segmentation. NeurIPS 2023 - [c221]Shentong Mo, Bhiksha Raj:
Weakly-Supervised Audio-Visual Segmentation. NeurIPS 2023 - [c220]Muhammad A. Shah, Aqsa Kashaf, Bhiksha Raj:
Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS 2023 - [c219]Thanh-Dat Truong, Hoang-Quan Nguyen, Bhiksha Raj, Khoa Luu:
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments. NeurIPS 2023 - [i126]Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo:
Understanding Political Polarisation using Language Models: A dataset and method. CoRR abs/2301.00891 (2023) - [i125]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning. CoRR abs/2301.10921 (2023) - [i124]Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08088 (2023) - [i123]Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08095 (2023) - [i122]Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh:
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session. CoRR abs/2302.09719 (2023) - [i121]Ankit Shah, Shuyi Chen, Kejun Zhou, Yue Chen, Bhiksha Raj:
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms. CoRR abs/2303.03591 (2023) - [i120]Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Hojeong Lee, Ankit Shah, Shuo Han, Yunyang Zeng, Amanda Shu, Haohui Liu, Xuankai Chang, Hamza Khalid, Minseon Gwak, Kawon Lee, Minjeong Kim, Bhiksha Raj:
Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms. CoRR abs/2303.09048 (2023) - [i119]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CoRR abs/2304.02135 (2023) - [i118]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. CoRR abs/2305.07969 (2023) - [i117]Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations. CoRR abs/2305.12715 (2023) - [i116]Thanh-Dat Truong, Hoang-Quan Nguyen, Bhiksha Raj, Khoa Luu:
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments. CoRR abs/2305.15700 (2023) - [i115]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Bhiksha Raj:
PaintSeg: Training-free Segmentation via Painting. CoRR abs/2305.19406 (2023) - [i114]Pha A. Nguyen, Kha Gia Quach, John Gauch, Samee U. Khan, Bhiksha Raj, Khoa Luu:
UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation. CoRR abs/2306.09613 (2023) - [i113]Roshan S. Sharma, Kenneth Zheng, Siddhant Arora, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. CoRR abs/2307.08217 (2023) - [i112]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. CoRR abs/2307.13948 (2023) - [i111]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. CoRR abs/2307.13953 (2023) - [i110]Muhammad A. Shah, Bhiksha Raj:
Training on Foveated Images Improves Robustness to Adversarial Attacks. CoRR abs/2308.00854 (2023) - [i109]Muhammad Ahmed Shah, Bhiksha Raj:
Fixed Inter-Neuron Covariability Induces Adversarial Robustness. CoRR abs/2308.03956 (2023) - [i108]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. CoRR abs/2309.07372 (2023) - [i107]Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee:
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech. CoRR abs/2309.09510 (2023) - [i106]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of negative sampling in weak label learning. CoRR abs/2309.13227 (2023) - [i105]Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks. CoRR abs/2309.17002 (2023) - [i104]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition. CoRR abs/2310.00132 (2023) - [i103]Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh:
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech. CoRR abs/2310.00706 (2023) - [i102]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. CoRR abs/2310.00808 (2023) - [i101]Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models. CoRR abs/2310.00900 (2023) - [i100]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties For Emotion Representation. CoRR abs/2310.02298 (2023) - [i99]