Bhiksha Raj
Bhiksha Ramakrishnan
Person information
- affiliation: Carnegie Mellon University, Pittsburgh, USA
2020 – today
- 2024
- [j39]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-Oriented Manipulation of Speaker Representations. IEEE Access 12: 82949-82971 (2024) - [j38]Fan Yang, Muqiao Yang, Xiang Li, Yuxuan Wu, Zhiyuan Zhao, Bhiksha Raj, Rita Singh:
A closer look at reinforcement learning-based automatic speech recognition. Comput. Speech Lang. 87: 101641 (2024) - [c255]Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj:
Continual Contrastive Spoken Language Understanding. ACL (Findings) 2024: 3727-3741 - [c254]Roshan Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? ACL (1) 2024: 14779-14797 - [c253]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. ICASSP 2024: 371-375 - [c252]Muhammad A. Shah, Bhiksha Raj:
Fixed Inter-Neuron Covariability Induces Adversarial Robustness. ICASSP 2024: 7005-7009 - [c251]Muqiao Yang, Umberto Cappellazzo, Xiang Li, Bhiksha Raj:
Improving Continual Learning of Acoustic Scene Classification via Mutual Information Optimization. ICASSP 2024: 7105-7109 - [c250]Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement And Editing with Conditional Diffusion Models. ICASSP 2024: 7125-7129 - [c249]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of Negative Sampling in Weak Label Learning. ICASSP 2024: 7530-7534 - [c248]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties for Emotion Representation. ICASSP 2024: 11936-11940 - [c247]Jee-Weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: Towards Generalizable Speech Summarization Using Synthetic Labels from Large Language Models. ICASSP 2024: 12071-12075 - [c246]Chien-Yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-Yi Lee:
Dynamic-Superb: Towards a Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark For Speech. ICASSP 2024: 12136-12140 - [c245]Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks. ICLR 2024 - [c244]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. ICML 2024 - [c243]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Hao Chen, Kai Hu, Rita Singh, Bhiksha Raj, Lijuan Wang, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. ICML 2024 - [c242]Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao:
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. NAACL-HLT 2024: 1346-1362 - [i146]Jee-weon Jung, Roshan S. Sharma, William Chen, Bhiksha Raj, Shinji Watanabe:
AugSumm: towards generalizable speech summarization using synthetic labels from large language model. CoRR abs/2401.06806 (2024) - [i145]Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang:
PAM: Prompting Audio-Language Models for Audio Quality Assessment. CoRR abs/2402.00282 (2024) - [i144]Hao Chen, Bhiksha Raj, Xing Xie, Jindong Wang:
On Catastrophic Inheritance of Large Foundation Models. CoRR abs/2402.01909 (2024) - [i143]Hao Chen, Jindong Wang, Lei Feng, Xiang Li, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
A General Framework for Learning from Weak Supervision. CoRR abs/2402.01922 (2024) - [i142]Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang:
Customizable Perturbation Synthesis for Robust SLAM Benchmarking. CoRR abs/2402.08125 (2024) - [i141]Soham Deshmukh, Rita Singh, Bhiksha Raj:
Domain Adaptation for Contrastive Audio-Language Models. CoRR abs/2402.09585 (2024) - [i140]Muqiao Yang, Xiang Li, Umberto Cappellazzo, Shinji Watanabe, Bhiksha Raj:
Evaluating and Improving Continual Learning in Spoken Language Understanding. CoRR abs/2402.10427 (2024) - [i139]Zhaorun Chen, Zhuokai Zhao, Zhihong Zhu, Ruiqi Zhang, Xiang Li, Bhiksha Raj, Huaxiu Yao:
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition. CoRR abs/2402.11452 (2024) - [i138]Xiang Li, Kai Qiu, Jinglu Wang, Xiaohao Xu, Rita Singh, Kashu Yamazaki, Hao Chen, Xiaonan Huang, Bhiksha Raj:
R2-Bench: Benchmarking the Robustness of Referring Perception Models under Perturbations. CoRR abs/2403.04924 (2024) - [i137]Hao Chen, Jindong Wang, Zihan Wang, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Learning with Noisy Foundation Models. CoRR abs/2403.06869 (2024) - [i136]Francisco Teixeira, Karla Pizzi, Raphaël Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features. CoRR abs/2405.01207 (2024) - [i135]Yizhou Zhao, Tuanfeng Y. Wang, Bhiksha Raj, Min Xu, Jimei Yang, Chun-Hao Paul Huang:
Synergistic Global-space Camera and Human Reconstruction from Videos. CoRR abs/2405.14855 (2024) - [i134]Hao Chen, Yujin Han, Diganta Misra, Xiang Li, Kai Hu, Difan Zou, Masashi Sugiyama, Jindong Wang, Bhiksha Raj:
Slight Corruption in Pre-training Data Makes Better Diffusion Models. CoRR abs/2405.20494 (2024) - [i133]Thanh-Dat Truong, Utsav Prabhu, Dongyi Wang, Bhiksha Raj, Susan Gauch, Jeyamkondan Subbiah, Khoa Luu:
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding. CoRR abs/2406.01429 (2024) - [i132]Thanh-Dat Truong, Xin Li, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models. CoRR abs/2406.01432 (2024) - [i131]Xiang Li, Kai Qiu, Hao Chen, Jason Kuen, Zhe Lin, Rita Singh, Bhiksha Raj:
ControlVAR: Exploring Controllable Visual Autoregressive Modeling. CoRR abs/2406.09750 (2024) - [i130]Xiaohao Xu, Tianyi Zhang, Sibo Wang, Xiang Li, Yongqi Chen, Ye Li, Bhiksha Raj, Matthew Johnson-Roberson, Xiaonan Huang:
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking. CoRR abs/2406.16850 (2024) - [i129]Yuxuan Wu, Ziyu Wang, Bhiksha Raj, Gus Xia:
Emergent Interpretable Symbols and Content-Style Disentanglement via Variance-Invariance Constraints. CoRR abs/2407.03824 (2024) - [i128]Hazim T. Bukhari, Soham Deshmukh, Hira Dhamyal, Bhiksha Raj, Rita Singh:
SELM: Enhancing Speech Emotion Recognition for Out-of-Domain Scenarios. CoRR abs/2407.15300 (2024) - [i127]Soham Deshmukh, Shuo Han, Hazim T. Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj:
Audio Entailment: Assessing Deductive Reasoning for Audio Understanding. CoRR abs/2407.18062 (2024)
- 2023
- [j37]Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo:
Understanding political polarization using language models: A dataset and method. AI Mag. 44(3): 248-254 (2023) - [j36]Viet-Khoa Vo-Ho, Sang Truong, Kashu Yamazaki, Bhiksha Raj, Minh-Triet Tran, Ngan Le:
AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation. Int. J. Comput. Vis. 131(1): 302-323 (2023) - [j35]Weiyang Liu, Yandong Wen, Bhiksha Raj, Rita Singh, Adrian Weller:
SphereFace Revived: Unifying Hyperspherical Face Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(2): 2458-2474 (2023) - [c241]Xiang Li, Haoyuan Cao, Shijie Zhao, Junlin Li, Li Zhang, Bhiksha Raj:
Panoramic Video Salient Object Detection with Ambisonic Audio Guidance. AAAI 2023: 1424-1432 - [c240]Kashu Yamazaki, Khoa Vo, Quang Sang Truong, Bhiksha Raj, Ngan Le:
VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning. AAAI 2023: 3081-3090 - [c239]Roshan S. Sharma, William Chen, Takatomo Kano, Ruchira Sharma, Siddhant Arora, Shinji Watanabe, Atsunori Ogawa, Marc Delcroix, Rita Singh, Bhiksha Raj:
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems. ASRU 2023: 1-8 - [c238]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CVPR 2023: 19988-19997 - [c237]Xiang Li, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Fan Yang, Yizhou Zhao, Rita Singh, Bhiksha Raj:
Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text. EMNLP 2023: 2283-2296 - [c236]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
Token Prediction as Implicit Classification to Identify LLM-Generated Text. EMNLP 2023: 13112-13120 - [c235]Ankit Shah, Larry Tang, Po Hao Chou, Yi Yu Zheng, Ziqian Ge, Bhiksha Raj:
An Approach to Ontological Learning from Weak Labels. ICASSP 2023: 1-5 - [c234]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Privacy-Preserving Automatic Speaker Diarization. ICASSP 2023: 1-5 - [c233]Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
Paaploss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement. ICASSP 2023: 1-5 - [c232]Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. ICASSP 2023: 1-5 - [c231]Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf:
Pairwise Similarity Learning is SimPLE. ICCV 2023: 5285-5295 - [c230]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Bhiksha Raj, Yan Lu:
Robust Referring Video Object Segmentation with Cyclic Structural Consensus. ICCV 2023: 22179-22188 - [c229]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Tradeoff in Semi-supervised Learning. ICLR 2023 - [c228]Yidong Wang, Hao Chen, Qiang Heng, Wenxin Hou, Yue Fan, Zhen Wu, Jindong Wang, Marios Savvides, Takahiro Shinozaki, Bhiksha Raj, Bernt Schiele, Xing Xie:
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. ICLR 2023 - [c227]Raphaël Olivier, Bhiksha Raj:
How Many Perturbations Break This Model? Evaluating Robustness Beyond Adversarial Accuracy. ICML 2023: 26583-26598 - [c226]Roshan Sharma, Siddhant Arora, Kenneth Zheng, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. INTERSPEECH 2023: 1454-1458 - [c225]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. INTERSPEECH 2023: 2578-2582 - [c224]Raphaël Olivier, Bhiksha Raj:
There is more than one kind of robustness: Fooling Whisper with adversarial examples. INTERSPEECH 2023: 4394-4398 - [c223]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. ACM Multimedia 2023: 2458-2467 - [c222]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Rita Singh, Bhiksha Raj:
PaintSeg: Painting Pixels for Training-free Segmentation. NeurIPS 2023 - [c221]Shentong Mo, Bhiksha Raj:
Weakly-Supervised Audio-Visual Segmentation. NeurIPS 2023 - [c220]Muhammad Shah, Aqsa Kashaf, Bhiksha Raj:
Training on Foveated Images Improves Robustness to Adversarial Attacks. NeurIPS 2023 - [c219]Thanh-Dat Truong, Hoang-Quan Nguyen, Bhiksha Raj, Khoa Luu:
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments. NeurIPS 2023 - [i126]Samiran Gode, Supreeth Bare, Bhiksha Raj, Hyungon Yoo:
Understanding Political Polarisation using Language Models: A dataset and method. CoRR abs/2301.00891 (2023) - [i125]Hao Chen, Ran Tao, Yue Fan, Yidong Wang, Jindong Wang, Bernt Schiele, Xing Xie, Bhiksha Raj, Marios Savvides:
SoftMatch: Addressing the Quantity-Quality Trade-off in Semi-supervised Learning. CoRR abs/2301.10921 (2023) - [i124]Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08088 (2023) - [i123]Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement. CoRR abs/2302.08095 (2023) - [i122]Laurie M. Heller, Benjamin Elizalde, Bhiksha Raj, Soham Deshmukh:
Synergy between human and machine approaches to sound/scene recognition and processing: An overview of ICASSP special session. CoRR abs/2302.09719 (2023) - [i121]Ankit Shah, Shuyi Chen, Kejun Zhou, Yue Chen, Bhiksha Raj:
Approach to Learning Generalized Audio Representation Through Batch Embedding Covariance Regularization and Constant-Q Transforms. CoRR abs/2303.03591 (2023) - [i120]Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Hojeong Lee, Ankit Shah, Shuo Han, Yunyang Zeng, Amanda Shu, Haohui Liu, Xuankai Chang, Hamza Khalid, Minseon Gwak, Kawon Lee, Minjeong Kim, Bhiksha Raj:
Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms. CoRR abs/2303.09048 (2023) - [i119]Thanh-Dat Truong, Ngan Le, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FREDOM: Fairness Domain Adaptation Approach to Semantic Scene Understanding. CoRR abs/2304.02135 (2023) - [i118]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. CoRR abs/2305.07969 (2023) - [i117]Hao Chen, Ankit Shah, Jindong Wang, Ran Tao, Yidong Wang, Xing Xie, Masashi Sugiyama, Rita Singh, Bhiksha Raj:
Imprecise Label Learning: A Unified Framework for Learning with Various Imprecise Label Configurations. CoRR abs/2305.12715 (2023) - [i116]Thanh-Dat Truong, Hoang-Quan Nguyen, Bhiksha Raj, Khoa Luu:
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments. CoRR abs/2305.15700 (2023) - [i115]Xiang Li, Chung-Ching Lin, Yinpeng Chen, Zicheng Liu, Jinglu Wang, Bhiksha Raj:
PaintSeg: Training-free Segmentation via Painting. CoRR abs/2305.19406 (2023) - [i114]Pha A. Nguyen, Kha Gia Quach, John Gauch, Samee U. Khan, Bhiksha Raj, Khoa Luu:
UTOPIA: Unconstrained Tracking Objects without Preliminary Examination via Cross-Domain Adaptation. CoRR abs/2306.09613 (2023) - [i113]Roshan S. Sharma, Kenneth Zheng, Siddhant Arora, Shinji Watanabe, Rita Singh, Bhiksha Raj:
BASS: Block-wise Adaptation for Speech Summarization. CoRR abs/2307.08217 (2023) - [i112]Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj:
Rethinking Voice-Face Correlation: A Geometry View. CoRR abs/2307.13948 (2023) - [i111]Liao Qu, Xianwei Zou, Xiang Li, Yandong Wen, Rita Singh, Bhiksha Raj:
The Hidden Dance of Phonemes and Visage: Unveiling the Enigmatic Link between Phonemes and Facial Features. CoRR abs/2307.13953 (2023) - [i110]Muhammad A. Shah, Bhiksha Raj:
Training on Foveated Images Improves Robustness to Adversarial Attacks. CoRR abs/2308.00854 (2023) - [i109]Muhammad Ahmed Shah, Bhiksha Raj:
Fixed Inter-Neuron Covariability Induces Adversarial Robustness. CoRR abs/2308.03956 (2023) - [i108]Soham Deshmukh, Benjamin Elizalde, Dimitra Emmanouilidou, Bhiksha Raj, Rita Singh, Huaming Wang:
Training Audio Captioning Models without Audio. CoRR abs/2309.07372 (2023) - [i107]Chien-yu Huang, Ke-Han Lu, Shih-Heng Wang, Chi-Yuan Hsiao, Chun-Yi Kuan, Haibin Wu, Siddhant Arora, Kai-Wei Chang, Jiatong Shi, Yifan Peng, Roshan S. Sharma, Shinji Watanabe, Bhiksha Ramakrishnan, Shady Shehata, Hung-yi Lee:
Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech. CoRR abs/2309.09510 (2023) - [i106]Ankit Shah, Fuyu Tang, Zelin Ye, Rita Singh, Bhiksha Raj:
Importance of negative sampling in weak label learning. CoRR abs/2309.13227 (2023) - [i105]Hao Chen, Jindong Wang, Ankit Shah, Ran Tao, Hongxin Wei, Xing Xie, Masashi Sugiyama, Bhiksha Raj:
Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks. CoRR abs/2309.17002 (2023) - [i104]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition. CoRR abs/2310.00132 (2023) - [i103]Dareen Alharthi, Roshan Sharma, Hira Dhamyal, Soumi Maiti, Bhiksha Raj, Rita Singh:
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech. CoRR abs/2310.00706 (2023) - [i102]Xiang Li, Yinpeng Chen, Chung-Ching Lin, Rita Singh, Bhiksha Raj, Zicheng Liu:
Completing Visual Objects via Bridging Generation and Segmentation. CoRR abs/2310.00808 (2023) - [i101]Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu:
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models. CoRR abs/2310.00900 (2023) - [i100]Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh:
Prompting Audios Using Acoustic Properties For Emotion Representation. CoRR abs/2310.02298 (2023) - [i99]Umberto Cappellazzo, Enrico Fini, Muqiao Yang, Daniele Falavigna, Alessio Brutti, Bhiksha Raj:
Continual Contrastive Spoken Language Understanding. CoRR abs/2310.02699 (2023) - [i98]Muhammad Ahmed Shah, Roshan Sharma, Hira Dhamyal, Raphaël Olivier, Ankit Shah, Joseph Konan, Dareen Alharthi, Hazim T. Bukhari, Massa Baali, Soham Deshmukh, Michael Kuhlmann, Bhiksha Raj, Rita Singh:
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model. CoRR abs/2310.04445 (2023) - [i97]Joseph Konan, Ojas Bhargave, Shikhar Agnihotri, Shuo Han, Yunyang Zeng, Ankit Shah, Bhiksha Raj:
Psychoacoustic Challenges Of Speech Enhancement On VoIP Platforms. CoRR abs/2310.07161 (2023) - [i96]Yandong Wen, Weiyang Liu, Yao Feng, Bhiksha Raj, Rita Singh, Adrian Weller, Michael J. Black, Bernhard Schölkopf:
Pairwise Similarity Learning is SimPLE. CoRR abs/2310.09449 (2023) - [i95]Yutian Chen, Hao Kang, Vivian Zhai, Liangze Li, Rita Singh, Bhiksha Raj:
Token Prediction as Implicit Classification to Identify LLM-Generated Text. CoRR abs/2311.08723 (2023) - [i94]Shentong Mo, Bhiksha Raj:
Weakly-Supervised Audio-Visual Segmentation. CoRR abs/2311.15080 (2023) - [i93]Thanh-Dat Truong, Utsav Prabhu, Bhiksha Raj, Jackson David Cothren, Khoa Luu:
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World. CoRR abs/2311.15965 (2023)
- 2022
- [c218]Roshan Sharma, Bhiksha Raj:
Cross-utterance context for multimodal video transcription. IEEECONF 2022: 1321-1325 - [c217]Yandong Wen, Weiyang Liu, Adrian Weller, Bhiksha Raj, Rita Singh:
SphereFace2: Binary Classification is All You Need for Deep Face Recognition. ICLR 2022 - [c216]Hira Dhamyal, Bhiksha Raj, Rita Singh:
Positional Encoding for Capturing Modality Specific Cadence for Emotion Detection. INTERSPEECH 2022: 166-170 - [c215]Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso:
Towards End-to-End Private Automatic Speaker Recognition. INTERSPEECH 2022: 2798-2802 - [c214]Muqiao Yang, Joseph Konan, David Bick, Anurag Kumar, Shinji Watanabe, Bhiksha Raj:
Improving Speech Enhancement through Fine-Grained Speech Characteristics. INTERSPEECH 2022: 2953-2957 - [c213]Raphaël Olivier, Bhiksha Raj:
Recent improvements of ASR models in the face of adversarial attacks. INTERSPEECH 2022: 4113-4117 - [c212]Yidong Wang, Hao Chen, Yue Fan, Wang Sun, Ran Tao, Wenxin Hou, Renjie Wang, Linyi Yang, Zhi Zhou, Lan-Zhe Guo, Heli Qi, Zhen Wu, Yufeng Li, Satoshi Nakamura, Wei Ye, Marios Savvides, Bhiksha Raj, Takahiro Shinozaki, Bernt Schiele, Jindong Wang, Xing Xie, Yue Zhang:
USB: A Unified Semi-supervised Learning Benchmark for Classification. NeurIPS 2022 - [i92]Larry Tang, Po Hao Chou, Yi Yu Zheng, Ziqian Ge, Ankit Shah, Bhiksha Raj:
Ontological Learning from Weak Labels. CoRR abs/2203.02483 (2022) - [i91]Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse H. Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk:
HEAR 2021: Holistic Evaluation of Audio Representations. CoRR abs/2203.03022 (2022) - [i90]Shentong Mo, Jingfei Xia, Xiaoqing Tan, Bhiksha Raj:
Point3D: tracking actions as moving points with 3D CNNs. CoRR abs/2203.10584 (2022) - [i89]Raphaël Olivier, Bhiksha Raj:
Recent improvements of ASR models in the face of adversarial attacks. CoRR abs/2203.16536 (2022) - [i88]Ankit Shah, Hira Dhamyal, Yang Gao, Rita Singh, Bhiksha Raj:
On the pragmatism of using binary classifiers over data intensive neural network classifiers for detection of COVID-19 from voice. CoRR abs/2204.04802 (2022) - [i87]Yidong Wang, Hao Chen, Qiang Heng, Wenxin Hou, Yue Fan, Zhen Wu, Marios Savvides, Takahiro Shinozaki, Bhiksha Raj, Bernt Schiele:
FreeMatch: Self-adaptive Thresholding for Semi-supervised Learning. CoRR abs/2205.07246 (2022) - [i86]Chonghan Chen, Qi Jiang, Chih-Hao Wang, Noel Chen, Haohan Wang, Xiang Li, Bhiksha Raj:
Bear the Query in Mind: Visual Grounding with Query-condit