Ali Vosoughi

2020 – today

2025
[i16]
  - https://dblp.org/rec/journals/corr/abs-2503-11557
Jing Bi, Junjia Guo, Susan Liang, Guangyu Sun, Luchuan Song, Yunlong Tang, Jinxi He, Jiarui Wu, Ali Vosoughi, Chen Chen, Chenliang Xu:
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity. CoRR abs/2503.11557 (2025)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2504-05541
Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo, Chenliang Xu:
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting. CoRR abs/2504.05541 (2025)
[i14]
  - https://dblp.org/rec/journals/corr/abs-2505-16425
Jing Bi, Pinxin Liu, Ali Vosoughi, Jiarui Wu, Jinxi He, Chenliang Xu:
I²G: Generating Instructional Illustrations via Text-Conditioned Diffusion. CoRR abs/2505.16425 (2025)
[i13]
  - https://dblp.org/rec/journals/corr/abs-2505-20426
Yunlong Tang, Pinxin Liu, Mingqian Feng, Zhangyun Tan, Rui Mao, Chao Huang, Jing Bi, Yunzhong Xiao, Susan Liang, Hang Hua, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Chenliang Xu:
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness. CoRR abs/2505.20426 (2025)
[i12]
  - https://dblp.org/rec/journals/corr/abs-2506-10416
Ali Vosoughi, Jing Bi, Pinxin Liu, Yunlong Tang, Chenliang Xu:
Can Sound Replace Vision in LLaVA With Token Substitution? CoRR abs/2506.10416 (2025)
[i11]
  - https://dblp.org/rec/journals/corr/abs-2507-09155
Ali Vosoughi, Ayoub Shahnazari, Yufeng Xi, Zeliang Zhang, Griffin Hess, Chenliang Xu, Niaz Abdolrahim:
OPENXRD: A Comprehensive Benchmark and Enhancement Framework for LLM/MLLM XRD Question Answering. CoRR abs/2507.09155 (2025)
[i10]
  - https://dblp.org/rec/journals/corr/abs-2510-05034
Yolo Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Junhua Huang, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu:
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models. CoRR abs/2510.05034 (2025)
[i9]
  - https://dblp.org/rec/journals/corr/abs-2510-20696
Jing Bi, Guangyu Sun, Ali Vosoughi, Chen Chen, Chenliang Xu:
Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward. CoRR abs/2510.20696 (2025)
[i8]
  - https://dblp.org/rec/journals/corr/abs-2510-22439
Ali Vosoughi, Yongyi Zang, Qihui Yang, Nathan Paek, Randal J. Leistikow, Chenliang Xu:
PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching. CoRR abs/2510.22439 (2025)
2024
[j1]
  - https://dblp.org/rec/journals/tmm/VosoughiDZTXL24
Ali Vosoughi, Shijian Deng, Songyang Zhang, Yapeng Tian, Chenliang Xu, Jiebo Luo:
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA. IEEE Trans. Multim. 26: 8609-8624 (2024)
[c3]
  - https://dblp.org/rec/conf/icassp/VosoughiBWX24
Ali Vosoughi, Luca Bondi, Ho-Hsiang Wu, Chenliang Xu:
Learning Audio Concepts from Counterfactual Natural Language. ICASSP 2024: 366-370
[c2]
  - https://dblp.org/rec/conf/mm/Bi0SVNX24
Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. ACM Multimedia 2024: 1682-1691
[c1]
  - https://dblp.org/rec/conf/naacl/NguyenBVTFX24
Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. NAACL-HLT (Findings) 2024: 3565-3576
[i7]
  - https://dblp.org/rec/journals/corr/abs-2401-04935
Ali Vosoughi, Luca Bondi, Ho-Hsiang Wu, Chenliang Xu:
Learning Audio Concepts from Counterfactual Natural Language. CoRR abs/2401.04935 (2024)
[i6]
  - https://dblp.org/rec/journals/corr/abs-2402-17128
Nguyen Manh Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. CoRR abs/2402.17128 (2024)
[i5]
  - https://dblp.org/rec/journals/corr/abs-2409-17523
Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. CoRR abs/2409.17523 (2024)
2023
[i4]
  - https://dblp.org/rec/journals/corr/abs-2305-19664
Ali Vosoughi, Shijian Deng, Songyang Zhang, Yapeng Tian, Chenliang Xu, Jiebo Luo:
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA. CoRR abs/2305.19664 (2023)
[i3]
  - https://dblp.org/rec/journals/corr/abs-2310-11699
Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu:
MISAR: A Multimodal Instructional System with Augmented Reality. CoRR abs/2310.11699 (2023)
[i2]
  - https://dblp.org/rec/journals/corr/abs-2310-11713
Yiyang Su, Ali Vosoughi, Shijian Deng, Yapeng Tian, Chenliang Xu:
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation. CoRR abs/2310.11713 (2023)
[i1]
  - https://dblp.org/rec/journals/corr/abs-2312-17432
Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu:
Video Understanding with Large Language Models: A Survey. CoRR abs/2312.17432 (2023)
