


Ali Vosoughi
2020 – today
- 2025
[i16]Jing Bi, Junjia Guo, Susan Liang, Guangyu Sun, Luchuan Song, Yunlong Tang, Jinxi He, Jiarui Wu, Ali Vosoughi, Chen Chen, Chenliang Xu:
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity. CoRR abs/2503.11557 (2025)
[i15]Yunlong Tang, Jing Bi, Chao Huang, Susan Liang, Daiki Shimada, Hang Hua, Yunzhong Xiao, Yizhi Song, Pinxin Liu, Mingqian Feng, Junjia Guo, Zhuo Liu, Luchuan Song, Ali Vosoughi, Jinxi He, Liu He, Zeliang Zhang, Jiebo Luo, Chenliang Xu:
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting. CoRR abs/2504.05541 (2025)
[i14]Jing Bi, Pinxin Liu, Ali Vosoughi, Jiarui Wu, Jinxi He, Chenliang Xu:
I2G: Generating Instructional Illustrations via Text-Conditioned Diffusion. CoRR abs/2505.16425 (2025)
[i13]Yunlong Tang, Pinxin Liu, Mingqian Feng, Zhangyun Tan, Rui Mao, Chao Huang, Jing Bi, Yunzhong Xiao, Susan Liang, Hang Hua, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Chenliang Xu:
MMPerspective: Do MLLMs Understand Perspective? A Comprehensive Benchmark for Perspective Perception, Reasoning, and Robustness. CoRR abs/2505.20426 (2025)
[i12]Ali Vosoughi, Jing Bi, Pinxin Liu, Yunlong Tang, Chenliang Xu:
Can Sound Replace Vision in LLaVA With Token Substitution? CoRR abs/2506.10416 (2025)
[i11]Ali Vosoughi, Ayoub Shahnazari, Yufeng Xi, Zeliang Zhang, Griffin Hess, Chenliang Xu, Niaz Abdolrahim:
OPENXRD: A Comprehensive Benchmark and Enhancement Framework for LLM/MLLM XRD Question Answering. CoRR abs/2507.09155 (2025)
[i10]Yolo Yunlong Tang, Jing Bi, Pinxin Liu, Zhenyu Pan, Zhangyun Tan, Qianxiang Shen, Jiani Liu, Hang Hua, Junjia Guo, Yunzhong Xiao, Chao Huang, Zhiyuan Wang, Susan Liang, Xinyi Liu, Yizhi Song, Junhua Huang, Jia-Xing Zhong, Bozheng Li, Daiqing Qi, Ziyun Zeng, Ali Vosoughi, Luchuan Song, Zeliang Zhang, Daiki Shimada, Han Liu, Jiebo Luo, Chenliang Xu:
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models. CoRR abs/2510.05034 (2025)
[i9]Jing Bi, Guangyu Sun, Ali Vosoughi, Chen Chen, Chenliang Xu:
Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward. CoRR abs/2510.20696 (2025)
[i8]Ali Vosoughi, Yongyi Zang, Qihui Yang, Nathan Paek, Randal J. Leistikow, Chenliang Xu:
PromptReverb: Multimodal Room Impulse Response Generation Through Latent Rectified Flow Matching. CoRR abs/2510.22439 (2025)
- 2024
[j1]Ali Vosoughi, Shijian Deng, Songyang Zhang, Yapeng Tian, Chenliang Xu, Jiebo Luo:
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA. IEEE Trans. Multim. 26: 8609-8624 (2024)
[c3]Ali Vosoughi, Luca Bondi, Ho-Hsiang Wu, Chenliang Xu:
Learning Audio Concepts from Counterfactual Natural Language. ICASSP 2024: 366-370
[c2]Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. ACM Multimedia 2024: 1682-1691
[c1]Nguyen Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. NAACL-HLT (Findings) 2024: 3565-3576
[i7]Ali Vosoughi, Luca Bondi, Ho-Hsiang Wu, Chenliang Xu:
Learning Audio Concepts from Counterfactual Natural Language. CoRR abs/2401.04935 (2024)
[i6]Nguyen Manh Nguyen, Jing Bi, Ali Vosoughi, Yapeng Tian, Pooyan Fazli, Chenliang Xu:
OSCaR: Object State Captioning and State Change Representation. CoRR abs/2402.17128 (2024)
[i5]Jing Bi, Yunlong Tang, Luchuan Song, Ali Vosoughi, Nguyen Nguyen, Chenliang Xu:
EAGLE: Egocentric AGgregated Language-video Engine. CoRR abs/2409.17523 (2024)
- 2023
[i4]Ali Vosoughi, Shijian Deng, Songyang Zhang, Yapeng Tian, Chenliang Xu, Jiebo Luo:
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA. CoRR abs/2305.19664 (2023)
[i3]Jing Bi, Nguyen Manh Nguyen, Ali Vosoughi, Chenliang Xu:
MISAR: A Multimodal Instructional System with Augmented Reality. CoRR abs/2310.11699 (2023)
[i2]Yiyang Su, Ali Vosoughi, Shijian Deng, Yapeng Tian, Chenliang Xu:
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation. CoRR abs/2310.11713 (2023)
[i1]Yunlong Tang, Jing Bi, Siting Xu, Luchuan Song, Susan Liang, Teng Wang, Daoan Zhang, Jie An, Jingyang Lin, Rongyi Zhu, Ali Vosoughi, Chao Huang, Zeliang Zhang, Feng Zheng, Jianguo Zhang, Ping Luo, Jiebo Luo, Chenliang Xu:
Video Understanding with Large Language Models: A Survey. CoRR abs/2312.17432 (2023)
last updated on 2025-11-22 05:19 CET by the dblp team
all metadata released as open data under CC0 1.0 license







