​
Login / Signup
Ali Vosoughi
Publication Activity (10 Years)
Years Active: 2023-2024
Publications (10 Years): 8
Top Topics
Language Model
Question Answering
Natural Language
Visual Features
Top Venues
CoRR
ICASSP
IEEE Trans. Multim.
</>
Publications
</>
Ali Vosoughi
,
Shijian Deng
,
Songyang Zhang
,
Yapeng Tian
,
Chenliang Xu
,
Jiebo Luo
Cross Modality Bias in Visual Question Answering: A Causal View With Possible Worlds VQA.
IEEE Trans. Multim.
26 (2024)
Ali Vosoughi
,
Luca Bondi
,
Ho-Hsiang Wu
,
Chenliang Xu
Learning Audio Concepts from Counterfactual Natural Language.
CoRR
(2024)
Nguyen Manh Nguyen
,
Jing Bi
,
Ali Vosoughi
,
Yapeng Tian
,
Pooyan Fazli
,
Chenliang Xu
OSCaR: Object State Captioning and State Change Representation.
CoRR
(2024)
Ali Vosoughi
,
Luca Bondi
,
Ho-Hsiang Wu
,
Chenliang Xu
Learning Audio Concepts from Counterfactual Natural Language.
ICASSP
(2024)
Yunlong Tang
,
Jing Bi
,
Siting Xu
,
Luchuan Song
,
Susan Liang
,
Teng Wang
,
Daoan Zhang
,
Jie An
,
Jingyang Lin
,
Rongyi Zhu
,
Ali Vosoughi
,
Chao Huang
,
Zeliang Zhang
,
Feng Zheng
,
Jianguo Zhang
,
Ping Luo
,
Jiebo Luo
,
Chenliang Xu
Video Understanding with Large Language Models: A Survey.
CoRR
(2023)
Jing Bi
,
Nguyen Manh Nguyen
,
Ali Vosoughi
,
Chenliang Xu
MISAR: A Multimodal Instructional System with Augmented Reality.
CoRR
(2023)
Yiyang Su
,
Ali Vosoughi
,
Shijian Deng
,
Yapeng Tian
,
Chenliang Xu
Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation.
CoRR
(2023)
Ali Vosoughi
,
Shijian Deng
,
Songyang Zhang
,
Yapeng Tian
,
Chenliang Xu
,
Jiebo Luo
Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
CoRR
(2023)