Linli Yao
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 15
Top Topics
Visual Data
Text Generation
Diffusion Models
Long Video
Top Venues
CoRR
MediaEval
SIGIR
WWW
Publications
Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
CoRR (2024)
Yuting Mei, Linli Yao, Qin Jin
UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos.
ICMR (2024)
Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?
NAACL-HLT (2024)
Yuting Mei, Linli Yao, Qin Jin
UBiSS: A Unified Framework for Bimodal Semantic Summarization of Videos.
CoRR (2024)
Linli Yao, Lei Li, Shuhuai Ren, Lean Wang, Yuanxin Liu, Xu Sun, Lu Hou
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models.
CoRR (2024)
Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin
Edit As You Wish: Video Description Editing with Multi-grained Commands.
CoRR (2023)
Weijing Chen, Linli Yao, Qin Jin
Rethinking Benchmarks for Cross-modal Image-text Retrieval.
SIGIR (2023)
Weijing Chen, Linli Yao, Qin Jin
Rethinking Benchmarks for Cross-modal Image-text Retrieval.
CoRR (2023)
Linli Yao, Weijing Chen, Qin Jin
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge.
WWW (2023)
Shuhuai Ren, Linli Yao, Shicheng Li, Xu Sun, Lu Hou
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding.
CoRR (2023)
Linli Yao, Weiying Wang, Qin Jin
Image Difference Captioning with Pre-training and Contrastive Learning.
CoRR (2022)
Linli Yao, Weiying Wang, Qin Jin
Image Difference Captioning with Pre-training and Contrastive Learning.
AAAI (2022)
Linli Yao, Weijing Chen, Qin Jin
CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge.
CoRR (2022)
Shizhe Chen, Weiying Wang, Ludan Ruan, Linli Yao, Qin Jin
YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos.
CoRR (2020)
Shuai Wang, Linli Yao, Jieting Chen, Qin Jin
RUC at MediaEval 2019: Video Memorability Prediction Based on Visual Textual and Concept Related Features.
MediaEval (2019)