Alert button
Picture for Linli Yao

Linli Yao

Alert button

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

Add code
Bookmark button
Alert button
Apr 16, 2024
Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun

Viaarxiv icon

TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Add code
Bookmark button
Alert button
Dec 04, 2023
Shuhuai Ren, Linli Yao, Shicheng Li, Xu Sun, Lu Hou

Viaarxiv icon

Edit As You Wish: Video Description Editing with Multi-grained Commands

Add code
Bookmark button
Alert button
May 15, 2023
Linli Yao, Yuanmeng Zhang, Ziheng Wang, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

Figure 1 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 2 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 3 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Figure 4 for Edit As You Wish: Video Description Editing with Multi-grained Commands
Viaarxiv icon

Rethinking Benchmarks for Cross-modal Image-text Retrieval

Add code
Bookmark button
Alert button
Apr 21, 2023
Weijing Chen, Linli Yao, Qin Jin

Figure 1 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 2 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 3 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Figure 4 for Rethinking Benchmarks for Cross-modal Image-text Retrieval
Viaarxiv icon

CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge

Add code
Bookmark button
Alert button
Nov 17, 2022
Linli Yao, Weijing Chen, Qin Jin

Figure 1 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 2 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 3 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Figure 4 for CapEnrich: Enriching Caption Semantics for Web Images via Cross-modal Pre-trained Knowledge
Viaarxiv icon

Image Difference Captioning with Pre-training and Contrastive Learning

Add code
Bookmark button
Alert button
Feb 09, 2022
Linli Yao, Weiying Wang, Qin Jin

Figure 1 for Image Difference Captioning with Pre-training and Contrastive Learning
Figure 2 for Image Difference Captioning with Pre-training and Contrastive Learning
Figure 3 for Image Difference Captioning with Pre-training and Contrastive Learning
Figure 4 for Image Difference Captioning with Pre-training and Contrastive Learning
Viaarxiv icon

YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos

Add code
Bookmark button
Alert button
Apr 12, 2020
Shizhe Chen, Weiying Wang, Ludan Ruan, Linli Yao, Qin Jin

Figure 1 for YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos
Figure 2 for YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos
Figure 3 for YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos
Figure 4 for YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos
Viaarxiv icon