Ziyue Lin

VTI-CoT: Visual-Textual Interleaved Chain of Thought for Video Reasoning

Jun 04, 2026
Via arXiv

Decoupled Residual Denoising Diffusion Models for Unified and Data Efficient Image-to-Image Translation

May 31, 2026
Via arXiv

FedVLMBench: Benchmarking Federated Fine-Tuning of Vision-Language Models

Jun 11, 2025
Via arXiv

MLLM-Bench: Evaluating Multi-modal LLMs using GPT-4V

Nov 23, 2023
Via arXiv