Picture for Jiale Yuan

Jiale Yuan

Steering One-Step Diffusion Model with Fidelity-Rich Decoder for Fast Image Compression

Add code
Aug 07, 2025
Viaarxiv icon

ChineseSimpleVQA -- "See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models

Add code
Feb 19, 2025
Viaarxiv icon

Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration

Add code
Jan 09, 2025
Figure 1 for Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration
Figure 2 for Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration
Figure 3 for Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration
Figure 4 for Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration
Viaarxiv icon

LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating

Add code
Dec 24, 2024
Viaarxiv icon

ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models

Add code
May 24, 2024
Viaarxiv icon