Picture for Zhiyuan Fan

Zhiyuan Fan

May

Empowering Reliable Visual-Centric Instruction Following in MLLMs

Add code
Jan 06, 2026
Viaarxiv icon

A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis

Add code
Dec 16, 2025
Figure 1 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 2 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 3 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 4 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Viaarxiv icon

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Add code
Nov 12, 2025
Viaarxiv icon

Diversity-Enhanced Reasoning for Subjective Questions

Add code
Jul 27, 2025
Figure 1 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 2 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 3 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 4 for Diversity-Enhanced Reasoning for Subjective Questions
Viaarxiv icon

CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions

Add code
Jul 08, 2025
Viaarxiv icon

MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence Calibration

Add code
May 29, 2025
Viaarxiv icon

V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations

Add code
Apr 24, 2025
Figure 1 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 2 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 3 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 4 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Viaarxiv icon

Efficient Near-Optimal Algorithm for Online Shortest Paths in Directed Acyclic Graphs with Bandit Feedback Against Adaptive Adversaries

Add code
Apr 01, 2025
Viaarxiv icon

Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction

Add code
Mar 13, 2025
Figure 1 for Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction
Figure 2 for Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction
Figure 3 for Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction
Figure 4 for Prototype-Guided Cross-Modal Knowledge Enhancement for Adaptive Survival Prediction
Viaarxiv icon

CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering

Add code
Jan 30, 2025
Figure 1 for CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
Figure 2 for CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
Figure 3 for CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
Figure 4 for CALM: Unleashing the Cross-Lingual Self-Aligning Ability of Language Model Question Answering
Viaarxiv icon