Picture for Yi R. Fung

Yi R. Fung

EcomBench: Towards Holistic Evaluation of Foundation Agents in E-commerce

Add code
Dec 11, 2025
Viaarxiv icon

Scaling Environments for LLM Agents in the Era of Learning from Interaction: A Survey

Add code
Nov 12, 2025
Viaarxiv icon

Reasoning Path Divergence: A New Metric and Curation Strategy to Unlock LLM Diverse Thinking

Add code
Oct 30, 2025
Viaarxiv icon

Lean4Physics: Comprehensive Reasoning Framework for College-level Physics in Lean4

Add code
Oct 30, 2025
Viaarxiv icon

Diversity-Enhanced Reasoning for Subjective Questions

Add code
Jul 27, 2025
Figure 1 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 2 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 3 for Diversity-Enhanced Reasoning for Subjective Questions
Figure 4 for Diversity-Enhanced Reasoning for Subjective Questions
Viaarxiv icon

DocCHA: Towards LLM-Augmented Interactive Online diagnosis System

Add code
Jul 10, 2025
Viaarxiv icon

CultureCLIP: Empowering CLIP with Cultural Awareness through Synthetic Images and Contextualized Captions

Add code
Jul 08, 2025
Viaarxiv icon

MATP-BENCH: Can MLLM Be a Good Automated Theorem Prover for Multimodal Problems?

Add code
Jun 06, 2025
Viaarxiv icon

AdaCtrl: Towards Adaptive and Controllable Reasoning via Difficulty-Aware Budgeting

Add code
May 24, 2025
Viaarxiv icon

V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations

Add code
Apr 24, 2025
Figure 1 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 2 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 3 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Figure 4 for V$^2$R-Bench: Holistically Evaluating LVLM Robustness to Fundamental Visual Variations
Viaarxiv icon