Picture for Yilun Zhao

Yilun Zhao

TableVista: Benchmarking Multimodal Table Reasoning under Visual and Structural Complexity

Add code
May 07, 2026
Viaarxiv icon

Rethinking Reasoning-Intensive Retrieval: Evaluating and Advancing Retrievers in Agentic Search Systems

Add code
May 05, 2026
Viaarxiv icon

TexOCR: Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction

Add code
Apr 24, 2026
Viaarxiv icon

SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning

Add code
Mar 12, 2026
Viaarxiv icon

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Add code
Mar 10, 2026
Viaarxiv icon

Deconstructing Multimodal Mathematical Reasoning: Towards a Unified Perception-Alignment-Reasoning Paradigm

Add code
Mar 09, 2026
Viaarxiv icon

ANCHOR: Branch-Point Data Generation for GUI Agents

Add code
Feb 06, 2026
Viaarxiv icon

SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Add code
Feb 05, 2026
Viaarxiv icon

Rethinking Composed Image Retrieval Evaluation: A Fine-Grained Benchmark from Image Editing

Add code
Jan 22, 2026
Viaarxiv icon

Patient-Similarity Cohort Reasoning in Clinical Text-to-SQL

Add code
Jan 14, 2026
Viaarxiv icon