Picture for Yanghua Xiao

Yanghua Xiao

Enhancing Multimodal In-Context Learning via Inductive-Deductive Reasoning

Add code
May 04, 2026
Viaarxiv icon

On the Trainability of Masked Diffusion Language Models via Blockwise Locality

Add code
Apr 27, 2026
Viaarxiv icon

The GaoYao Benchmark: A Comprehensive Framework for Evaluating Multilingual and Multicultural Abilities of Large Language Models

Add code
Apr 22, 2026
Viaarxiv icon

SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment

Add code
Apr 14, 2026
Viaarxiv icon

RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following

Add code
Mar 26, 2026
Viaarxiv icon

From AI Assistant to AI Scientist: Autonomous Discovery of LLM-RL Algorithms with LLM Agents

Add code
Mar 25, 2026
Viaarxiv icon

Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning

Add code
Mar 19, 2026
Viaarxiv icon

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use

Add code
Mar 10, 2026
Viaarxiv icon

CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning

Add code
Mar 09, 2026
Viaarxiv icon

HER: Human-like Reasoning and Reinforcement Learning for LLM Role-playing

Add code
Jan 29, 2026
Viaarxiv icon