Picture for Yunfang Wu

Yunfang Wu

Key Laboratory of Computational Linguistics, Ministry of Education, China, School of Computer Science, Peking University, China

SyncThink: A Training-Free Strategy to Align Inference Termination with Reasoning Saturation

Add code
Jan 07, 2026
Viaarxiv icon

Safety-Utility Conflicts Are Not Global: Surgical Alignment via Head-Level Diagnosis

Add code
Jan 07, 2026
Viaarxiv icon

One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents

Add code
Dec 24, 2025
Viaarxiv icon

Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error

Add code
Oct 30, 2025
Viaarxiv icon

Think Outside the Policy: In-Context Steered Policy Optimization

Add code
Oct 30, 2025
Viaarxiv icon

Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing

Add code
Sep 18, 2025
Figure 1 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 2 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 3 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 4 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Viaarxiv icon

Composable Cross-prompt Essay Scoring by Merging Models

Add code
May 24, 2025
Figure 1 for Composable Cross-prompt Essay Scoring by Merging Models
Figure 2 for Composable Cross-prompt Essay Scoring by Merging Models
Figure 3 for Composable Cross-prompt Essay Scoring by Merging Models
Figure 4 for Composable Cross-prompt Essay Scoring by Merging Models
Viaarxiv icon

ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy

Add code
May 21, 2025
Figure 1 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 2 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 3 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Figure 4 for ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy
Viaarxiv icon

Dynamic Fisher-weighted Model Merging via Bayesian Optimization

Add code
Apr 26, 2025
Figure 1 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 2 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 3 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Figure 4 for Dynamic Fisher-weighted Model Merging via Bayesian Optimization
Viaarxiv icon

From Prompting to Alignment: A Generative Framework for Query Recommendation

Add code
Apr 14, 2025
Figure 1 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 2 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 3 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 4 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Viaarxiv icon