Picture for Yunfang Wu

Yunfang Wu

Key Laboratory of Computational Linguistics, Ministry of Education, China, School of Computer Science, Peking University, China

Think Outside the Policy: In-Context Steered Policy Optimization

Add code
Oct 30, 2025
Viaarxiv icon

Do Not Step Into the Same River Twice: Learning to Reason from Trial and Error

Add code
Oct 30, 2025
Viaarxiv icon

Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing

Add code
Sep 18, 2025
Figure 1 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 2 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 3 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Figure 4 for Beyond Spurious Signals: Debiasing Multimodal Large Language Models via Counterfactual Inference and Adaptive Expert Routing
Viaarxiv icon

Composable Cross-prompt Essay Scoring by Merging Models

Add code
May 24, 2025
Viaarxiv icon

ThinkLess: A Training-Free Inference-Efficient Method for Reducing Reasoning Redundancy

Add code
May 21, 2025
Viaarxiv icon

Dynamic Fisher-weighted Model Merging via Bayesian Optimization

Add code
Apr 26, 2025
Viaarxiv icon

From Prompting to Alignment: A Generative Framework for Query Recommendation

Add code
Apr 14, 2025
Figure 1 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 2 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 3 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Figure 4 for From Prompting to Alignment: A Generative Framework for Query Recommendation
Viaarxiv icon

Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring

Add code
Apr 08, 2025
Viaarxiv icon

Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging

Add code
Feb 17, 2025
Figure 1 for Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
Figure 2 for Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
Figure 3 for Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
Figure 4 for Optimal Brain Iterative Merging: Mitigating Interference in LLM Merging
Viaarxiv icon

A Survey of Uncertainty Estimation in LLMs: Theory Meets Practice

Add code
Oct 20, 2024
Figure 1 for A Survey of Uncertainty Estimation in LLMs: Theory Meets Practice
Viaarxiv icon