Picture for Wenxuan Zhou

Wenxuan Zhou

Principled Foundations for Preference Optimization

Add code
Jul 10, 2025
Viaarxiv icon

Code Execution as Grounded Supervision for LLM Reasoning

Add code
Jun 12, 2025
Viaarxiv icon

Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation

Add code
May 18, 2025
Viaarxiv icon

MetaScale: Test-Time Scaling with Evolving Meta-Thoughts

Add code
Mar 17, 2025
Viaarxiv icon

Semantic-Clipping: Efficient Vision-Language Modeling with Semantic-Guidedd Visual Selection

Add code
Mar 14, 2025
Viaarxiv icon

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

Add code
Feb 19, 2025
Viaarxiv icon

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

Add code
Jan 31, 2025
Viaarxiv icon

FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings

Add code
Jan 11, 2025
Figure 1 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 2 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 3 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Figure 4 for FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
Viaarxiv icon

T-REG: Preference Optimization with Token-Level Reward Regularization

Add code
Dec 03, 2024
Viaarxiv icon

Improving Model Factuality with Fine-grained Critique-based Evaluator

Add code
Oct 24, 2024
Viaarxiv icon