Picture for Junzhe Wang

Junzhe Wang

Faithfulness-QA: A Counterfactual Entity Substitution Dataset for Training Context-Faithful RAG Models

Add code
Apr 28, 2026
Viaarxiv icon

Enhancing LLM-based Search Agents via Contribution Weighted Group Relative Policy Optimization

Add code
Apr 15, 2026
Viaarxiv icon

MM-Doc-R1: Training Agents for Long Document Visual Question Answering through Multi-turn Reinforcement Learning

Add code
Apr 15, 2026
Viaarxiv icon

LLM-Based Scientific Equation Discovery via Physics-Informed Token-Regularized Policy Optimization

Add code
Feb 11, 2026
Viaarxiv icon

CL-bench: A Benchmark for Context Learning

Add code
Feb 03, 2026
Viaarxiv icon

CARE-Bench: A Benchmark of Diverse Client Simulations Guided by Expert Principles for Evaluating LLMs in Psychological Counseling

Add code
Nov 12, 2025
Viaarxiv icon

Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM

Add code
Nov 07, 2025
Figure 1 for Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
Figure 2 for Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
Figure 3 for Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
Figure 4 for Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
Viaarxiv icon

LLMEval-3: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

Add code
Aug 07, 2025
Viaarxiv icon

Psychological Counseling Cannot Be Achieved Overnight: Automated Psychological Counseling Through Multi-Session Conversations

Add code
Jun 07, 2025
Viaarxiv icon

Active Contact Engagement for Aerial Navigation in Unknown Environments with Glass

Add code
May 01, 2025
Viaarxiv icon