Picture for Wenting Zhao

Wenting Zhao

Reasoning over mathematical objects: on-policy reward modeling and test time aggregation

Add code
Mar 19, 2026
Viaarxiv icon

Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models

Add code
Mar 04, 2026
Viaarxiv icon

Qwen3-Coder-Next Technical Report

Add code
Feb 28, 2026
Viaarxiv icon

AudioCapBench: Quick Evaluation on Audio Captioning across Sound, Music, and Speech

Add code
Feb 27, 2026
Viaarxiv icon

Scaling Agentic Verifier for Competitive Coding

Add code
Feb 04, 2026
Viaarxiv icon

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Add code
Feb 02, 2026
Viaarxiv icon

SweRank+: Multilingual, Multi-Turn Code Ranking for Software Issue Localization

Add code
Dec 23, 2025
Viaarxiv icon

Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement

Add code
Nov 08, 2025
Figure 1 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 2 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 3 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Figure 4 for Self-Abstraction from Grounded Experience for Plan-Guided Policy Refinement
Viaarxiv icon

StepWiser: Stepwise Generative Judges for Wiser Reasoning

Add code
Aug 27, 2025
Figure 1 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 2 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 3 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Figure 4 for StepWiser: Stepwise Generative Judges for Wiser Reasoning
Viaarxiv icon

Towards LLM Agents for Earth Observation

Add code
Apr 16, 2025
Viaarxiv icon