Picture for Qingyang Wu

Qingyang Wu

Introspective Diffusion Language Models

Add code
Apr 13, 2026
Viaarxiv icon

Squeeze Evolve: Unified Multi-Model Orchestration for Verifier-Free Evolution

Add code
Apr 09, 2026
Viaarxiv icon

$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners

Add code
Mar 04, 2026
Viaarxiv icon

When RL Meets Adaptive Speculative Training: A Unified Training-Serving System

Add code
Feb 06, 2026
Viaarxiv icon

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time

Add code
Dec 31, 2025
Viaarxiv icon

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

Add code
Nov 17, 2025
Viaarxiv icon

Data Diversification Methods In Alignment Enhance Math Performance In LLMs

Add code
Jul 02, 2025
Figure 1 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 2 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 3 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Figure 4 for Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Viaarxiv icon

Disentangling Reasoning and Knowledge in Medical Large Language Models

Add code
May 16, 2025
Figure 1 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 2 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 3 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Figure 4 for Disentangling Reasoning and Knowledge in Medical Large Language Models
Viaarxiv icon

How Well Can General Vision-Language Models Learn Medicine By Watching Public Educational Videos?

Add code
Apr 19, 2025
Viaarxiv icon

Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods

Add code
Apr 18, 2025
Viaarxiv icon