Feng Yao

Rose-SQL: Role-State Evolution Guided Structured Reasoning for Multi-Turn Text-to-SQL

May 05, 2026
Via arXiv

CocoaBench: Evaluating Unified Digital Agents in the Wild

Apr 14, 2026
Via arXiv

IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL

Mar 12, 2026
Via arXiv

Contamination Detection for VLMs using Multi-Modal Semantic Perturbation

Nov 05, 2025
Via arXiv

Finish First, Perfect Later: Test-Time Token-Level Cross-Validation for Diffusion Large Language Models

Oct 06, 2025
Via arXiv

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Jun 17, 2025
Via arXiv

ProDiff: Prototype-Guided Diffusion for Minimal Information Trajectory Imputation

May 29, 2025
Via arXiv

Training Language Models to Generate Quality Code with Program Analysis Feedback

May 28, 2025
Via arXiv

Next-Token Prediction Task Assumes Optimal Data Ordering for LLM Training in Proof Generation

Oct 30, 2024
Via arXiv

ShifCon: Enhancing Non-Dominant Language Capabilities with a Shift-based Contrastive Framework

Oct 25, 2024
Via arXiv