Yuxiong He

R$^3$-SQL: Ranking Reward and Resampling for Text-to-SQL

Apr 28, 2026

Learning to Hint for Reinforcement Learning

Apr 01, 2026

Learning to Self-Evolve

Mar 19, 2026

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Feb 27, 2026

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Feb 11, 2026

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Dec 16, 2025

Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI

Jul 16, 2025

ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Mar 25, 2025

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments

Feb 27, 2025

CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

Dec 19, 2024