Picture for Yuxiong He

Yuxiong He

DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science

Add code
Feb 27, 2026
Viaarxiv icon

Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning

Add code
Feb 11, 2026
Viaarxiv icon

Fast and Accurate Causal Parallel Decoding using Jacobi Forcing

Add code
Dec 16, 2025
Viaarxiv icon

Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI

Add code
Jul 16, 2025
Viaarxiv icon

ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Add code
Mar 25, 2025
Viaarxiv icon

ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments

Add code
Feb 27, 2025
Viaarxiv icon

CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation

Add code
Dec 19, 2024
Figure 1 for CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation
Figure 2 for CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation
Figure 3 for CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation
Figure 4 for CORD: Balancing COnsistency and Rank Distillation for Robust Retrieval-Augmented Generation
Viaarxiv icon

Inference Scaling for Bridging Retrieval and Augmented Generation

Add code
Dec 14, 2024
Viaarxiv icon

SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation

Add code
Oct 04, 2024
Viaarxiv icon

STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning

Add code
Sep 10, 2024
Figure 1 for STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Figure 2 for STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Figure 3 for STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Figure 4 for STUN: Structured-Then-Unstructured Pruning for Scalable MoE Pruning
Viaarxiv icon