Yuejie Chi

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Jan 20, 2026

Preconditioning Benefits of Spectral Orthogonalization in Muon

Jan 20, 2026

Polynomial Convergence of Riemannian Diffusion Models

Jan 05, 2026

Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization

Nov 10, 2025

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Aug 11, 2025

Scalable LLM Math Reasoning Acceleration with Low-rank Distillation

May 08, 2025

LoRe: Personalizing LLMs via Low-Rank Reward Modeling

Apr 20, 2025

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Feb 27, 2025

Characterizing the Accuracy-Communication-Privacy Trade-off in Distributed Stochastic Convex Optimization

Jan 06, 2025

Vertical Federated Learning with Missing Features During Training and Inference

Oct 29, 2024