Picture for Yuejie Chi

Yuejie Chi

Preconditioning Benefits of Spectral Orthogonalization in Muon

Add code
Jan 20, 2026
Viaarxiv icon

Sample Complexity of Average-Reward Q-Learning: From Single-agent to Federated Reinforcement Learning

Add code
Jan 20, 2026
Viaarxiv icon

Polynomial Convergence of Riemannian Diffusion Models

Add code
Jan 05, 2026
Viaarxiv icon

Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization

Add code
Nov 10, 2025
Viaarxiv icon

Multi-head Transformers Provably Learn Symbolic Multi-step Reasoning via Gradient Descent

Add code
Aug 11, 2025
Viaarxiv icon

Scalable LLM Math Reasoning Acceleration with Low-rank Distillation

Add code
May 08, 2025
Viaarxiv icon

LoRe: Personalizing LLMs via Low-Rank Reward Modeling

Add code
Apr 20, 2025
Figure 1 for LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Figure 2 for LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Figure 3 for LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Figure 4 for LoRe: Personalizing LLMs via Low-Rank Reward Modeling
Viaarxiv icon

Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning

Add code
Feb 27, 2025
Figure 1 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 2 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 3 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Figure 4 for Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning
Viaarxiv icon

Characterizing the Accuracy-Communication-Privacy Trade-off in Distributed Stochastic Convex Optimization

Add code
Jan 06, 2025
Viaarxiv icon

Vertical Federated Learning with Missing Features During Training and Inference

Add code
Oct 29, 2024
Figure 1 for Vertical Federated Learning with Missing Features During Training and Inference
Figure 2 for Vertical Federated Learning with Missing Features During Training and Inference
Figure 3 for Vertical Federated Learning with Missing Features During Training and Inference
Viaarxiv icon