Picture for Yijiang Li

Yijiang Li

On-Policy Distillation with Best-of-N Teacher Rollout Selection

Add code
May 10, 2026
Viaarxiv icon

Reward-Decomposed Reinforcement Learning for Immersive Video Role-Playing

Add code
May 06, 2026
Viaarxiv icon

FedQueue: Queue-Aware Federated Learning for Cross-Facility HPC Training

Add code
May 04, 2026
Viaarxiv icon

LUMINA: A Grid Foundation Model for Benchmarking AC Optimal Power Flow Surrogate Learning

Add code
May 04, 2026
Viaarxiv icon

Towards Systematic Generalization for Power Grid Optimization Problems

Add code
May 03, 2026
Viaarxiv icon

CocoaBench: Evaluating Unified Digital Agents in the Wild

Add code
Apr 14, 2026
Viaarxiv icon

ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

Add code
Mar 23, 2026
Viaarxiv icon

Scalable Cross-Facility Federated Learning for Scientific Foundation Models on Multiple Supercomputers

Add code
Mar 20, 2026
Viaarxiv icon

EgoForge: Goal-Directed Egocentric World Simulator

Add code
Mar 20, 2026
Viaarxiv icon

BackdoorIDS: Zero-shot Backdoor Detection for Pretrained Vision Encoder

Add code
Mar 12, 2026
Viaarxiv icon