Picture for Ruoyu Sun

Ruoyu Sun

Xi'an Jiaotong-Liverpool University

QK-Normed MLA: QK normalization without full key caching

Add code
Jun 15, 2026
Viaarxiv icon

Addressing Market Regime Changes and Heavy-Tailed Returns in Portfolio Optimization via Bayesian VAR and Elliptical Black-Litterman

Add code
Jun 08, 2026
Viaarxiv icon

PC Layer: Polynomial Weight Preconditioning for Improving LLM Pre-Training

Add code
Jun 04, 2026
Viaarxiv icon

A Geometric Characterization of the Stationary Plateau for Two-Layer Neural Networks

Add code
Jun 03, 2026
Viaarxiv icon

FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions

Add code
Mar 18, 2026
Viaarxiv icon

Adam Converges Without Any Modification On Update Rules

Add code
Mar 02, 2026
Viaarxiv icon

AI-Driven Spectrum Occupancy Prediction Using Real-World Spectrum Measurements

Add code
Jan 16, 2026
Viaarxiv icon

LarS-Net: A Large-Scale Framework for Network-Level Spectrum Sensing

Add code
Jan 16, 2026
Viaarxiv icon

Automated Spectrum Sensing and Analysis Framework

Add code
Jan 16, 2026
Viaarxiv icon

VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision

Add code
Oct 31, 2025
Figure 1 for VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision
Figure 2 for VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision
Figure 3 for VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision
Figure 4 for VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision
Viaarxiv icon