Yijie Peng

Sharper Generalization Bounds for Transformer

Mar 23, 2026
Via arXiv

Adaptive Robust Estimator for Multi-Agent Reinforcement Learning

Mar 23, 2026
Via arXiv

Optimal low-rank stochastic gradient estimation for LLM training

Mar 21, 2026
Via arXiv

Nonparametric Bayesian Optimization for General Rewards

Feb 07, 2026
Via arXiv

LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization

Feb 03, 2026
Via arXiv

Stochastic Approximation Methods for Distortion Risk Measure Optimization

Oct 06, 2025
Via arXiv

Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales

Oct 05, 2025
Via arXiv

Forward Learning with Differential Privacy

Apr 01, 2025
Via arXiv

CoNNect: A Swiss-Army-Knife Regularizer for Pruning of Neural Networks

Feb 02, 2025
Via arXiv

Eliminating Ratio Bias for Gradient-based Simulated Parameter Estimation

Nov 20, 2024
Via arXiv