Picture for Na Li

Na Li

Smiltec

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Q-Regularized Generative Auto-Bidding: From Suboptimal Trajectories to Optimal Policies

Add code
Jan 06, 2026
Viaarxiv icon

Adaptive Diffusion-based Augmentation for Recommendation

Add code
Jan 04, 2026
Viaarxiv icon

DiffKD-DCIS: Predicting Upgrade of Ductal Carcinoma In Situ with Diffusion Augmentation and Knowledge Distillation

Add code
Jan 04, 2026
Viaarxiv icon

Are First-Order Diffusion Samplers Really Slower? A Fast Forward-Value Approach

Add code
Dec 31, 2025
Viaarxiv icon

Max-Entropy Reinforcement Learning with Flow Matching and A Case Study on LQR

Add code
Dec 29, 2025
Viaarxiv icon

Spectral Representation-based Reinforcement Learning

Add code
Dec 17, 2025
Viaarxiv icon

Model-Based Diffusion Sampling for Predictive Control in Offline Decision Making

Add code
Dec 09, 2025
Viaarxiv icon

Offline Imitation Learning upon Arbitrary Demonstrations by Pre-Training Dynamics Representations

Add code
Aug 20, 2025
Viaarxiv icon

One-Step Flow Policy Mirror Descent

Add code
Jul 31, 2025
Viaarxiv icon