Picture for Alessandro Rinaldo

Alessandro Rinaldo

On Minimax Estimation of Parameters in Softmax-Contaminated Mixture of Experts

Add code
May 24, 2025
Viaarxiv icon

On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating

Add code
May 16, 2025
Viaarxiv icon

Convergence Rates for Softmax Gating Mixture of Experts

Add code
Mar 05, 2025
Viaarxiv icon

Uncertainty quantification for Markov chains with application to temporal difference learning

Add code
Feb 19, 2025
Viaarxiv icon

Statistical Inference for Temporal Difference Learning with Linear Function Approximation

Add code
Oct 21, 2024
Viaarxiv icon

Straightness of Rectified Flow: A Theoretical Insight into Wasserstein Convergence

Add code
Oct 19, 2024
Viaarxiv icon

Sigmoid Gating is More Sample Efficient than Softmax Gating in Mixture of Experts

Add code
May 22, 2024
Viaarxiv icon

On Least Squares Estimation in Softmax Gating Mixture of Experts

Add code
Feb 05, 2024
Viaarxiv icon

Sharp high-probability sample complexities for policy evaluation with linear function approximation

Add code
May 30, 2023
Viaarxiv icon

Mitigating multiple descents: A model-agnostic framework for risk monotonization

Add code
May 25, 2022
Figure 1 for Mitigating multiple descents: A model-agnostic framework for risk monotonization
Figure 2 for Mitigating multiple descents: A model-agnostic framework for risk monotonization
Figure 3 for Mitigating multiple descents: A model-agnostic framework for risk monotonization
Figure 4 for Mitigating multiple descents: A model-agnostic framework for risk monotonization
Viaarxiv icon