Alireza Mousavi-Hosseini

Super Apriel: One Checkpoint, Many Speeds

Apr 21, 2026

On Fitting Flow Models with Large Sinkhorn Couplings

Jun 05, 2025

When Do Transformers Outperform Feedforward and Recurrent Networks? A Statistical Perspective

Mar 14, 2025

Robust Feature Learning for Multi-Index Models in High Dimensions

Oct 21, 2024

Learning Multi-Index Models with Neural Networks via Mean-Field Langevin Dynamics

Aug 14, 2024

Mean-Field Langevin Dynamics for Signed Measures via a Bilevel Approach

Jun 26, 2024

A Separation in Heavy-Tailed Sampling: Gaussian vs. Stable Oracles for Proximal Samplers

May 27, 2024

Gradient-Based Feature Learning under Structured Data

Sep 07, 2023

Towards a Complete Analysis of Langevin Monte Carlo: Beyond Poincaré Inequality

Mar 07, 2023

Neural Networks Efficiently Learn Low-Dimensional Representations with SGD

Sep 29, 2022