Picture for Molei Tao

Molei Tao

How Does the ReLU Activation Affect the Implicit Bias of Gradient Descent on High-dimensional Neural Network Regression?

Add code
Mar 05, 2026
Viaarxiv icon

LaViDa-R1: Advancing Reasoning for Unified Multimodal Diffusion Language Models

Add code
Feb 15, 2026
Viaarxiv icon

Discrete Adjoint Schrödinger Bridge Sampler

Add code
Feb 09, 2026
Viaarxiv icon

Diffusion Model's Generalization Can Be Characterized by Inductive Biases toward a Data-Dependent Ridge Manifold

Add code
Feb 05, 2026
Viaarxiv icon

Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design

Add code
Feb 04, 2026
Viaarxiv icon

Prism: Efficient Test-Time Scaling via Hierarchical Search and Self-Verification for Discrete Diffusion Language Models

Add code
Feb 02, 2026
Viaarxiv icon

Generalization of Diffusion Models Arises with a Balanced Representation Space

Add code
Dec 24, 2025
Figure 1 for Generalization of Diffusion Models Arises with a Balanced Representation Space
Figure 2 for Generalization of Diffusion Models Arises with a Balanced Representation Space
Figure 3 for Generalization of Diffusion Models Arises with a Balanced Representation Space
Figure 4 for Generalization of Diffusion Models Arises with a Balanced Representation Space
Viaarxiv icon

From Masks to Worlds: A Hitchhiker's Guide to World Models

Add code
Oct 23, 2025
Viaarxiv icon

Variational Learning Finds Flatter Solutions at the Edge of Stability

Add code
Jun 15, 2025
Viaarxiv icon

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Add code
Jun 09, 2025
Figure 1 for Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Figure 2 for Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Figure 3 for Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Figure 4 for Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces
Viaarxiv icon