Picture for Taiji Suzuki

Taiji Suzuki

Generalization Bound of Gradient Flow through Training Trajectory and Data-dependent Kernel

Add code
Jun 12, 2025
Viaarxiv icon

On the Role of Label Noise in the Feature Learning Process

Add code
May 25, 2025
Viaarxiv icon

Direct Density Ratio Optimization: A Statistically Consistent Approach to Aligning Large Language Models

Add code
May 12, 2025
Viaarxiv icon

Quantifying Memory Utilization with Effective State-Size

Add code
Apr 28, 2025
Viaarxiv icon

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

Add code
Apr 24, 2025
Viaarxiv icon

Propagation of Chaos for Mean-Field Langevin Dynamics and its Application to Model Ensemble

Add code
Feb 09, 2025
Viaarxiv icon

Direct Distributional Optimization for Provable Alignment of Diffusion Models

Add code
Feb 05, 2025
Figure 1 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 2 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 3 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 4 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Viaarxiv icon

Metastable Dynamics of Chain-of-Thought Reasoning: Provable Benefits of Search, RL and Distillation

Add code
Feb 02, 2025
Viaarxiv icon

Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression

Add code
Jan 09, 2025
Figure 1 for Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression
Figure 2 for Optimality and Adaptivity of Deep Neural Features for Instrumental Variable Regression
Viaarxiv icon

On the Comparison between Multi-modal and Single-modal Contrastive Learning

Add code
Nov 05, 2024
Viaarxiv icon