Picture for Ryotaro Kawata

Ryotaro Kawata

From Shortcut to Induction Head: How Data Diversity Shapes Algorithm Selection in Transformers

Add code
Dec 21, 2025
Viaarxiv icon

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

Add code
Apr 24, 2025
Figure 1 for When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Figure 2 for When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Figure 3 for When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Figure 4 for When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars
Viaarxiv icon

Direct Distributional Optimization for Provable Alignment of Diffusion Models

Add code
Feb 05, 2025
Figure 1 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 2 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 3 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Figure 4 for Direct Distributional Optimization for Provable Alignment of Diffusion Models
Viaarxiv icon