Picture for Rei Higuchi

Rei Higuchi

Direct Density Ratio Optimization: A Statistically Consistent Approach to Aligning Large Language Models

Add code
May 12, 2025
Viaarxiv icon

When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars

Add code
Apr 24, 2025
Viaarxiv icon