Naoaki Okazaki

Aligning Tree-Search Policies with Fixed Token Budgets in Test-Time Scaling of LLMs

Feb 10, 2026
Via arXiv

Autoregressive Direct Preference Optimization

Feb 10, 2026
Via arXiv

Diffusion-State Policy Optimization for Masked Diffusion Language Models

Feb 09, 2026
Via arXiv

From Correspondence to Actions: Human-Like Multi-Image Spatial Reasoning in Multi-modal Large Language Models

Feb 09, 2026
Via arXiv

Stopping Computation for Converged Tokens in Masked Diffusion-LM Decoding

Feb 06, 2026
Via arXiv

From Interpretability to Performance: Optimizing Retrieval Heads for Long-Context Language Models

Jan 16, 2026
Via arXiv

Bit-level BPE: Below the byte boundary

Jun 09, 2025
Via arXiv

Rewriting Pre-Training Data Boosts LLM Performance in Math and Code

May 05, 2025
Via arXiv

Building Instruction-Tuning Datasets from Human-Written Instructions with Open-Weight Large Language Models

Mar 31, 2025
Via arXiv

Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models

Mar 08, 2025
Via arXiv