
Jun Suzuki

Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling

Apr 01, 2026
Via arXiv

SLVMEval: Synthetic Meta Evaluation Benchmark for Text-to-Long Video Generation

Mar 31, 2026
Via arXiv

Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning

Mar 17, 2026
Via arXiv

Enhancing Persuasive Dialogue Agents by Synthesizing Cross-Disciplinary Communication Strategies

Feb 26, 2026
Via arXiv

Relaxing Positional Alignment in Masked Diffusion Language Models

Jan 30, 2026
Via arXiv

TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks

Jan 30, 2026
Via arXiv

Suppressing Final Layer Hidden State Jumps in Transformer Pretraining

Jan 26, 2026
Via arXiv

Instruction-Following Evaluation of Large Vision-Language Models

Dec 29, 2025
Via arXiv

An Open and Reproducible Deep Research Agent for Long-Form Question Answering

Dec 15, 2025
Via arXiv

Transformer Key-Value Memories Are Nearly as Interpretable as Sparse Autoencoders

Oct 25, 2025
Via arXiv