Picture for Yusuke Oda

Yusuke Oda

Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models

Add code
Apr 02, 2026
Viaarxiv icon

JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation

Add code
Apr 01, 2026
Viaarxiv icon

HiFlow: Tokenization-Free Scale-Wise Autoregressive Policy Learning via Flow Matching

Add code
Mar 28, 2026
Viaarxiv icon

ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws

Add code
Mar 18, 2026
Viaarxiv icon

ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding

Add code
Feb 18, 2026
Viaarxiv icon

Instability in Downstream Task Performance During LLM Pretraining

Add code
Oct 06, 2025
Viaarxiv icon

Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens

Add code
Sep 18, 2025
Figure 1 for Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Figure 2 for Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Figure 3 for Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Figure 4 for Llama-Mimi: Speech Language Models with Interleaved Semantic and Acoustic Tokens
Viaarxiv icon

Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality

Add code
Jun 17, 2025
Viaarxiv icon

BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning

Add code
Jun 08, 2025
Viaarxiv icon

llm-jp-modernbert: A ModernBERT Model Trained on a Large-Scale Japanese Corpus with Long Context Length

Add code
Apr 22, 2025
Viaarxiv icon