Picture for Yusuke Miyao

Yusuke Miyao

Human-Grounded Multimodal Benchmark with 900K-Scale Aggregated Student Response Distributions from Japan's National Assessment of Academic Ability

Add code
May 12, 2026
Viaarxiv icon

Exclusive Unlearning

Add code
Apr 07, 2026
Viaarxiv icon

A Comparative Analysis of LLM Memorization at Statistical and Internal Levels: Cross-Model Commonalities and Model-Specific Signatures

Add code
Mar 23, 2026
Viaarxiv icon

The Imperfective Paradox in Large Language Models

Add code
Jan 14, 2026
Viaarxiv icon

Do Self-Supervised Speech Models Exhibit the Critical Period Effects in Language Acquisition?

Add code
Aug 28, 2025
Viaarxiv icon

Tracking World States with Language Models: State-Based Evaluation Using Chess

Add code
Aug 27, 2025
Viaarxiv icon

Massive Supervised Fine-tuning Experiments Reveal How Data, Layer, and Training Factors Shape LLM Alignment Quality

Add code
Jun 17, 2025
Viaarxiv icon

BIS Reasoning 1.0: The First Large-Scale Japanese Benchmark for Belief-Inconsistent Syllogistic Reasoning

Add code
Jun 08, 2025
Viaarxiv icon

Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance

Add code
May 27, 2025
Figure 1 for Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Figure 2 for Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Figure 3 for Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Figure 4 for Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance
Viaarxiv icon

Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models

Add code
May 23, 2025
Viaarxiv icon