Kyosuke Nishida

Responses Fall Short of Understanding: Revealing the Gap between Internal Representations and Responses in Visual Document Understanding

Apr 06, 2026

Can LLMs Detect Their Own Hallucinations?

Nov 14, 2025

Let's Put Ourselves in Sally's Shoes: Shoes-of-Others Prefixing Improves Theory of Mind in Large Language Models

Jun 06, 2025

VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents

Apr 14, 2025

Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models

Feb 18, 2025

Wavelet-based Positional Representation for Long Context

Feb 04, 2025

ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind

Jan 15, 2025

Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes

Oct 07, 2024

InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions

Jan 24, 2024

Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

Apr 03, 2023