Picture for Katsuhiko Hayashi

Katsuhiko Hayashi

Diversity of Transformer Layers: One Aspect of Parameter Scaling Laws

Add code
May 29, 2025
Viaarxiv icon

TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation

Add code
Apr 25, 2025
Viaarxiv icon

The Role of Background Information in Reducing Object Hallucination in Vision-Language Models: Insights from Cutoff API Prompting

Add code
Feb 21, 2025
Viaarxiv icon

A Simple but Effective Closed-form Solution for Extreme Multi-label Learning

Add code
Jan 17, 2025
Viaarxiv icon

Can Impressions of Music be Extracted from Thumbnail Images?

Add code
Jan 05, 2025
Figure 1 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 2 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 3 for Can Impressions of Music be Extracted from Thumbnail Images?
Figure 4 for Can Impressions of Music be Extracted from Thumbnail Images?
Viaarxiv icon

Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain

Add code
Dec 29, 2024
Figure 1 for Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
Figure 2 for Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
Figure 3 for Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
Figure 4 for Understanding the Impact of Confidence in Retrieval Augmented Generation: A Case Study in the Medical Domain
Viaarxiv icon

How Panel Layouts Define Manga: Insights from Visual Ablation Experiments

Add code
Dec 26, 2024
Figure 1 for How Panel Layouts Define Manga: Insights from Visual Ablation Experiments
Figure 2 for How Panel Layouts Define Manga: Insights from Visual Ablation Experiments
Figure 3 for How Panel Layouts Define Manga: Insights from Visual Ablation Experiments
Figure 4 for How Panel Layouts Define Manga: Insights from Visual Ablation Experiments
Viaarxiv icon

Theoretical Aspects of Bias and Diversity in Minimum Bayes Risk Decoding

Add code
Oct 19, 2024
Viaarxiv icon

Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models

Add code
Sep 03, 2024
Figure 1 for Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
Figure 2 for Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
Figure 3 for Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
Figure 4 for Towards Cross-Lingual Explanation of Artwork in Large-scale Vision Language Models
Viaarxiv icon

Multi-label Learning with Random Circular Vectors

Add code
Jul 08, 2024
Viaarxiv icon