Picture for Yuiga Wada

Yuiga Wada

Attention Lattice Adapter: Visual Explanation Generation for Visual Foundation Model

Add code
Sep 18, 2025
Viaarxiv icon

Capturing Fine-Grained Alignments Improves 3D Affordance Detection

Add code
Jun 24, 2025
Viaarxiv icon

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Add code
Jun 16, 2025
Figure 1 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 2 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 3 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 4 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Viaarxiv icon

DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning

Add code
Sep 28, 2024
Figure 1 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 2 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 3 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 4 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Viaarxiv icon

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Add code
Feb 28, 2024
Figure 1 for Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Figure 2 for Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Figure 3 for Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Figure 4 for Polos: Multimodal Metric Learning from Human Feedback for Image Captioning
Viaarxiv icon

DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

Add code
Nov 12, 2023
Viaarxiv icon

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models

Add code
Nov 07, 2023
Viaarxiv icon

Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions

Add code
Jul 17, 2023
Figure 1 for Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions
Figure 2 for Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions
Figure 3 for Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions
Figure 4 for Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions
Viaarxiv icon