Komei Sugiura

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Jun 16, 2025

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Jan 28, 2025

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Jan 08, 2025

Task Success Prediction and Open-Vocabulary Object Manipulation

Dec 26, 2024

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

Dec 24, 2024

Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations

Oct 01, 2024

DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning

Sep 28, 2024

DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions

Aug 15, 2024

Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks

Jul 18, 2024

Layer-Wise Relevance Propagation with Conservation Property for ResNet

Jul 12, 2024