Picture for Komei Sugiura

Komei Sugiura

Attention Lattice Adapter: Visual Explanation Generation for Visual Foundation Model

Add code
Sep 18, 2025
Viaarxiv icon

Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models

Add code
Sep 17, 2025
Viaarxiv icon

Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images

Add code
Aug 11, 2025
Viaarxiv icon

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Add code
Jun 16, 2025
Viaarxiv icon

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Add code
Jan 28, 2025
Viaarxiv icon

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Add code
Jan 08, 2025
Figure 1 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 2 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 3 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 4 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Viaarxiv icon

Task Success Prediction and Open-Vocabulary Object Manipulation

Add code
Dec 26, 2024
Figure 1 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 2 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 3 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 4 for Task Success Prediction and Open-Vocabulary Object Manipulation
Viaarxiv icon

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

Add code
Dec 24, 2024
Figure 1 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 2 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 3 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 4 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Viaarxiv icon

Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations

Add code
Oct 01, 2024
Figure 1 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 2 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 3 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 4 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Viaarxiv icon

DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning

Add code
Sep 28, 2024
Figure 1 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 2 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 3 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 4 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Viaarxiv icon