Picture for Komei Sugiura

Komei Sugiura

Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images

Add code
Aug 11, 2025
Viaarxiv icon

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Add code
Jun 16, 2025
Viaarxiv icon

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Add code
Jan 28, 2025
Viaarxiv icon

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Add code
Jan 08, 2025
Figure 1 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 2 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 3 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 4 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Viaarxiv icon

Task Success Prediction and Open-Vocabulary Object Manipulation

Add code
Dec 26, 2024
Figure 1 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 2 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 3 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 4 for Task Success Prediction and Open-Vocabulary Object Manipulation
Viaarxiv icon

Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling

Add code
Dec 24, 2024
Figure 1 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 2 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 3 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Figure 4 for Open-Vocabulary Mobile Manipulation Based on Double Relaxed Contrastive Learning with Dense Labeling
Viaarxiv icon

Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations

Add code
Oct 01, 2024
Figure 1 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 2 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 3 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Figure 4 for Task Success Prediction for Open-Vocabulary Manipulation Based on Multi-Level Aligned Representations
Viaarxiv icon

DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning

Add code
Sep 28, 2024
Figure 1 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 2 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 3 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Figure 4 for DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning
Viaarxiv icon

DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions

Add code
Aug 15, 2024
Viaarxiv icon

Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks

Add code
Jul 18, 2024
Figure 1 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 2 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 3 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Figure 4 for Nearest Neighbor Future Captioning: Generating Descriptions for Possible Collisions in Object Placement Tasks
Viaarxiv icon