Picture for Komei Sugiura

Komei Sugiura

LLM-Free Image Captioning Evaluation in Reference-Flexible Settings

Add code
Dec 25, 2025
Viaarxiv icon

Affordance RAG: Hierarchical Multimodal Retrieval with Affordance-Aware Embodied Memory for Mobile Manipulation

Add code
Dec 22, 2025
Viaarxiv icon

MEGState: Phoneme Decoding from Magnetoencephalography Signals

Add code
Dec 19, 2025
Viaarxiv icon

Attention Lattice Adapter: Visual Explanation Generation for Visual Foundation Model

Add code
Sep 18, 2025
Viaarxiv icon

Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models

Add code
Sep 17, 2025
Viaarxiv icon

Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images

Add code
Aug 11, 2025
Figure 1 for Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Figure 2 for Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Figure 3 for Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Figure 4 for Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images
Viaarxiv icon

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Add code
Jun 16, 2025
Figure 1 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 2 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 3 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Figure 4 for ZINA: Multimodal Fine-grained Hallucination Detection and Editing
Viaarxiv icon

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Add code
Jan 28, 2025
Figure 1 for Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Figure 2 for Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Figure 3 for Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Figure 4 for Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement
Viaarxiv icon

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Add code
Jan 08, 2025
Figure 1 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 2 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 3 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Figure 4 for Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories
Viaarxiv icon

Task Success Prediction and Open-Vocabulary Object Manipulation

Add code
Dec 26, 2024
Figure 1 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 2 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 3 for Task Success Prediction and Open-Vocabulary Object Manipulation
Figure 4 for Task Success Prediction and Open-Vocabulary Object Manipulation
Viaarxiv icon