Komei Sugiura

ReMoRa: Multimodal Large Language Model based on Refined Motion Representation for Long-Video Understanding

Feb 18, 2026
Via arXiv

LLM-Free Image Captioning Evaluation in Reference-Flexible Settings

Dec 25, 2025
Via arXiv

Affordance RAG: Hierarchical Multimodal Retrieval with Affordance-Aware Embodied Memory for Mobile Manipulation

Dec 22, 2025
Via arXiv

MEGState: Phoneme Decoding from Magnetoencephalography Signals

Dec 19, 2025
Via arXiv

Attention Lattice Adapter: Visual Explanation Generation for Visual Foundation Model

Sep 18, 2025
Via arXiv

Pre-Manipulation Alignment Prediction with Parallel Deep State-Space and Transformer Models

Sep 17, 2025
Via arXiv

Deep Space Weather Model: Long-Range Solar Flare Prediction from Multi-Wavelength Images

Aug 11, 2025
Via arXiv

ZINA: Multimodal Fine-grained Hallucination Detection and Editing

Jun 16, 2025
Via arXiv

Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement

Jan 28, 2025
Via arXiv

Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories

Jan 08, 2025
Via arXiv