Picture for Atsushi Hashimoto

Atsushi Hashimoto

Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation

Add code
Dec 17, 2025
Viaarxiv icon

CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning

Add code
Jul 02, 2025
Viaarxiv icon

KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks

Add code
Apr 14, 2025
Figure 1 for KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Figure 2 for KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Figure 3 for KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Figure 4 for KeyMPs: One-Shot Vision-Language Guided Motion Generation by Sequencing DMPs for Occlusion-Rich Tasks
Viaarxiv icon

Visuo-Tactile Zero-Shot Object Recognition with Vision-Language Model

Add code
Sep 14, 2024
Viaarxiv icon

COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark

Add code
Aug 05, 2024
Viaarxiv icon

AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering

Add code
Jul 28, 2024
Viaarxiv icon

Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos

Add code
Nov 29, 2023
Figure 1 for Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Figure 2 for Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Figure 3 for Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Figure 4 for Exo2EgoDVC: Dense Video Captioning of Egocentric Procedural Activities Using Web Instructional Videos
Viaarxiv icon

Vision-Language Interpreter for Robot Task Planning

Add code
Nov 02, 2023
Viaarxiv icon

WeaveNet for Approximating Two-sided Matching Problems

Add code
Oct 19, 2023
Viaarxiv icon

A Critical Look at the Current Usage of Foundation Model for Dense Recognition Task

Add code
Aug 01, 2023
Viaarxiv icon