Shuhei Kurita

Text-driven Affordance Learning from Egocentric Vision

Apr 03, 2024
Tomoya Yoshida, Shuhei Kurita, Taichi Nishimura, Shinsuke Mori

JDocQA: Japanese Document Question Answering Dataset for Generative Language Models

Mar 28, 2024
Eri Onami, Shuhei Kurita, Taiki Miyanishi, Taro Watanabe

Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction

Feb 28, 2024
Koki Maeda, Shuhei Kurita, Taiki Miyanishi, Naoaki Okazaki

SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition

Jan 18, 2024
Hao Wang, Shuhei Kurita, Shuichiro Shimizu, Daisuke Kawahara

CityRefer: Geography-aware 3D Visual Grounding Dataset on City-scale Point Cloud Data

Oct 28, 2023
Taiki Miyanishi, Fumiya Kitamori, Shuhei Kurita, Jungdae Lee, Motoaki Kawanabe, Nakamasa Inoue

RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D

Aug 23, 2023
Shuhei Kurita, Naoki Katsura, Eri Onami

Cross3DVG: Baseline and Dataset for Cross-Dataset 3D Visual Grounding on Different RGB-D Scans

May 23, 2023
Taiki Miyanishi, Daichi Azuma, Shuhei Kurita, Motoaki Kawanabe

Visual Recipe Flow: A Dataset for Learning Visual State Changes of Objects with Recipe Flows

Sep 13, 2022
Keisuke Shirai, Atsushi Hashimoto, Taichi Nishimura, Hirotaka Kameko, Shuhei Kurita, Yoshitaka Ushiku, Shinsuke Mori

ScanQA: 3D Question Answering for Spatial Scene Understanding

Dec 20, 2021
Daichi Azuma, Taiki Miyanishi, Shuhei Kurita, Motoaki Kawanabe

Generative Language-Grounded Policy in Vision-and-Language Navigation with Bayes' Rule

Oct 08, 2020
Shuhei Kurita, Kyunghyun Cho
