Picture for Yoshiki Obinata

Yoshiki Obinata

Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models

Add code
Nov 15, 2024
Figure 1 for Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models
Figure 2 for Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models
Figure 3 for Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models
Figure 4 for Remote Life Support Robot Interface System for Global Task Planning and Local Action Expansion Using Foundation Models
Viaarxiv icon

Robotic State Recognition with Image-to-Text Retrieval Task of Pre-Trained Vision-Language Model and Black-Box Optimization

Add code
Oct 30, 2024
Viaarxiv icon

Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL

Add code
Oct 07, 2024
Figure 1 for Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL
Figure 2 for Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL
Figure 3 for Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL
Figure 4 for Real-World Cooking Robot System from Recipes Based on Food State Recognition Using Foundation Models and PDDL
Viaarxiv icon

Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization

Add code
Sep 26, 2024
Figure 1 for Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization
Figure 2 for Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization
Figure 3 for Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization
Viaarxiv icon

Reflex-Based Open-Vocabulary Navigation without Prior Knowledge Using Omnidirectional Camera and Multiple Vision-Language Models

Add code
Aug 21, 2024
Viaarxiv icon

Continuous Object State Recognition for Cooking Robots Using Pre-Trained Vision-Language Models and Black-box Optimization

Add code
Mar 13, 2024
Viaarxiv icon

Daily Assistive View Control Learning of Low-Cost Low-Rigidity Robot via Large-Scale Vision-Language Model

Add code
Dec 12, 2023
Viaarxiv icon

Binary State Recognition by Robots using Visual Question Answering of Pre-Trained Vision-Language Model

Add code
Oct 25, 2023
Viaarxiv icon

Semantic Scene Difference Detection in Daily Life Patroling by Mobile Robots using Pre-Trained Large-Scale Vision-Language Model

Add code
Sep 28, 2023
Viaarxiv icon

Recognition of Heat-Induced Food State Changes by Time-Series Use of Vision-Language Model for Cooking Robot

Add code
Sep 06, 2023
Viaarxiv icon