Tomoya Yoshida

Developing Vision-Language-Action Model from Egocentric Videos

Sep 26, 2025

EgoOops: A Dataset for Mistake Action Detection from Egocentric Videos with Procedural Texts

Oct 07, 2024

Text-driven Affordance Learning from Egocentric Vision

Apr 03, 2024