Picture for Yoichi Sato

Yoichi Sato

Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Add code
Jun 06, 2025
Viaarxiv icon

Leadership Assessment in Pediatric Intensive Care Unit Team Training

Add code
May 30, 2025
Viaarxiv icon

Egocentric Action-aware Inertial Localization in Point Clouds

Add code
May 20, 2025
Viaarxiv icon

SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training

Add code
Feb 21, 2025
Figure 1 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 2 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 3 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 4 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Viaarxiv icon

Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild

Add code
Sep 15, 2024
Figure 1 for Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild
Figure 2 for Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild
Figure 3 for Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild
Figure 4 for Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild
Viaarxiv icon

WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding

Add code
Jul 22, 2024
Figure 1 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 2 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 3 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Figure 4 for WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding
Viaarxiv icon

ActionVOS: Actions as Prompts for Video Object Segmentation

Add code
Jul 10, 2024
Figure 1 for ActionVOS: Actions as Prompts for Video Object Segmentation
Figure 2 for ActionVOS: Actions as Prompts for Video Object Segmentation
Figure 3 for ActionVOS: Actions as Prompts for Video Object Segmentation
Figure 4 for ActionVOS: Actions as Prompts for Video Object Segmentation
Viaarxiv icon

Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

Add code
Jul 09, 2024
Viaarxiv icon

Learning Object States from Actions via Large Language Models

Add code
May 02, 2024
Figure 1 for Learning Object States from Actions via Large Language Models
Figure 2 for Learning Object States from Actions via Large Language Models
Figure 3 for Learning Object States from Actions via Large Language Models
Figure 4 for Learning Object States from Actions via Large Language Models
Viaarxiv icon

Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects

Add code
Mar 25, 2024
Figure 1 for Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects
Figure 2 for Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects
Figure 3 for Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects
Figure 4 for Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects
Viaarxiv icon