Picture for Yifei Huang

Yifei Huang

Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Add code
Jun 06, 2025
Viaarxiv icon

Egocentric Action-aware Inertial Localization in Point Clouds

Add code
May 20, 2025
Viaarxiv icon

Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining

Add code
May 10, 2025
Viaarxiv icon

Learning Streaming Video Representation via Multitask Training

Add code
Apr 28, 2025
Viaarxiv icon

EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos

Add code
Apr 16, 2025
Viaarxiv icon

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Add code
Mar 06, 2025
Viaarxiv icon

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Add code
Mar 02, 2025
Viaarxiv icon

SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training

Add code
Feb 21, 2025
Figure 1 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 2 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 3 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Figure 4 for SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training
Viaarxiv icon

Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model

Add code
Dec 30, 2024
Figure 1 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 2 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 3 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Figure 4 for Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
Viaarxiv icon

CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding

Add code
Dec 16, 2024
Figure 1 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 2 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 3 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Figure 4 for CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding
Viaarxiv icon