Picture for Yifei Huang

Yifei Huang

Towards Interactive Intelligence for Digital Humans

Add code
Dec 15, 2025
Viaarxiv icon

The N-Body Problem: Parallel Execution from Single-Person Egocentric Video

Add code
Dec 12, 2025
Viaarxiv icon

UniLS: End-to-End Audio-Driven Avatars for Unified Listening and Speaking

Add code
Dec 10, 2025
Viaarxiv icon

Living the Novel: A System for Generating Self-Training Timeline-Aware Conversational Agents from Novels

Add code
Dec 08, 2025
Viaarxiv icon

Can MLLMs Read the Room? A Multimodal Benchmark for Verifying Truthfulness in Multi-Party Social Interactions

Add code
Oct 31, 2025
Viaarxiv icon

Solving the Hubbard model with Neural Quantum States

Add code
Jul 03, 2025
Viaarxiv icon

Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision

Add code
Jun 06, 2025
Figure 1 for Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Figure 2 for Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Figure 3 for Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Figure 4 for Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
Viaarxiv icon

Egocentric Action-aware Inertial Localization in Point Clouds

Add code
May 20, 2025
Figure 1 for Egocentric Action-aware Inertial Localization in Point Clouds
Figure 2 for Egocentric Action-aware Inertial Localization in Point Clouds
Figure 3 for Egocentric Action-aware Inertial Localization in Point Clouds
Figure 4 for Egocentric Action-aware Inertial Localization in Point Clouds
Viaarxiv icon

Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining

Add code
May 10, 2025
Viaarxiv icon

Learning Streaming Video Representation via Multitask Training

Add code
Apr 28, 2025
Figure 1 for Learning Streaming Video Representation via Multitask Training
Figure 2 for Learning Streaming Video Representation via Multitask Training
Figure 3 for Learning Streaming Video Representation via Multitask Training
Figure 4 for Learning Streaming Video Representation via Multitask Training
Viaarxiv icon