Picture for Zhi Han

Zhi Han

Agentization of Digital Assets for the Agentic Web: Concepts, Techniques, and Benchmark

Add code
Apr 05, 2026
Viaarxiv icon

T-800: An 800 Hz Data Glove for Precise Hand Gesture Tracking

Add code
Mar 27, 2026
Viaarxiv icon

All-day Multi-scenes Lifelong Vision-and-Language Navigation with Tucker Adaptation

Add code
Mar 15, 2026
Viaarxiv icon

Lifelong Language-Conditioned Robotic Manipulation Learning

Add code
Mar 05, 2026
Viaarxiv icon

The power of small initialization in noisy low-tubal-rank tensor recovery

Add code
Mar 03, 2026
Viaarxiv icon

SeqWalker: Sequential-Horizon Vision-and-Language Navigation with Hierarchical Planning

Add code
Jan 08, 2026
Viaarxiv icon

Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent

Add code
Dec 23, 2025
Figure 1 for Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Figure 2 for Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Figure 3 for Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Figure 4 for Efficient Low-Tubal-Rank Tensor Estimation via Alternating Preconditioned Gradient Descent
Viaarxiv icon

CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation

Add code
Aug 14, 2025
Figure 1 for CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation
Figure 2 for CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation
Figure 3 for CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation
Figure 4 for CRISP: Contrastive Residual Injection and Semantic Prompting for Continual Video Instance Segmentation
Viaarxiv icon

MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

Add code
May 29, 2025
Viaarxiv icon

Vision and Language Integration for Domain Generalization

Add code
Apr 17, 2025
Figure 1 for Vision and Language Integration for Domain Generalization
Figure 2 for Vision and Language Integration for Domain Generalization
Figure 3 for Vision and Language Integration for Domain Generalization
Figure 4 for Vision and Language Integration for Domain Generalization
Viaarxiv icon