Picture for Zihao Bo

Zihao Bo

ITO: Images and Texts as One via Synergizing Multiple Alignment and Training-Time Fusion

Add code
Mar 04, 2026
Viaarxiv icon

iGVLM: Dynamic Instruction-Guided Vision Encoding for Question-Aware Multimodal Understanding

Add code
Mar 03, 2026
Viaarxiv icon

DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input

Add code
Apr 23, 2019
Figure 1 for DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input
Figure 2 for DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input
Figure 3 for DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input
Figure 4 for DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input
Viaarxiv icon