Picture for Boqiang Duan

Boqiang Duan

VideoSEG-O3: A Multi-turn Reinforcement Learning Framework for Reasoning Video Object Segmentation

Add code
Jun 05, 2026
Viaarxiv icon

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Add code
Dec 31, 2021
Figure 1 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 2 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 3 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 4 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Viaarxiv icon