Yuankai Qi

AerialVLN: Vision-and-Language Navigation for UAVs

Aug 13, 2023

Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes

Aug 07, 2023

Teacher Agent: A Non-Knowledge Distillation Method for Rehearsal-based Video Incremental Learning

Jun 01, 2023

A Unified Object Counting Network with Object Occupation Prior

Dec 29, 2022

Consistency-Aware Anchor Pyramid Network for Crowd Localization

Dec 08, 2022

BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation

Dec 08, 2022

Learning to Dub Movies via Hierarchical Prosody Models

Dec 08, 2022

Exploiting Completeness and Uncertainty of Pseudo Labels for Weakly Supervised Video Anomaly Detection

Dec 08, 2022

Progressive Multi-resolution Loss for Crowd Counting

Dec 08, 2022

Multi-Attention Network for Compressed Video Referring Object Segmentation

Jul 26, 2022