Yakun Zhang

Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization

Mar 24, 2024
Via arXiv

Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation

Aug 24, 2023
Via arXiv