Picture for Liang Lin

Liang Lin

All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents

Add code
Aug 20, 2024
Viaarxiv icon

Style-Preserving Lip Sync via Audio-Aware Style Reference

Add code
Aug 10, 2024
Viaarxiv icon

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model

Add code
Aug 10, 2024
Figure 1 for High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Figure 2 for High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Figure 3 for High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Figure 4 for High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model
Viaarxiv icon

Improving Network Interpretability via Explanation Consistency Evaluation

Add code
Aug 08, 2024
Viaarxiv icon

VideoQA in the Era of LLMs: An Empirical Study

Add code
Aug 08, 2024
Viaarxiv icon

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Add code
Jul 31, 2024
Viaarxiv icon

Cool-Fusion: Fuse Large Language Models without Training

Add code
Jul 29, 2024
Viaarxiv icon

CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation

Add code
Jul 20, 2024
Viaarxiv icon

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Add code
Jul 15, 2024
Viaarxiv icon

Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram

Add code
Jul 10, 2024
Viaarxiv icon