Liang Lin

Style-Preserving Lip Sync via Audio-Aware Style Reference

Aug 10, 2024

High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model

Aug 10, 2024

Improving Network Interpretability via Explanation Consistency Evaluation

Aug 08, 2024

VideoQA in the Era of LLMs: An Empirical Study

Aug 08, 2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

Jul 31, 2024

Cool-Fusion: Fuse Large Language Models without Training

Jul 29, 2024

CrossDehaze: Scaling Up Image Dehazing with Cross-Data Vision Alignment and Augmentation

Jul 20, 2024

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

Jul 15, 2024

Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram

Jul 10, 2024

Dynamic Correlation Learning and Regularization for Multi-Label Confidence Calibration

Jul 09, 2024