Picture for Jingdong Wang

Jingdong Wang

Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

Add code
Oct 10, 2024
Figure 1 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 2 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 3 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Figure 4 for Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation
Viaarxiv icon

MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

Add code
Oct 10, 2024
Figure 1 for MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
Figure 2 for MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
Figure 3 for MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
Figure 4 for MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction
Viaarxiv icon

Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery

Add code
Sep 29, 2024
Figure 1 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 2 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 3 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Figure 4 for Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery
Viaarxiv icon

MonoFormer: One Transformer for Both Diffusion and Autoregression

Add code
Sep 24, 2024
Figure 1 for MonoFormer: One Transformer for Both Diffusion and Autoregression
Figure 2 for MonoFormer: One Transformer for Both Diffusion and Autoregression
Figure 3 for MonoFormer: One Transformer for Both Diffusion and Autoregression
Figure 4 for MonoFormer: One Transformer for Both Diffusion and Autoregression
Viaarxiv icon

SpotActor: Training-Free Layout-Controlled Consistent Image Generation

Add code
Sep 07, 2024
Figure 1 for SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Figure 2 for SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Figure 3 for SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Figure 4 for SpotActor: Training-Free Layout-Controlled Consistent Image Generation
Viaarxiv icon

Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

Add code
Sep 01, 2024
Figure 1 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 2 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 3 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Figure 4 for Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression
Viaarxiv icon

EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax

Add code
Aug 29, 2024
Figure 1 for EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax
Figure 2 for EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax
Figure 3 for EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax
Figure 4 for EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax
Viaarxiv icon

Disentangled Noisy Correspondence Learning

Add code
Aug 10, 2024
Figure 1 for Disentangled Noisy Correspondence Learning
Figure 2 for Disentangled Noisy Correspondence Learning
Figure 3 for Disentangled Noisy Correspondence Learning
Figure 4 for Disentangled Noisy Correspondence Learning
Viaarxiv icon

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Add code
Aug 06, 2024
Figure 1 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 2 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 3 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 4 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Viaarxiv icon

Add-SD: Rational Generation without Manual Reference

Add code
Jul 30, 2024
Figure 1 for Add-SD: Rational Generation without Manual Reference
Figure 2 for Add-SD: Rational Generation without Manual Reference
Figure 3 for Add-SD: Rational Generation without Manual Reference
Figure 4 for Add-SD: Rational Generation without Manual Reference
Viaarxiv icon