Picture for Dogucan Yaman

Dogucan Yaman

CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding

Add code
Jul 29, 2025
Viaarxiv icon

Mask-Free Audio-driven Talking Face Generation for Enhanced Visual Quality and Identity Preservation

Add code
Jul 28, 2025
Viaarxiv icon

Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck

Add code
Oct 15, 2024
Figure 1 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 2 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 3 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Figure 4 for Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck
Viaarxiv icon

Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation

Add code
May 07, 2024
Viaarxiv icon

Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow

Add code
Jul 18, 2023
Figure 1 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 2 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 3 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Figure 4 for Plug the Leaks: Advancing Audio-driven Talking Face Generation by Preventing Unintended Information Flow
Viaarxiv icon

Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

Add code
Jun 09, 2022
Figure 1 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 2 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 3 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Figure 4 for Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos
Viaarxiv icon

Exposure Correction Model to Enhance Image Quality

Add code
Apr 22, 2022
Figure 1 for Exposure Correction Model to Enhance Image Quality
Figure 2 for Exposure Correction Model to Enhance Image Quality
Figure 3 for Exposure Correction Model to Enhance Image Quality
Figure 4 for Exposure Correction Model to Enhance Image Quality
Viaarxiv icon

Alpha Matte Generation from Single Input for Portrait Matting

Add code
Jun 14, 2021
Figure 1 for Alpha Matte Generation from Single Input for Portrait Matting
Figure 2 for Alpha Matte Generation from Single Input for Portrait Matting
Figure 3 for Alpha Matte Generation from Single Input for Portrait Matting
Figure 4 for Alpha Matte Generation from Single Input for Portrait Matting
Viaarxiv icon

CAGAN: Text-To-Image Generation with Combined Attention GANs

Add code
Apr 26, 2021
Figure 1 for CAGAN: Text-To-Image Generation with Combined Attention GANs
Figure 2 for CAGAN: Text-To-Image Generation with Combined Attention GANs
Figure 3 for CAGAN: Text-To-Image Generation with Combined Attention GANs
Figure 4 for CAGAN: Text-To-Image Generation with Combined Attention GANs
Viaarxiv icon

Ear2Face: Deep Biometric Modality Mapping

Add code
Jun 02, 2020
Figure 1 for Ear2Face: Deep Biometric Modality Mapping
Figure 2 for Ear2Face: Deep Biometric Modality Mapping
Figure 3 for Ear2Face: Deep Biometric Modality Mapping
Figure 4 for Ear2Face: Deep Biometric Modality Mapping
Viaarxiv icon