Picture for Tae-Hyun Oh

Tae-Hyun Oh

POSTECH

Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment

Add code
Jul 18, 2024
Viaarxiv icon

BEAF: Observing BEfore-AFter Changes to Evaluate Hallucination in Vision-language Models

Add code
Jul 18, 2024
Viaarxiv icon

Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert

Add code
Jul 01, 2024
Viaarxiv icon

MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset

Add code
Jun 20, 2024
Viaarxiv icon

Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild

Add code
Mar 21, 2024
Figure 1 for Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild
Figure 2 for Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild
Figure 3 for Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild
Figure 4 for Object-Centric Domain Randomization for 3D Shape Reconstruction in the Wild
Viaarxiv icon

Revisiting Learning-based Video Motion Magnification for Real-time Processing

Add code
Mar 04, 2024
Figure 1 for Revisiting Learning-based Video Motion Magnification for Real-time Processing
Figure 2 for Revisiting Learning-based Video Motion Magnification for Real-time Processing
Figure 3 for Revisiting Learning-based Video Motion Magnification for Real-time Processing
Figure 4 for Revisiting Learning-based Video Motion Magnification for Real-time Processing
Viaarxiv icon

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Add code
Feb 07, 2024
Viaarxiv icon

FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields

Add code
Jan 10, 2024
Figure 1 for FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
Figure 2 for FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
Figure 3 for FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
Figure 4 for FPRF: Feed-Forward Photorealistic Style Transfer of Large-Scale 3D Neural Radiance Fields
Viaarxiv icon

Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering

Add code
Dec 18, 2023
Figure 1 for Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
Figure 2 for Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
Figure 3 for Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
Figure 4 for Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering
Viaarxiv icon

SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models

Add code
Dec 15, 2023
Viaarxiv icon