Picture for Se Jin Park

Se Jin Park

Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation

Add code
Jun 12, 2024
Viaarxiv icon

Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation

Mar 07, 2024
Figure 1 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 2 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 3 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Figure 4 for Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation
Viaarxiv icon

Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units

Jan 18, 2024
Viaarxiv icon

AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation

Add code
Dec 05, 2023
Viaarxiv icon

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model

Oct 23, 2023
Figure 1 for Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
Figure 2 for Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
Viaarxiv icon

Reprogramming Audio-driven Talking Face Synthesis into Text-driven

Jun 28, 2023
Figure 1 for Reprogramming Audio-driven Talking Face Synthesis into Text-driven
Figure 2 for Reprogramming Audio-driven Talking Face Synthesis into Text-driven
Figure 3 for Reprogramming Audio-driven Talking Face Synthesis into Text-driven
Figure 4 for Reprogramming Audio-driven Talking Face Synthesis into Text-driven
Viaarxiv icon

Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation

May 31, 2023
Figure 1 for Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation
Figure 2 for Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation
Figure 3 for Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation
Figure 4 for Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation
Viaarxiv icon

SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory

Nov 03, 2022
Figure 1 for SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Figure 2 for SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Figure 3 for SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Figure 4 for SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory
Viaarxiv icon

Test-time Adaptation for Real Image Denoising via Meta-transfer Learning

Jul 05, 2022
Figure 1 for Test-time Adaptation for Real Image Denoising via Meta-transfer Learning
Figure 2 for Test-time Adaptation for Real Image Denoising via Meta-transfer Learning
Figure 3 for Test-time Adaptation for Real Image Denoising via Meta-transfer Learning
Figure 4 for Test-time Adaptation for Real Image Denoising via Meta-transfer Learning
Viaarxiv icon

Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video

Apr 04, 2022
Figure 1 for Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Figure 2 for Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Figure 3 for Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Figure 4 for Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video
Viaarxiv icon