
Sungroh Yoon

Unsupervised Homography Estimation on Multimodal Image Pair via Alternating Optimization

Nov 20, 2024

Interpretable Language Modeling via Induction-head Ngram Models

Oct 31, 2024

Semantic Token Reweighting for Interpretable and Controllable Text Embeddings in CLIP

Oct 11, 2024

Unleashing Multi-Hop Reasoning Potential in Large Language Models through Repetition of Misordered Context

Oct 09, 2024

Textual Training for the Hassle-Free Removal of Unwanted Visual Data

Sep 30, 2024

NanoVoice: Efficient Speaker-Adaptive Text-to-Speech for Multiple Speakers

Sep 24, 2024

VoiceGuider: Enhancing Out-of-Domain Performance in Parameter-Efficient Speaker-Adaptive Text-to-Speech via Autoguidance

Sep 24, 2024

VoiceTailor: Lightweight Plug-In Adapter for Diffusion-Based Personalized Text-to-Speech

Aug 27, 2024

Unlocking Intrinsic Fairness in Stable Diffusion

Aug 22, 2024

CKNN: Cleansed k-Nearest Neighbor for Unsupervised Video Anomaly Detection

Aug 06, 2024