Picture for Ji Woo Hong

Ji Woo Hong

Decomposed On-Policy Distillation for Vision-Language Reasoning: Steering Gradients for Visual Grounding

Add code
May 30, 2026
Viaarxiv icon

PDCR: Perception-Decomposed Confidence Reward for Vision-Language Reasoning

Add code
May 13, 2026
Viaarxiv icon

High-Fidelity Text-to-Image Generation from Pre-Trained Vision-Language Models via Distribution-Conditioned Diffusion Decoding

Add code
Mar 11, 2026
Viaarxiv icon

A Hidden Semantic Bottleneck in Conditional Embeddings of Diffusion Transformers

Add code
Feb 25, 2026
Viaarxiv icon

TARO: Timestep-Adaptive Representation Alignment with Onset-Aware Conditioning for Synchronized Video-to-Audio Synthesis

Add code
Apr 08, 2025
Viaarxiv icon

ITA-MDT: Image-Timestep-Adaptive Masked Diffusion Transformer Framework for Image-Based Virtual Try-On

Add code
Mar 26, 2025
Viaarxiv icon

E-MD3C: Taming Masked Diffusion Transformers for Efficient Zero-Shot Object Customization

Add code
Feb 13, 2025
Viaarxiv icon

Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation

Add code
Aug 16, 2024
Figure 1 for Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Figure 2 for Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Figure 3 for Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Figure 4 for Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation
Viaarxiv icon

FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing

Add code
Jul 25, 2024
Figure 1 for FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
Figure 2 for FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
Figure 3 for FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
Figure 4 for FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing
Viaarxiv icon

Neutral Editing Framework for Diffusion-based Video Editing

Add code
Dec 10, 2023
Figure 1 for Neutral Editing Framework for Diffusion-based Video Editing
Figure 2 for Neutral Editing Framework for Diffusion-based Video Editing
Figure 3 for Neutral Editing Framework for Diffusion-based Video Editing
Figure 4 for Neutral Editing Framework for Diffusion-based Video Editing
Viaarxiv icon