Hyunjun Kim

DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes

May 29, 2025

Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images

May 29, 2025

Rethinking LayerNorm in Image Restoration Transformers

Apr 09, 2025

Fine-Tuning Visual Autoregressive Models for Subject-Driven Generation

Apr 03, 2025

Optimizing Retrieval Strategies for Financial Question Answering Documents in Retrieval-Augmented Generation Systems

Mar 19, 2025

One-Shot is Enough: Consolidating Multi-Turn Attacks into Efficient Single-Turn Prompts for LLMs

Mar 06, 2025

Look Every Frame All at Once: Video-Ma$^2$mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing

Nov 29, 2024

SALOVA: Segment-Augmented Long Video Assistant for Targeted Retrieval and Routing in Long-Form Video Analysis

Nov 25, 2024

Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language

Sep 02, 2024

CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models

Jun 04, 2024