Picture for Junho Kim

Junho Kim

Narrative-Driven Paper-to-Slide Generation via ArcDeck

Add code
Apr 13, 2026
Viaarxiv icon

Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos

Add code
Mar 30, 2026
Viaarxiv icon

STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding

Add code
Mar 29, 2026
Viaarxiv icon

Grounding World Simulation Models in a Real-World Metropolis

Add code
Mar 16, 2026
Viaarxiv icon

CAPA: Contribution-Aware Pruning and FFN Approximation for Efficient Large Vision-Language Models

Add code
Jan 30, 2026
Viaarxiv icon

Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs

Add code
Sep 09, 2025
Figure 1 for Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Figure 2 for Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Figure 3 for Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Figure 4 for Video Parallel Scaling: Aggregating Diverse Frame Subsets for VideoLLMs
Viaarxiv icon

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

Add code
Jun 13, 2025
Viaarxiv icon

DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes

Add code
May 29, 2025
Viaarxiv icon

Enhancing Creative Generation on Stable Diffusion-based Models

Add code
Mar 30, 2025
Viaarxiv icon

Learning 3D Scene Analogies with Neural Contextual Scene Maps

Add code
Mar 20, 2025
Figure 1 for Learning 3D Scene Analogies with Neural Contextual Scene Maps
Figure 2 for Learning 3D Scene Analogies with Neural Contextual Scene Maps
Figure 3 for Learning 3D Scene Analogies with Neural Contextual Scene Maps
Figure 4 for Learning 3D Scene Analogies with Neural Contextual Scene Maps
Viaarxiv icon