Picture for Jungong Han

Jungong Han

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Add code
Mar 24, 2026
Viaarxiv icon

Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models

Add code
Mar 16, 2026
Viaarxiv icon

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Add code
Mar 15, 2026
Viaarxiv icon

Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention

Add code
Mar 03, 2026
Viaarxiv icon

ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization

Add code
Mar 03, 2026
Viaarxiv icon

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

Add code
Feb 22, 2026
Viaarxiv icon

SAM-Body4D: Training-Free 4D Human Body Mesh Recovery from Videos

Add code
Dec 09, 2025
Viaarxiv icon

PruneHal: Reducing Hallucinations in Multi-modal Large Language Models through Adaptive KV Cache Pruning

Add code
Oct 22, 2025
Viaarxiv icon

Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model

Add code
Sep 09, 2025
Figure 1 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 2 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 3 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Figure 4 for Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model
Viaarxiv icon

Unlocking the Potential of Diffusion Priors in Blind Face Restoration

Add code
Aug 12, 2025
Figure 1 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 2 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 3 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Figure 4 for Unlocking the Potential of Diffusion Priors in Blind Face Restoration
Viaarxiv icon