Picture for Jungong Han

Jungong Han

QuoVLA: Quotient Space for Vision-Language-Action Models

Add code
May 24, 2026
Viaarxiv icon

Efficient Learned Image Compression without Entropy Coding

Add code
May 22, 2026
Viaarxiv icon

Generalizable 3D Gaussian Splatting enabled Semantic Coding for Real-Time Immersive Video Communications

Add code
Apr 28, 2026
Viaarxiv icon

Reinforcing 3D Understanding in Point-VLMs via Geometric Reward Credit Assignment

Add code
Apr 23, 2026
Viaarxiv icon

HarmoniDiff-RS: Training-Free Diffusion Harmonization for Satellite Image Composition

Add code
Apr 21, 2026
Viaarxiv icon

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

Add code
Apr 11, 2026
Viaarxiv icon

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Add code
Mar 24, 2026
Viaarxiv icon

Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models

Add code
Mar 16, 2026
Viaarxiv icon

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Add code
Mar 15, 2026
Viaarxiv icon

Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention

Add code
Mar 03, 2026
Viaarxiv icon