Picture for Jungong Han

Jungong Han

Generalizable 3D Gaussian Splatting enabled Semantic Coding for Real-Time Immersive Video Communications

Add code
Apr 28, 2026
Viaarxiv icon

Reinforcing 3D Understanding in Point-VLMs via Geometric Reward Credit Assignment

Add code
Apr 23, 2026
Viaarxiv icon

HarmoniDiff-RS: Training-Free Diffusion Harmonization for Satellite Image Composition

Add code
Apr 21, 2026
Viaarxiv icon

Cognitive Pivot Points and Visual Anchoring: Unveiling and Rectifying Hallucinations in Multimodal Reasoning Models

Add code
Apr 11, 2026
Viaarxiv icon

Re-Prompting SAM 3 via Object Retrieval: 3rd of the 5th PVUW MOSE Track

Add code
Mar 24, 2026
Viaarxiv icon

Mostly Text, Smart Visuals: Asymmetric Text-Visual Pruning for Large Vision-Language Models

Add code
Mar 16, 2026
Viaarxiv icon

Show Me When and Where: Towards Referring Video Object Segmentation in the Wild

Add code
Mar 15, 2026
Viaarxiv icon

Improving Anomaly Detection with Foundation-Model Synthesis and Wavelet-Domain Attention

Add code
Mar 03, 2026
Viaarxiv icon

ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization

Add code
Mar 03, 2026
Viaarxiv icon

Controllable Exploration in Hybrid-Policy RLVR for Multi-Modal Reasoning

Add code
Feb 22, 2026
Viaarxiv icon