Jie Song

AMIF: Authorizable Medical Image Fusion Model with Built-in Authentication

Mar 25, 2026

Rethinking Token Reduction for Large Vision-Language Models

Mar 23, 2026

Semi-supervised Latent Disentangled Diffusion Model for Textile Pattern Generation

Mar 17, 2026

$D^3$-RSMDE: 40$\times$ Faster and High-Fidelity Remote Sensing Monocular Depth Estimation

Mar 17, 2026

DriveFix: Spatio-Temporally Coherent Driving Scene Restoration

Mar 17, 2026

Gaussian Wardrobe: Compositional 3D Gaussian Avatars for Free-Form Virtual Try-On

Mar 05, 2026

Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty

Feb 24, 2026

SpatiaLQA: A Benchmark for Evaluating Spatial Logical Reasoning in Vision-Language Models

Feb 24, 2026

Reasoning and Tool-use Compete in Agentic RL: From Quantifying Interference to Disentangled Tuning

Feb 01, 2026

From Rays to Projections: Better Inputs for Feed-Forward View Synthesis

Jan 08, 2026