Xin Jin

Vision-Centric Activation and Coordination for Multimodal Large Language Models

Oct 16, 2025

UltraLED: Learning to See Everything in Ultra-High Dynamic Range Scenes

Oct 09, 2025

PrismGS: Physically-Grounded Anti-Aliasing for High-Fidelity Large-Scale 3D Gaussian Splatting

Oct 09, 2025

ANTS: Shaping the Adaptive Negative Textual Space by MLLM for OOD Detection

Sep 04, 2025

TokenLake: A Unified Segment-level Prefix Cache Pool for Fine-grained Elastic Long-Context LLM Serving

Aug 24, 2025

Structure-preserving Feature Alignment for Old Photo Colorization

Aug 18, 2025

Perceiving and Acting in First-Person: A Dataset and Benchmark for Egocentric Human-Object-Human Interactions

Aug 06, 2025

Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning

Jun 05, 2025

MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production

May 19, 2025

NTIRE 2025 Challenge on Efficient Burst HDR and Restoration: Datasets, Methods, and Results

May 17, 2025