Picture for Xi Li

Xi Li

Mark

Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection

Add code
Jul 23, 2025
Viaarxiv icon

Long-Tailed Distribution-Aware Router For Mixture-of-Experts in Large Vision-Language Model

Add code
Jul 02, 2025
Viaarxiv icon

SphereDrag: Spherical Geometry-Aware Panoramic Image Editing

Add code
Jun 13, 2025
Viaarxiv icon

DSG-World: Learning a 3D Gaussian World Model from Dual State Videos

Add code
Jun 05, 2025
Viaarxiv icon

NeuroGen: Neural Network Parameter Generation via Large Language Models

Add code
May 18, 2025
Viaarxiv icon

PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning

Add code
May 16, 2025
Viaarxiv icon

Mitigating Image Captioning Hallucinations in Vision-Language Models

Add code
May 06, 2025
Viaarxiv icon

Visibility-Uncertainty-guided 3D Gaussian Inpainting via Scene Conceptional Learning

Add code
Apr 23, 2025
Viaarxiv icon

RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements

Add code
Apr 11, 2025
Figure 1 for RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
Figure 2 for RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
Figure 3 for RealCam-Vid: High-resolution Video Dataset with Dynamic Scenes and Metric-scale Camera Movements
Viaarxiv icon

All Patches Matter, More Patches Better: Enhance AI-Generated Image Detection via Panoptic Patch Learning

Add code
Apr 02, 2025
Viaarxiv icon