Picture for Yuan Xie

Yuan Xie

ProMMSearchAgent: A Generalizable Multimodal Search Agent Trained with Process-Oriented Rewards

Add code
Apr 22, 2026
Viaarxiv icon

GSCompleter: A Distillation-Free Plugin for Metric-Aware 3D Gaussian Splatting Completion in Seconds

Add code
Apr 22, 2026
Viaarxiv icon

DR-MMSearchAgent: Deepening Reasoning in Multimodal Search Agents

Add code
Apr 21, 2026
Viaarxiv icon

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

Add code
Apr 20, 2026
Viaarxiv icon

DiPO: Disentangled Perplexity Policy Optimization for Fine-grained Exploration-Exploitation Trade-Off

Add code
Apr 15, 2026
Viaarxiv icon

Direct Segmentation without Logits Optimization for Training-Free Open-Vocabulary Semantic Segmentation

Add code
Apr 09, 2026
Viaarxiv icon

DailyArt: Discovering Articulation from Single Static Images via Latent Dynamics

Add code
Apr 09, 2026
Viaarxiv icon

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs

Add code
Apr 09, 2026
Viaarxiv icon

Face-D(^2)CL: Multi-Domain Synergistic Representation with Dual Continual Learning for Facial DeepFake Detection

Add code
Apr 09, 2026
Viaarxiv icon

Memory Intelligence Agent

Add code
Apr 07, 2026
Viaarxiv icon