Xin Xu

Advancing Multimodal Reasoning Capabilities of Multimodal Large Language Models via Visual Perception Reward

Jun 08, 2025
Via arXiv

Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification

Jun 05, 2025
Via arXiv

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

May 29, 2025
Via arXiv

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

May 21, 2025
Via arXiv

NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning

May 15, 2025
Via arXiv

From Air to Wear: Personalized 3D Digital Fashion with AR/VR Immersive 3D Sketching

May 15, 2025
Via arXiv

Versatile Distributed Maneuvering with Generalized Formations using Guiding Vector Fields

May 09, 2025
Via arXiv

RD-UIE: Relation-Driven State Space Modeling for Underwater Image Enhancement

May 02, 2025
Via arXiv

DAM-Net: Domain Adaptation Network with Micro-Labeled Fine-Tuning for Change Detection

Apr 18, 2025
Via arXiv

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Apr 15, 2025
Via arXiv