Picture for Xiwen Chen

Xiwen Chen

S-SPPO: Semantic-Calibrated Self-Play Preference Optimization

Add code
Jun 01, 2026
Viaarxiv icon

Mags-RL: Wearing Multimodal LLMs a Magnifying Glass via Agentic Reinforcement Learning For Complex Scene Reasoning

Add code
May 27, 2026
Viaarxiv icon

OphIn-500K: Curating Web-Scale Visual Instructions for Scaling Ophthalmic Multimodal Large Language Models

Add code
May 27, 2026
Viaarxiv icon

Closed-Loop Bidirectional Prompting for Adversarial Robustness of Vision Language Models

Add code
May 25, 2026
Viaarxiv icon

Your Agent is More Brittle Than You Think: Uncovering Indirect Injection Vulnerabilities in Agentic LLMs

Add code
Apr 04, 2026
Viaarxiv icon

Bridging Restoration and Diagnosis: A Comprehensive Benchmark for Retinal Fundus Enhancement

Add code
Apr 04, 2026
Viaarxiv icon

SODA: Semi On-Policy Black-Box Distillation for Large Language Models

Add code
Apr 04, 2026
Viaarxiv icon

OTPrune: Distribution-Aligned Visual Token Pruning via Optimal Transport

Add code
Feb 25, 2026
Viaarxiv icon

AHA: Aligning Large Audio-Language Models for Reasoning Hallucinations via Counterfactual Hard Negatives

Add code
Dec 30, 2025
Viaarxiv icon

Factorized Transport Alignment for Multimodal and Multiview E-commerce Representation Learning

Add code
Dec 19, 2025
Viaarxiv icon