Picture for Xin Chen

Xin Chen

Univ. California, Santa Barbara

MMBU: A Massive Multi-modal Biomedical Understanding Benchmark to Probe the Perception Capabilities of Vision-Language Models

Add code
Jun 04, 2026
Viaarxiv icon

Beyond Output Matching: Preserving Internal Geometry in NVFP4 LLM Distillatio

Add code
Jun 04, 2026
Viaarxiv icon

Diffusion-Based Heart Sound Generation: Evaluation with Physiological Signal Metrics, Classifiers, and Expert Listening

Add code
Jun 01, 2026
Viaarxiv icon

IMAC-AgriVLN: Can Agricultural Vision-and-Language Navigation Agents be Aware of Instruction Mistakes?

Add code
Jun 01, 2026
Viaarxiv icon

OmniMatBench: A Human-Calibrated Multimodal Reasoning Benchmark Across 19 Materials Science Subfields

Add code
May 28, 2026
Viaarxiv icon

On-Policy Replay for Continual Supervised Fine-Tuning

Add code
May 28, 2026
Viaarxiv icon

How Far Has AI Come in Liver Fibrosis Staging? A Large-Scale Real-World Dataset and Benchmark

Add code
May 25, 2026
Viaarxiv icon

MIND: Multi-Scale Intent Diffusion for Text-Driven Physics-Based Humanoid Control

Add code
May 25, 2026
Viaarxiv icon

LANG: Reinforcement Learning for Multilingual Reasoning with Language-Adaptive Hint Guidance

Add code
May 21, 2026
Viaarxiv icon

MTR-Suite: A Framework for Evaluating and Synthesizing Conversational Retrieval Benchmarks

Add code
May 20, 2026
Viaarxiv icon