Picture for Yu Fu

Yu Fu

Department of Information Security, Naval University of Engineering, Wuhan, Hubei, 430033, China

PET-F2I: A Comprehensive Benchmark and Parameter-Efficient Fine-Tuning of LLMs for PET/CT Report Impression Generation

Add code
Mar 11, 2026
Viaarxiv icon

From Flow to One Step: Real-Time Multi-Modal Trajectory Policies via Implicit Maximum Likelihood Estimation-based Distribution Distillation

Add code
Mar 10, 2026
Viaarxiv icon

MOSAIC: A Unified Platform for Cross-Paradigm Comparison and Evaluation of Homogeneous and Heterogeneous Multi-Agent RL, LLM, VLM, and Human Decision-Makers

Add code
Mar 01, 2026
Viaarxiv icon

Overton Pluralistic Reinforcement Learning for Large Language Models

Add code
Feb 24, 2026
Viaarxiv icon

Is Reasoning Capability Enough for Safety in Long-Context Language Models?

Add code
Feb 09, 2026
Viaarxiv icon

LingLanMiDian: Systematic Evaluation of LLMs on TCM Knowledge and Clinical Reasoning

Add code
Feb 02, 2026
Viaarxiv icon

Thinking Out of Order: When Output Order Stops Reflecting Reasoning Order in Diffusion Language Models

Add code
Jan 29, 2026
Viaarxiv icon

M4Diffuser: Multi-View Diffusion Policy with Manipulability-Aware Control for Robust Mobile Manipulation

Add code
Sep 18, 2025
Viaarxiv icon

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?

Add code
Sep 04, 2025
Viaarxiv icon