Picture for Yuan Wang

Yuan Wang

Data-Efficient RLVR via Off-Policy Influence Guidance

Add code
Oct 30, 2025
Viaarxiv icon

Modest-Align: Data-Efficient Alignment for Vision-Language Models

Add code
Oct 24, 2025
Viaarxiv icon

A Case for Declarative LLM-friendly Interfaces for Improved Efficiency of Computer-Use Agents

Add code
Oct 06, 2025
Viaarxiv icon

TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits

Add code
Sep 17, 2025
Viaarxiv icon

Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Add code
Sep 03, 2025
Viaarxiv icon

MedReseacher-R1: Expert-Level Medical Deep Researcher via A Knowledge-Informed Trajectory Synthesis Framework

Add code
Aug 20, 2025
Viaarxiv icon

Improving Learning of New Diseases through Knowledge-Enhanced Initialization for Federated Adapter Tuning

Add code
Aug 14, 2025
Figure 1 for Improving Learning of New Diseases through Knowledge-Enhanced Initialization for Federated Adapter Tuning
Figure 2 for Improving Learning of New Diseases through Knowledge-Enhanced Initialization for Federated Adapter Tuning
Figure 3 for Improving Learning of New Diseases through Knowledge-Enhanced Initialization for Federated Adapter Tuning
Figure 4 for Improving Learning of New Diseases through Knowledge-Enhanced Initialization for Federated Adapter Tuning
Viaarxiv icon

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Add code
Jul 02, 2025
Figure 1 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 2 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 3 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Figure 4 for GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
Viaarxiv icon

CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making

Add code
Jun 15, 2025
Figure 1 for CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
Figure 2 for CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
Figure 3 for CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
Figure 4 for CAPO: Reinforcing Consistent Reasoning in Medical Decision-Making
Viaarxiv icon

Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning

Add code
Jun 14, 2025
Figure 1 for Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
Figure 2 for Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
Figure 3 for Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
Figure 4 for Med-U1: Incentivizing Unified Medical Reasoning in LLMs via Large-scale Reinforcement Learning
Viaarxiv icon