Picture for Yanan Wu

Yanan Wu

ShopSimulator: Evaluating and Exploring RL-Driven LLM Agent for Shopping Assistants

Add code
Jan 26, 2026
Viaarxiv icon

One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling

Add code
Jan 06, 2026
Viaarxiv icon

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Add code
Dec 31, 2025
Viaarxiv icon

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models

Add code
Nov 07, 2025
Viaarxiv icon

$μ$NeuFMT: Optical-Property-Adaptive Fluorescence Molecular Tomography via Implicit Neural Representation

Add code
Nov 06, 2025
Viaarxiv icon

A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction

Add code
Jul 16, 2025
Figure 1 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
Figure 2 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
Figure 3 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
Figure 4 for A Multimodal Data Fusion Generative Adversarial Network for Real Time Underwater Sound Speed Field Construction
Viaarxiv icon

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Add code
Jun 06, 2025
Viaarxiv icon

DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models

Add code
Apr 25, 2025
Figure 1 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 2 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 3 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Figure 4 for DREAM: Disentangling Risks to Enhance Safety Alignment in Multimodal Large Language Models
Viaarxiv icon

ProgCo: Program Helps Self-Correction of Large Language Models

Add code
Jan 02, 2025
Viaarxiv icon

V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy

Add code
Dec 23, 2024
Figure 1 for V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Figure 2 for V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Figure 3 for V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Figure 4 for V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Viaarxiv icon