Picture for Yinuo Wang

Yinuo Wang

ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment

Add code
Mar 24, 2026
Viaarxiv icon

Real-Time Generative Policy via Langevin-Guided Flow Matching for Autonomous Driving

Add code
Mar 03, 2026
Viaarxiv icon

OmniGAIA: Towards Native Omni-Modal AI Agents

Add code
Feb 26, 2026
Viaarxiv icon

STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens

Add code
Feb 17, 2026
Viaarxiv icon

TourPlanner: A Competitive Consensus Framework with Constraint-Gated Reinforcement Learning for Travel Planning

Add code
Jan 08, 2026
Viaarxiv icon

HeadLighter: Disentangling Illumination in Generative 3D Gaussian Heads via Lightstage Captures

Add code
Jan 05, 2026
Viaarxiv icon

PrivacyCD: Hierarchical Unlearning for Protecting Student Privacy in Cognitive Diagnosis

Add code
Nov 06, 2025
Viaarxiv icon

P-MIA: A Profiled-Based Membership Inference Attack on Cognitive Diagnosis Models

Add code
Nov 06, 2025
Figure 1 for P-MIA: A Profiled-Based Membership Inference Attack on Cognitive Diagnosis Models
Figure 2 for P-MIA: A Profiled-Based Membership Inference Attack on Cognitive Diagnosis Models
Figure 3 for P-MIA: A Profiled-Based Membership Inference Attack on Cognitive Diagnosis Models
Figure 4 for P-MIA: A Profiled-Based Membership Inference Attack on Cognitive Diagnosis Models
Viaarxiv icon

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Add code
Oct 24, 2025
Viaarxiv icon

Can SSD-Mamba2 Unlock Reinforcement Learning for End-to-End Motion Control?

Add code
Sep 09, 2025
Viaarxiv icon