Picture for Yong Wang

Yong Wang

ReSum: Synergizing LLM Reasoning and Summarization with Reinforcement Learning

Add code
Jun 11, 2026
Viaarxiv icon

APPO: Agentic Procedural Policy Optimization

Add code
Jun 10, 2026
Viaarxiv icon

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Add code
Jun 09, 2026
Viaarxiv icon

DINO-GFSA: Geo-Localization via Semantic Gated Fusion and Mamba-based Sequential Aggregation

Add code
May 30, 2026
Viaarxiv icon

Implicit Action Chunking for Smooth Continuous Control

Add code
May 19, 2026
Viaarxiv icon

Robust LLM Unlearning Against Relearning Attacks: The Minor Components in Representations Matter

Add code
May 12, 2026
Viaarxiv icon

Learning Agentic Policy from Action Guidance

Add code
May 12, 2026
Viaarxiv icon

Decompose to Understand, Fuse to Detect: Frequency-Decoupled Anomaly Detection for Encrypted Network Traffic

Add code
May 03, 2026
Viaarxiv icon

Modeling LLM Unlearning as an Asymmetric Two-Task Learning Problem

Add code
Apr 16, 2026
Viaarxiv icon

Visual Enhanced Depth Scaling for Multimodal Latent Reasoning

Add code
Apr 12, 2026
Viaarxiv icon