Picture for Jie Wu

Jie Wu

SINTEF Ocean

AlphaGRPO: Unlocking Self-Reflective Multimodal Generation in UMMs via Decompositional Verifiable Reward

Add code
May 12, 2026
Viaarxiv icon

ViPO: Visual Preference Optimization at Scale

Add code
Apr 27, 2026
Viaarxiv icon

Beyond Single-Agent Alignment: Preventing Context-Fragmented Violations in Multi-Agent Systems

Add code
Apr 24, 2026
Viaarxiv icon

NIM4-ASR: Towards Efficient, Robust, and Customizable Real-Time LLM-Based ASR

Add code
Apr 20, 2026
Viaarxiv icon

LeapAlign: Post-Training Flow Matching Models at Any Generation Step by Building Two-Step Trajectories

Add code
Apr 16, 2026
Viaarxiv icon

Seedance 2.0: Advancing Video Generation for World Complexity

Add code
Apr 15, 2026
Viaarxiv icon

Policy-Invisible Violations in LLM-Based Agents

Add code
Apr 14, 2026
Viaarxiv icon

Towards Long-horizon Agentic Multimodal Search

Add code
Apr 14, 2026
Viaarxiv icon

Rethinking Entropy Allocation in LLM-based ASR: Understanding the Dynamics between Speech Encoders and LLMs

Add code
Apr 09, 2026
Viaarxiv icon

UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation

Add code
Mar 24, 2026
Viaarxiv icon