Picture for Wenjun Wu

Wenjun Wu

VGGT-Segmentor: Geometry-Enhanced Cross-View Segmentation

Add code
Apr 16, 2026
Viaarxiv icon

RailVQA: A Benchmark and Framework for Efficient Interpretable Visual Cognition in Automatic Train Operation

Add code
Mar 28, 2026
Viaarxiv icon

Z-Erase: Enabling Concept Erasure in Single-Stream Diffusion Transformers

Add code
Mar 26, 2026
Viaarxiv icon

TAMTRL: Teacher-Aligned Reward Reshaping for Multi-Turn Reinforcement Learning in Long-Context Compression

Add code
Mar 23, 2026
Viaarxiv icon

EventMemAgent: Hierarchical Event-Centric Memory for Online Video Understanding with Adaptive Tool Use

Add code
Feb 17, 2026
Viaarxiv icon

GPO: Growing Policy Optimization for Legged Robot Locomotion and Whole-Body Control

Add code
Jan 28, 2026
Viaarxiv icon

FARE: Fast-Slow Agentic Robotic Exploration

Add code
Jan 21, 2026
Viaarxiv icon

RANGER: A Monocular Zero-Shot Semantic Navigation Framework through Contextual Adaptation

Add code
Dec 30, 2025
Viaarxiv icon

RLLaVA: An RL-central Framework for Language and Vision Assistants

Add code
Dec 25, 2025
Figure 1 for RLLaVA: An RL-central Framework for Language and Vision Assistants
Figure 2 for RLLaVA: An RL-central Framework for Language and Vision Assistants
Figure 3 for RLLaVA: An RL-central Framework for Language and Vision Assistants
Figure 4 for RLLaVA: An RL-central Framework for Language and Vision Assistants
Viaarxiv icon

Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving

Add code
Dec 22, 2025
Viaarxiv icon