Picture for Wen Li

Wen Li

PokeGym: A Visually-Driven Long-Horizon Benchmark for Vision-Language Models

Add code
Apr 09, 2026
Viaarxiv icon

Deformation-based In-Context Learning for Point Cloud Understanding

Add code
Apr 03, 2026
Viaarxiv icon

The Geometry of Compromise: Unlocking Generative Capabilities via Controllable Modality Alignment

Add code
Mar 31, 2026
Viaarxiv icon

ToLL: Topological Layout Learning with Structural Multi-view Augmentation for 3D Scene Graph Pretraining

Add code
Mar 30, 2026
Viaarxiv icon

Chain of Event-Centric Causal Thought for Physically Plausible Video Generation

Add code
Mar 10, 2026
Viaarxiv icon

CAE-AV: Improving Audio-Visual Learning via Cross-modal Interactive Enrichment

Add code
Feb 09, 2026
Viaarxiv icon

Instance-Free Domain Adaptive Object Detection

Add code
Feb 06, 2026
Viaarxiv icon

From Single Scan to Sequential Consistency: A New Paradigm for LIDAR Relocalization

Add code
Feb 03, 2026
Viaarxiv icon

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples

Add code
Dec 28, 2025
Viaarxiv icon

The Devil is in Attention Sharing: Improving Complex Non-rigid Image Editing Faithfulness via Attention Synergy

Add code
Dec 17, 2025
Viaarxiv icon