Siheng Wang

DeCo-DETR: Decoupled Cognition DETR for efficient Open-Vocabulary Object Detection

Apr 03, 2026
Via arXiv

GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning

Feb 04, 2026
Via arXiv

SDCD: Structure-Disrupted Contrastive Decoding for Mitigating Hallucinations in Large Vision-Language Models

Jan 07, 2026
Via arXiv