Picture for Yu-Gang Jiang

Yu-Gang Jiang

StableAvatar: Infinite-Length Audio-Driven Avatar Video Generation

Add code
Aug 11, 2025
Viaarxiv icon

MOSEv2: A More Challenging Dataset for Video Object Segmentation in Complex Scenes

Add code
Aug 07, 2025
Viaarxiv icon

Multimodal Referring Segmentation: A Survey

Add code
Aug 01, 2025
Viaarxiv icon

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Add code
Jul 30, 2025
Viaarxiv icon

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Add code
Jun 23, 2025
Viaarxiv icon

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Add code
Jun 11, 2025
Viaarxiv icon

Reasoning Models Are More Easily Gaslighted Than You Think

Add code
Jun 11, 2025
Viaarxiv icon

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection

Add code
Jun 06, 2025
Viaarxiv icon

You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping

Add code
Jun 06, 2025
Viaarxiv icon