Picture for Yu-Gang Jiang

Yu-Gang Jiang

RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base

Add code
Jun 23, 2025
Viaarxiv icon

NAP-Tuning: Neural Augmented Prompt Tuning for Adversarially Robust Vision-Language Models

Add code
Jun 15, 2025
Viaarxiv icon

GenBreak: Red Teaming Text-to-Image Generators Using Large Language Models

Add code
Jun 11, 2025
Viaarxiv icon

Reasoning Models Are More Easily Gaslighted Than You Think

Add code
Jun 11, 2025
Viaarxiv icon

Domain-RAG: Retrieval-Guided Compositional Image Generation for Cross-Domain Few-Shot Object Detection

Add code
Jun 06, 2025
Viaarxiv icon

You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping

Add code
Jun 06, 2025
Viaarxiv icon

CreatiDesign: A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design

Add code
May 25, 2025
Viaarxiv icon

OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks

Add code
May 24, 2025
Viaarxiv icon

MLLMs are Deeply Affected by Modality Bias

Add code
May 24, 2025
Viaarxiv icon

SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

Add code
May 24, 2025
Viaarxiv icon