Yu-Gang Jiang

Fudan University

DiffPatch: Generating Customizable Adversarial Patches using Diffusion Model

Dec 02, 2024

ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation Detection

Nov 29, 2024

LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair

Nov 28, 2024

Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision

Nov 25, 2024

SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition

Nov 24, 2024

REDUCIO! Generating 1024×1024 Video within 16 Seconds using Extremely Compressed Motion Latents

Nov 20, 2024

Visual Cue Enhancement and Dual Low-Rank Adaptation for Efficient Visual Instruction Fine-Tuning

Nov 19, 2024

Retrieval Augmented Recipe Generation

Nov 13, 2024

Domain Expansion and Boundary Growth for Open-Set Single-Source Domain Generalization

Nov 05, 2024

IDEATOR: Jailbreaking VLMs Using VLMs

Oct 29, 2024