Picture for Yang Zhang

Yang Zhang

University of Science and Technology of China

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

Add code
Jul 30, 2024
Viaarxiv icon

GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting

Add code
Jul 29, 2024
Figure 1 for GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting
Figure 2 for GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting
Figure 3 for GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting
Figure 4 for GradCraft: Elevating Multi-task Recommendations through Holistic Gradient Crafting
Viaarxiv icon

Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective

Add code
Jul 24, 2024
Figure 1 for Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
Figure 2 for Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
Figure 3 for Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
Figure 4 for Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective
Viaarxiv icon

SeqMIA: Sequential-Metric Based Membership Inference Attack

Add code
Jul 21, 2024
Viaarxiv icon

Towards Understanding Unsafe Video Generation

Add code
Jul 17, 2024
Viaarxiv icon

ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization

Add code
Jul 09, 2024
Figure 1 for ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization
Figure 2 for ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization
Figure 3 for ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization
Figure 4 for ICLGuard: Controlling In-Context Learning Behavior for Applicability Authorization
Viaarxiv icon

Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

Add code
Jul 05, 2024
Figure 1 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 2 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 3 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Figure 4 for Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Viaarxiv icon

SOS! Soft Prompt Attack Against Open-Source Large Language Models

Add code
Jul 03, 2024
Figure 1 for SOS! Soft Prompt Attack Against Open-Source Large Language Models
Figure 2 for SOS! Soft Prompt Attack Against Open-Source Large Language Models
Figure 3 for SOS! Soft Prompt Attack Against Open-Source Large Language Models
Figure 4 for SOS! Soft Prompt Attack Against Open-Source Large Language Models
Viaarxiv icon

VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs

Add code
Jul 02, 2024
Figure 1 for VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Figure 2 for VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Figure 3 for VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Figure 4 for VSP: Assessing the dual challenges of perception and reasoning in spatial planning tasks for VLMs
Viaarxiv icon

Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks

Add code
Jul 01, 2024
Figure 1 for Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Figure 2 for Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Figure 3 for Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Figure 4 for Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks
Viaarxiv icon