Picture for Dawn Song

Dawn Song

University of California, Berkeley

MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models

Add code
Nov 19, 2024
Figure 1 for MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models
Figure 2 for MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models
Figure 3 for MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models
Figure 4 for MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models
Viaarxiv icon

RedCode: Risky Code Execution and Generation Benchmark for Code Agents

Add code
Nov 12, 2024
Figure 1 for RedCode: Risky Code Execution and Generation Benchmark for Code Agents
Figure 2 for RedCode: Risky Code Execution and Generation Benchmark for Code Agents
Figure 3 for RedCode: Risky Code Execution and Generation Benchmark for Code Agents
Figure 4 for RedCode: Risky Code Execution and Generation Benchmark for Code Agents
Viaarxiv icon

Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters

Add code
Oct 31, 2024
Figure 1 for Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters
Figure 2 for Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters
Figure 3 for Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters
Figure 4 for Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters
Viaarxiv icon

CTINEXUS: Leveraging Optimized LLM In-Context Learning for Constructing Cybersecurity Knowledge Graphs Under Data Scarcity

Add code
Oct 28, 2024
Viaarxiv icon

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Add code
Oct 14, 2024
Figure 1 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 2 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 3 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Figure 4 for SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI
Viaarxiv icon

An undetectable watermark for generative image models

Add code
Oct 09, 2024
Figure 1 for An undetectable watermark for generative image models
Figure 2 for An undetectable watermark for generative image models
Figure 3 for An undetectable watermark for generative image models
Figure 4 for An undetectable watermark for generative image models
Viaarxiv icon

Multimodal Situational Safety

Add code
Oct 08, 2024
Figure 1 for Multimodal Situational Safety
Figure 2 for Multimodal Situational Safety
Figure 3 for Multimodal Situational Safety
Figure 4 for Multimodal Situational Safety
Viaarxiv icon

LLM-PBE: Assessing Data Privacy in Large Language Models

Add code
Aug 23, 2024
Figure 1 for LLM-PBE: Assessing Data Privacy in Large Language Models
Figure 2 for LLM-PBE: Assessing Data Privacy in Large Language Models
Figure 3 for LLM-PBE: Assessing Data Privacy in Large Language Models
Figure 4 for LLM-PBE: Assessing Data Privacy in Large Language Models
Viaarxiv icon

Tamper-Resistant Safeguards for Open-Weight LLMs

Add code
Aug 01, 2024
Figure 1 for Tamper-Resistant Safeguards for Open-Weight LLMs
Figure 2 for Tamper-Resistant Safeguards for Open-Weight LLMs
Figure 3 for Tamper-Resistant Safeguards for Open-Weight LLMs
Figure 4 for Tamper-Resistant Safeguards for Open-Weight LLMs
Viaarxiv icon

Can Editing LLMs Inject Harm?

Add code
Jul 29, 2024
Figure 1 for Can Editing LLMs Inject Harm?
Figure 2 for Can Editing LLMs Inject Harm?
Figure 3 for Can Editing LLMs Inject Harm?
Figure 4 for Can Editing LLMs Inject Harm?
Viaarxiv icon