Picture for Diyi Yang

Diyi Yang

Stanford University

Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors

Add code
Nov 12, 2024
Figure 1 for Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Figure 2 for Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Figure 3 for Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Figure 4 for Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Viaarxiv icon

Attacking Vision-Language Computer Agents via Pop-ups

Add code
Nov 04, 2024
Figure 1 for Attacking Vision-Language Computer Agents via Pop-ups
Figure 2 for Attacking Vision-Language Computer Agents via Pop-ups
Figure 3 for Attacking Vision-Language Computer Agents via Pop-ups
Figure 4 for Attacking Vision-Language Computer Agents via Pop-ups
Viaarxiv icon

Personalization of Large Language Models: A Survey

Add code
Oct 29, 2024
Viaarxiv icon

Sketch2Code: Evaluating Vision-Language Models for Interactive Web Design Prototyping

Add code
Oct 21, 2024
Viaarxiv icon

SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

Add code
Oct 04, 2024
Figure 1 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 2 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 3 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Figure 4 for SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?
Viaarxiv icon

Distilling an End-to-End Voice Assistant Without Instruction Training Data

Add code
Oct 03, 2024
Figure 1 for Distilling an End-to-End Voice Assistant Without Instruction Training Data
Figure 2 for Distilling an End-to-End Voice Assistant Without Instruction Training Data
Figure 3 for Distilling an End-to-End Voice Assistant Without Instruction Training Data
Figure 4 for Distilling an End-to-End Voice Assistant Without Instruction Training Data
Viaarxiv icon

Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers

Add code
Sep 06, 2024
Figure 1 for Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Figure 2 for Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Figure 3 for Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Figure 4 for Can LLMs Generate Novel Research Ideas? A Large-Scale Human Study with 100+ NLP Researchers
Viaarxiv icon

PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action

Add code
Aug 29, 2024
Figure 1 for PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Figure 2 for PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Figure 3 for PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Figure 4 for PrivacyLens: Evaluating Privacy Norm Awareness of Language Models in Action
Viaarxiv icon

Demystifying Verbatim Memorization in Large Language Models

Add code
Jul 25, 2024
Figure 1 for Demystifying Verbatim Memorization in Large Language Models
Figure 2 for Demystifying Verbatim Memorization in Large Language Models
Figure 3 for Demystifying Verbatim Memorization in Large Language Models
Figure 4 for Demystifying Verbatim Memorization in Large Language Models
Viaarxiv icon

Are Large Language Models Consistent over Value-laden Questions?

Add code
Jul 03, 2024
Figure 1 for Are Large Language Models Consistent over Value-laden Questions?
Figure 2 for Are Large Language Models Consistent over Value-laden Questions?
Figure 3 for Are Large Language Models Consistent over Value-laden Questions?
Figure 4 for Are Large Language Models Consistent over Value-laden Questions?
Viaarxiv icon