Picture for Besmira Nushi

Besmira Nushi

Understanding Information Storage and Transfer in Multi-modal Large Language Models

Add code
Jun 06, 2024
Viaarxiv icon

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Add code
Apr 18, 2024
Figure 1 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 2 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 3 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Figure 4 for Introducing v0.5 of the AI Safety Benchmark from MLCommons
Viaarxiv icon

Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

Add code
Apr 09, 2024
Figure 1 for Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Figure 2 for Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Figure 3 for Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Figure 4 for Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models
Viaarxiv icon

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Add code
Oct 24, 2023
Figure 1 for KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Figure 2 for KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Figure 3 for KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Figure 4 for KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval
Viaarxiv icon

Diversity of Thought Improves Reasoning Abilities of Large Language Models

Add code
Oct 11, 2023
Figure 1 for Diversity of Thought Improves Reasoning Abilities of Large Language Models
Figure 2 for Diversity of Thought Improves Reasoning Abilities of Large Language Models
Figure 3 for Diversity of Thought Improves Reasoning Abilities of Large Language Models
Figure 4 for Diversity of Thought Improves Reasoning Abilities of Large Language Models
Viaarxiv icon

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models

Add code
Sep 26, 2023
Figure 1 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 2 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 3 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 4 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Viaarxiv icon

Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning

Add code
Apr 08, 2023
Figure 1 for Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning
Figure 2 for Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning
Figure 3 for Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning
Figure 4 for Mitigating Spurious Correlations in Multi-modal Models during Fine-tuning
Viaarxiv icon

Social Biases through the Text-to-Image Generation Lens

Add code
Mar 30, 2023
Figure 1 for Social Biases through the Text-to-Image Generation Lens
Figure 2 for Social Biases through the Text-to-Image Generation Lens
Figure 3 for Social Biases through the Text-to-Image Generation Lens
Figure 4 for Social Biases through the Text-to-Image Generation Lens
Viaarxiv icon

Benchmarking Spatial Relationships in Text-to-Image Generation

Add code
Dec 20, 2022
Figure 1 for Benchmarking Spatial Relationships in Text-to-Image Generation
Figure 2 for Benchmarking Spatial Relationships in Text-to-Image Generation
Figure 3 for Benchmarking Spatial Relationships in Text-to-Image Generation
Figure 4 for Benchmarking Spatial Relationships in Text-to-Image Generation
Viaarxiv icon

Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making

Add code
Aug 16, 2022
Figure 1 for Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making
Figure 2 for Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making
Figure 3 for Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making
Figure 4 for Advancing Human-AI Complementarity: The Impact of User Expertise and Algorithmic Tuning on Joint Decision Making
Viaarxiv icon