Weijia Shi

BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Jul 16, 2024
Via arXiv

MUSE: Machine Unlearning Six-Way Evaluation for Language Models

Jul 08, 2024
Via arXiv

Predicting vs. Acting: A Trade-off Between World Modeling & Agent Modeling

Jul 02, 2024
Via arXiv

Evaluating Copyright Takedown Methods for Language Models

Jun 26, 2024
Via arXiv

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Jun 22, 2024
Via arXiv

Fantastic Copyrighted Beasts and How (Not) to Generate Them

Jun 20, 2024
Via arXiv

Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models

Jun 13, 2024
Via arXiv

AI Risk Management Should Incorporate Both Safety and Security

May 29, 2024
Via arXiv

Instruction-tuned Language Models are Better Knowledge Learners

Feb 20, 2024
Via arXiv

Do Membership Inference Attacks Work on Large Language Models?

Feb 12, 2024
Via arXiv