Faeze Brahman

Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement

Jul 25, 2024

How to Train Your Fact Verifier: Knowledge Transfer with Multimodal Open Models

Jun 29, 2024

WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models

Jun 26, 2024

WildBench: Benchmarking LLMs with Challenging Tasks from Real Users in the Wild

Jun 07, 2024

Information-Theoretic Distillation for Reference-less Summarization

Mar 20, 2024

One Size Does Not Fit All: Customizing Open-Domain Procedures

Nov 16, 2023

MacGyver: Are Large Language Models Creative Problem Solvers?

Nov 16, 2023

UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations

Nov 14, 2023

In Search of the Long-Tail: Systematic Generation of Long-Tail Knowledge via Logical Rule Guided Search

Nov 13, 2023

STEER: Unified Style Transfer with Expert Reinforcement

Nov 13, 2023