Picture for Samson Tan

Samson Tan

Learning to Generate Answers with Citations via Factual Consistency Models

Add code
Jun 19, 2024
Figure 1 for Learning to Generate Answers with Citations via Factual Consistency Models
Figure 2 for Learning to Generate Answers with Citations via Factual Consistency Models
Figure 3 for Learning to Generate Answers with Citations via Factual Consistency Models
Figure 4 for Learning to Generate Answers with Citations via Factual Consistency Models
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Figure 1 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 2 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 3 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 4 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Viaarxiv icon

Extreme Miscalibration and the Illusion of Adversarial Robustness

Add code
Feb 27, 2024
Figure 1 for Extreme Miscalibration and the Illusion of Adversarial Robustness
Figure 2 for Extreme Miscalibration and the Illusion of Adversarial Robustness
Figure 3 for Extreme Miscalibration and the Illusion of Adversarial Robustness
Figure 4 for Extreme Miscalibration and the Illusion of Adversarial Robustness
Viaarxiv icon

Automatic Feature Fairness in Recommendation via Adversaries

Add code
Sep 27, 2023
Figure 1 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 2 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 3 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 4 for Automatic Feature Fairness in Recommendation via Adversaries
Viaarxiv icon

Large Language Models of Code Fail at Completing Code with Potential Bugs

Add code
Jun 06, 2023
Figure 1 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 2 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 3 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 4 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Viaarxiv icon

ReCode: Robustness Evaluation of Code Generation Models

Add code
Dec 20, 2022
Figure 1 for ReCode: Robustness Evaluation of Code Generation Models
Figure 2 for ReCode: Robustness Evaluation of Code Generation Models
Figure 3 for ReCode: Robustness Evaluation of Code Generation Models
Figure 4 for ReCode: Robustness Evaluation of Code Generation Models
Viaarxiv icon

BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems

Add code
Nov 30, 2022
Figure 1 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 2 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 3 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 4 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Whodunit? Learning to Contrast for Authorship Attribution

Add code
Oct 10, 2022
Figure 1 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 2 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 3 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 4 for Whodunit? Learning to Contrast for Authorship Attribution
Viaarxiv icon

The Risks of Machine Learning Systems

Add code
Apr 21, 2022
Figure 1 for The Risks of Machine Learning Systems
Figure 2 for The Risks of Machine Learning Systems
Figure 3 for The Risks of Machine Learning Systems
Viaarxiv icon