Picture for Samson Tan

Samson Tan

MoE Routing Testbed: Studying Expert Specialization and Routing Behavior at Small Scale

Add code
Apr 08, 2026
Viaarxiv icon

Learning to Generate Answers with Citations via Factual Consistency Models

Add code
Jun 19, 2024
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Figure 1 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 2 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 3 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 4 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Viaarxiv icon

Extreme Miscalibration and the Illusion of Adversarial Robustness

Add code
Feb 27, 2024
Viaarxiv icon

Automatic Feature Fairness in Recommendation via Adversaries

Add code
Sep 27, 2023
Figure 1 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 2 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 3 for Automatic Feature Fairness in Recommendation via Adversaries
Figure 4 for Automatic Feature Fairness in Recommendation via Adversaries
Viaarxiv icon

Large Language Models of Code Fail at Completing Code with Potential Bugs

Add code
Jun 06, 2023
Figure 1 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 2 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 3 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Figure 4 for Large Language Models of Code Fail at Completing Code with Potential Bugs
Viaarxiv icon

ReCode: Robustness Evaluation of Code Generation Models

Add code
Dec 20, 2022
Figure 1 for ReCode: Robustness Evaluation of Code Generation Models
Figure 2 for ReCode: Robustness Evaluation of Code Generation Models
Figure 3 for ReCode: Robustness Evaluation of Code Generation Models
Figure 4 for ReCode: Robustness Evaluation of Code Generation Models
Viaarxiv icon

BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems

Add code
Nov 30, 2022
Figure 1 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 2 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 3 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Figure 4 for BotSIM: An End-to-End Bot Simulation Framework for Commercial Task-Oriented Dialog Systems
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Whodunit? Learning to Contrast for Authorship Attribution

Add code
Oct 10, 2022
Figure 1 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 2 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 3 for Whodunit? Learning to Contrast for Authorship Attribution
Figure 4 for Whodunit? Learning to Contrast for Authorship Attribution
Viaarxiv icon