Alert button
Picture for Tu Trinh

Tu Trinh

Alert button

Softmax Probabilities (Mostly) Predict Large Language Model Correctness on Multiple-Choice Q&A

Add code
Bookmark button
Alert button
Feb 20, 2024
Benjamin Plaut, Khanh Nguyen, Tu Trinh

Viaarxiv icon

A StrongREJECT for Empty Jailbreaks

Add code
Bookmark button
Alert button
Feb 15, 2024
Alexandra Souly, Qingyuan Lu, Dillon Bowen, Tu Trinh, Elvis Hsieh, Sana Pandey, Pieter Abbeel, Justin Svegliato, Scott Emmons, Olivia Watkins, Sam Toyer

Viaarxiv icon

Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 29, 2022
Tu Trinh, Daniel S. Brown

Figure 1 for Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Figure 2 for Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Figure 3 for Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Figure 4 for Autonomous Assessment of Demonstration Sufficiency via Bayesian Inverse Reinforcement Learning
Viaarxiv icon

Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving

Add code
Bookmark button
Alert button
Jul 08, 2022
Chenran Li, Tu Trinh, Letian Wang, Changliu Liu, Masayoshi Tomizuka, Wei Zhan

Figure 1 for Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving
Figure 2 for Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving
Figure 3 for Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving
Figure 4 for Efficient Game-Theoretic Planning with Prediction Heuristic for Socially-Compliant Autonomous Driving
Viaarxiv icon