Picture for Jon Burnsky

Jon Burnsky

Robust Multimodal Safety via Conditional Decoding

Add code
Mar 31, 2026
Viaarxiv icon

TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes

Add code
Mar 26, 2025
Figure 1 for TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
Figure 2 for TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
Figure 3 for TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
Figure 4 for TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes
Viaarxiv icon

TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Add code
Feb 20, 2024
Figure 1 for TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Figure 2 for TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Figure 3 for TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Figure 4 for TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization
Viaarxiv icon