Picture for Zachary C. Lipton

Zachary C. Lipton

Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends

Add code
Jun 05, 2024
Figure 1 for Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Figure 2 for Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Figure 3 for Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Figure 4 for Analyzing LLM Behavior in Dialogue Summarization: Unveiling Circumstantial Hallucination Trends
Viaarxiv icon

Rethinking LLM Memorization through the Lens of Adversarial Compression

Add code
Apr 23, 2024
Figure 1 for Rethinking LLM Memorization through the Lens of Adversarial Compression
Figure 2 for Rethinking LLM Memorization through the Lens of Adversarial Compression
Figure 3 for Rethinking LLM Memorization through the Lens of Adversarial Compression
Figure 4 for Rethinking LLM Memorization through the Lens of Adversarial Compression
Viaarxiv icon

Scaling Laws for Data Filtering -- Data Curation cannot be Compute Agnostic

Add code
Apr 10, 2024
Viaarxiv icon

Auditing Fairness under Unobserved Confounding

Add code
Mar 18, 2024
Figure 1 for Auditing Fairness under Unobserved Confounding
Figure 2 for Auditing Fairness under Unobserved Confounding
Figure 3 for Auditing Fairness under Unobserved Confounding
Figure 4 for Auditing Fairness under Unobserved Confounding
Viaarxiv icon

GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence

Add code
Feb 19, 2024
Figure 1 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 2 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 3 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Figure 4 for GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Viaarxiv icon

Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing

Add code
Feb 12, 2024
Figure 1 for Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Figure 2 for Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Figure 3 for Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Figure 4 for Beyond the Mud: Datasets and Benchmarks for Computer Vision in Off-Road Racing
Viaarxiv icon

Contrastive Multiple Instance Learning for Weakly Supervised Person ReID

Add code
Feb 12, 2024
Figure 1 for Contrastive Multiple Instance Learning for Weakly Supervised Person ReID
Figure 2 for Contrastive Multiple Instance Learning for Weakly Supervised Person ReID
Figure 3 for Contrastive Multiple Instance Learning for Weakly Supervised Person ReID
Figure 4 for Contrastive Multiple Instance Learning for Weakly Supervised Person ReID
Viaarxiv icon

Personalized Language Modeling from Personalized Human Feedback

Add code
Feb 06, 2024
Viaarxiv icon

Red-Teaming for Generative AI: Silver Bullet or Security Theater?

Add code
Jan 29, 2024
Viaarxiv icon

The Impact of Differential Feature Under-reporting on Algorithmic Fairness

Add code
Jan 16, 2024
Figure 1 for The Impact of Differential Feature Under-reporting on Algorithmic Fairness
Figure 2 for The Impact of Differential Feature Under-reporting on Algorithmic Fairness
Figure 3 for The Impact of Differential Feature Under-reporting on Algorithmic Fairness
Figure 4 for The Impact of Differential Feature Under-reporting on Algorithmic Fairness
Viaarxiv icon