Picture for Michael R. Lyu

Michael R. Lyu

The Earth is Flat? Unveiling Factual Errors in Large Language Models

Add code
Jan 01, 2024
Viaarxiv icon

New Job, New Gender? Measuring the Social Bias in Image Generation Models

Add code
Jan 01, 2024
Viaarxiv icon

Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models

Add code
Oct 19, 2023
Figure 1 for Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Figure 2 for Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Figure 3 for Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Figure 4 for Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
Viaarxiv icon

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

Add code
Oct 02, 2023
Viaarxiv icon

All Languages Matter: On the Multilingual Safety of Large Language Models

Add code
Oct 02, 2023
Viaarxiv icon

Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services

Add code
Aug 19, 2023
Figure 1 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 2 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 3 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 4 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Viaarxiv icon

VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control

Add code
Aug 18, 2023
Viaarxiv icon

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

Add code
Aug 18, 2023
Figure 1 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 2 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 3 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 4 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Viaarxiv icon

CLEVA: Chinese Language Models EVAluation Platform

Add code
Aug 09, 2023
Figure 1 for CLEVA: Chinese Language Models EVAluation Platform
Figure 2 for CLEVA: Chinese Language Models EVAluation Platform
Figure 3 for CLEVA: Chinese Language Models EVAluation Platform
Figure 4 for CLEVA: Chinese Language Models EVAluation Platform
Viaarxiv icon

Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench

Add code
Aug 07, 2023
Viaarxiv icon