Alert button
Picture for Michael R. Lyu

Michael R. Lyu

Alert button

Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models

Add code
Bookmark button
Alert button
Oct 19, 2023
Wenxuan Wang, Wenxiang Jiao, Jingyuan Huang, Ruyi Dai, Jen-tse Huang, Zhaopeng Tu, Michael R. Lyu

Viaarxiv icon

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

Add code
Bookmark button
Alert button
Oct 02, 2023
Jen-tse Huang, Wenxuan Wang, Eric John Li, Man Ho Lam, Shujie Ren, Youliang Yuan, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu

Figure 1 for Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
Figure 2 for Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
Figure 3 for Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
Figure 4 for Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
Viaarxiv icon

All Languages Matter: On the Multilingual Safety of Large Language Models

Add code
Bookmark button
Alert button
Oct 02, 2023
Wenxuan Wang, Zhaopeng Tu, Chang Chen, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu

Figure 1 for All Languages Matter: On the Multilingual Safety of Large Language Models
Figure 2 for All Languages Matter: On the Multilingual Safety of Large Language Models
Figure 3 for All Languages Matter: On the Multilingual Safety of Large Language Models
Figure 4 for All Languages Matter: On the Multilingual Safety of Large Language Models
Viaarxiv icon

Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services

Add code
Bookmark button
Alert button
Aug 19, 2023
Jinyang Liu, Tianyi Yang, Zhuangbin Chen, Yuxin Su, Cong Feng, Zengyin Yang, Michael R. Lyu

Figure 1 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 2 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 3 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Figure 4 for Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services
Viaarxiv icon

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

Add code
Bookmark button
Alert button
Aug 18, 2023
Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu

Figure 1 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 2 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 3 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 4 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Viaarxiv icon

VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control

Add code
Bookmark button
Alert button
Aug 18, 2023
Zi-Yuan Hu, Yanyang Li, Michael R. Lyu, Liwei Wang

Viaarxiv icon

CLEVA: Chinese Language Models EVAluation Platform

Add code
Bookmark button
Alert button
Aug 09, 2023
Yanyang Li, Jianqiao Zhao, Duo Zheng, Zi-Yuan Hu, Zhi Chen, Xiaohui Su, Yongfeng Huang, Shijia Huang, Dahua Lin, Michael R. Lyu, Liwei Wang

Figure 1 for CLEVA: Chinese Language Models EVAluation Platform
Figure 2 for CLEVA: Chinese Language Models EVAluation Platform
Figure 3 for CLEVA: Chinese Language Models EVAluation Platform
Figure 4 for CLEVA: Chinese Language Models EVAluation Platform
Viaarxiv icon

Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench

Add code
Bookmark button
Alert button
Aug 07, 2023
Jen-tse Huang, Man Ho Lam, Eric John Li, Shujie Ren, Wenxuan Wang, Wenxiang Jiao, Zhaopeng Tu, Michael R. Lyu

Figure 1 for Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Figure 2 for Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Figure 3 for Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Figure 4 for Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
Viaarxiv icon

On the Robustness of Latent Diffusion Models

Add code
Bookmark button
Alert button
Jun 14, 2023
Jianping Zhang, Zhuoer Xu, Shiwen Cui, Changhua Meng, Weibin Wu, Michael R. Lyu

Figure 1 for On the Robustness of Latent Diffusion Models
Figure 2 for On the Robustness of Latent Diffusion Models
Figure 3 for On the Robustness of Latent Diffusion Models
Figure 4 for On the Robustness of Latent Diffusion Models
Viaarxiv icon

Scalable and Adaptive Log-based Anomaly Detection with Expert in the Loop

Add code
Bookmark button
Alert button
Jun 08, 2023
Jinyang Liu, Junjie Huang, Yintong Huo, Zhihan Jiang, Jiazhen Gu, Zhuangbin Chen, Cong Feng, Minzhi Yan, Michael R. Lyu

Figure 1 for Scalable and Adaptive Log-based Anomaly Detection with Expert in the Loop
Figure 2 for Scalable and Adaptive Log-based Anomaly Detection with Expert in the Loop
Figure 3 for Scalable and Adaptive Log-based Anomaly Detection with Expert in the Loop
Figure 4 for Scalable and Adaptive Log-based Anomaly Detection with Expert in the Loop
Viaarxiv icon