Alert button
Picture for Pinjia He

Pinjia He

Alert button

Aligning LLMs for FL-free Program Repair

Add code
Bookmark button
Alert button
Apr 13, 2024
Junjielong Xu, Ying Fu, Shin Hwei Tan, Pinjia He

Viaarxiv icon

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

Add code
Bookmark button
Alert button
Jan 01, 2024
Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

Viaarxiv icon

Retromorphic Testing: A New Approach to the Test Oracle Problem

Add code
Bookmark button
Alert button
Oct 10, 2023
Boxi Yu, Qiuyang Mang, Qingshuo Guo, Pinjia He

Viaarxiv icon

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

Add code
Bookmark button
Alert button
Aug 18, 2023
Wenxuan Wang, Jingyuan Huang, Jen-tse Huang, Chang Chen, Jiazhen Gu, Pinjia He, Michael R. Lyu

Figure 1 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 2 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 3 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Figure 4 for An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
Viaarxiv icon

Automated Testing and Improvement of Named Entity Recognition Systems

Add code
Bookmark button
Alert button
Aug 14, 2023
Boxi Yu, Yiyan Hu, Qiuyang Mang, Wenhan Hu, Pinjia He

Figure 1 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 2 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 3 for Automated Testing and Improvement of Named Entity Recognition Systems
Figure 4 for Automated Testing and Improvement of Named Entity Recognition Systems
Viaarxiv icon

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

Add code
Bookmark button
Alert button
Aug 12, 2023
Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Pinjia He, Shuming Shi, Zhaopeng Tu

Figure 1 for GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Figure 2 for GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Figure 3 for GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Figure 4 for GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Viaarxiv icon

Validating Multimedia Content Moderation Software via Semantic Fusion

Add code
Bookmark button
Alert button
May 23, 2023
Wenxuan Wang, Jingyuan Huang, Chang Chen, Jiazhen Gu, Jianping Zhang, Weibin Wu, Pinjia He, Michael Lyu

Figure 1 for Validating Multimedia Content Moderation Software via Semantic Fusion
Figure 2 for Validating Multimedia Content Moderation Software via Semantic Fusion
Figure 3 for Validating Multimedia Content Moderation Software via Semantic Fusion
Figure 4 for Validating Multimedia Content Moderation Software via Semantic Fusion
Viaarxiv icon

BiasAsker: Measuring the Bias in Conversational AI System

Add code
Bookmark button
Alert button
May 21, 2023
Yuxuan Wan, Wenxuan Wang, Pinjia He, Jiazhen Gu, Haonan Bai, Michael Lyu

Figure 1 for BiasAsker: Measuring the Bias in Conversational AI System
Figure 2 for BiasAsker: Measuring the Bias in Conversational AI System
Figure 3 for BiasAsker: Measuring the Bias in Conversational AI System
Figure 4 for BiasAsker: Measuring the Bias in Conversational AI System
Viaarxiv icon

MTTM: Metamorphic Testing for Textual Content Moderation Software

Add code
Bookmark button
Alert button
Feb 11, 2023
Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

Figure 1 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 2 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 3 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Figure 4 for MTTM: Metamorphic Testing for Textual Content Moderation Software
Viaarxiv icon

AEON: A Method for Automatic Evaluation of NLP Test Cases

Add code
Bookmark button
Alert button
May 13, 2022
Jen-tse Huang, Jianping Zhang, Wenxuan Wang, Pinjia He, Yuxin Su, Michael R. Lyu

Figure 1 for AEON: A Method for Automatic Evaluation of NLP Test Cases
Figure 2 for AEON: A Method for Automatic Evaluation of NLP Test Cases
Figure 3 for AEON: A Method for Automatic Evaluation of NLP Test Cases
Figure 4 for AEON: A Method for Automatic Evaluation of NLP Test Cases
Viaarxiv icon