Michael R. Lyu

Enhancing LLM-Based Coding Tools through Native Integration of IDE-Derived Static Context

Feb 06, 2024

MTAD: Tools and Benchmarks for Multivariate Time Series Anomaly Detection

Jan 10, 2024

The Earth is Flat? Unveiling Factual Errors in Large Language Models

Jan 01, 2024

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

Jan 01, 2024

New Job, New Gender? Measuring the Social Bias in Image Generation Models

Jan 01, 2024

Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models

Oct 19, 2023

Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench

Oct 02, 2023

All Languages Matter: On the Multilingual Safety of Large Language Models

Oct 02, 2023

Practical Anomaly Detection over Multivariate Monitoring Metrics for Online Services

Aug 19, 2023

An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software

Aug 18, 2023