Alert button
Picture for Wenxuan Wang

Wenxuan Wang

Alert button

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments

Add code
Bookmark button
Alert button
Mar 18, 2024
Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang, Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu

Figure 1 for How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Figure 2 for How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Figure 3 for How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Figure 4 for How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
Viaarxiv icon

Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with Human Intentions

Add code
Bookmark button
Alert button
Feb 17, 2024
Wenxuan Wang, Yisi Zhang, Xingjian He, Yichen Yan, Zijia Zhao, Xinlong Wang, Jing Liu

Viaarxiv icon

Asclepius: A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models

Add code
Bookmark button
Alert button
Feb 17, 2024
Wenxuan Wang, Yihang Su, Jingyuan Huan, Jie Liu, Wenting Chen, Yudi Zhang, Cheng-Yi Li, Kao-Jung Chang, Xiaohan Xin, Linlin Shen, Michael R. Lyu

Viaarxiv icon

New Job, New Gender? Measuring the Social Bias in Image Generation Models

Add code
Bookmark button
Alert button
Jan 01, 2024
Wenxuan Wang, Haonan Bai, Jen-tse Huang, Yuxuan Wan, Youliang Yuan, Haoyi Qiu, Nanyun Peng, Michael R. Lyu

Viaarxiv icon

The Earth is Flat? Unveiling Factual Errors in Large Language Models

Add code
Bookmark button
Alert button
Jan 01, 2024
Wenxuan Wang, Juluan Shi, Zhaopeng Tu, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu

Viaarxiv icon

A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models

Add code
Bookmark button
Alert button
Jan 01, 2024
Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu

Viaarxiv icon

Unveiling Parts Beyond Objects:Towards Finer-Granularity Referring Expression Segmentation

Add code
Bookmark button
Alert button
Dec 13, 2023
Wenxuan Wang, Tongtian Yue, Yisi Zhang, Longteng Guo, Xingjian He, Xinlong Wang, Jing Liu

Viaarxiv icon

Training Multi-layer Neural Networks on Ising Machine

Add code
Bookmark button
Alert button
Nov 06, 2023
Xujie Song, Tong Liu, Shengbo Eben Li, Jingliang Duan, Wenxuan Wang, Keqiang Li

Viaarxiv icon

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

Add code
Bookmark button
Alert button
Nov 06, 2023
Tian Liang, Zhiwei He, Jen-tse Huang, Wenxuan Wang, Wenxiang Jiao, Rui Wang, Yujiu Yang, Zhaopeng Tu, Shuming Shi, Xing Wang

Figure 1 for Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
Figure 2 for Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
Figure 3 for Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
Figure 4 for Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
Viaarxiv icon

LFAA: Crafting Transferable Targeted Adversarial Examples with Low-Frequency Perturbations

Add code
Bookmark button
Alert button
Nov 01, 2023
Kunyu Wang, Juluan Shi, Wenxuan Wang

Viaarxiv icon