Fangyu Lei

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024
Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu

Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent

Mar 01, 2024
Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu

MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models

Feb 20, 2024
Tongxu Luo, Jiahe Lei, Fangyu Lei, Weihao Liu, Shizhu He, Jun Zhao, Kang Liu

Competition-Level Problems are Effective LLM Evaluators

Dec 05, 2023
Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, Yelong Shen, Chen Lin, Nan Duan, Weizhu Chen

Assessing Knowledge Editing in Language Models via Relation Perspective

Nov 15, 2023
Yifan Wei, Xiaoyan Yu, Huanhuan Ma, Fangyu Lei, Yixuan Weng, Ran Song, Kang Liu

S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Models

Oct 23, 2023
Fangyu Lei, Qian Liu, Yiming Huang, Shizhu He, Jun Zhao, Kang Liu

TableQAKit: A Comprehensive and Practical Toolkit for Table-based Question Answering

Oct 23, 2023
Fangyu Lei, Tongxu Luo, Pengqi Yang, Weihao Liu, Hanwen Liu, Jiahe Lei, Yiming Huang, Yifan Wei, Shizhu He, Jun Zhao, Kang Liu

MenatQA: A New Dataset for Testing the Temporal Comprehension and Reasoning Abilities of Large Language Models

Oct 08, 2023
Yifan Wei, Yisong Su, Huanhuan Ma, Xiaoyan Yu, Fangyu Lei, Yuanzhe Zhang, Jun Zhao, Kang Liu

HRoT: Hybrid prompt strategy and Retrieval of Thought for Table-Text Hybrid Question Answering

Sep 22, 2023
Tongxu Luo, Fangyu Lei, Jiahe Lei, Weihao Liu, Shizhu He, Jun Zhao, Kang Liu
