Alert button
Picture for Shunyu Yao

Shunyu Yao

Alert button

DevBench: A Comprehensive Benchmark for Software Development

Add code
Bookmark button
Alert button
Mar 15, 2024
Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen

Figure 1 for DevBench: A Comprehensive Benchmark for Software Development
Figure 2 for DevBench: A Comprehensive Benchmark for Software Development
Figure 3 for DevBench: A Comprehensive Benchmark for Software Development
Figure 4 for DevBench: A Comprehensive Benchmark for Software Development
Viaarxiv icon

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Add code
Bookmark button
Alert button
Feb 15, 2024
Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong

Viaarxiv icon

Large Language Model for Multi-objective Evolutionary Optimization

Add code
Bookmark button
Alert button
Oct 25, 2023
Fei Liu, Xi Lin, Zhenkun Wang, Shunyu Yao, Xialiang Tong, Mingxuan Yuan, Qingfu Zhang

Figure 1 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 2 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 3 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 4 for Large Language Model for Multi-objective Evolutionary Optimization
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Bookmark button
Alert button
Oct 10, 2023
Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

Viaarxiv icon

FireAct: Toward Language Agent Fine-tuning

Add code
Bookmark button
Alert button
Oct 09, 2023
Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik Narasimhan, Shunyu Yao

Viaarxiv icon

Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

Add code
Bookmark button
Alert button
Sep 24, 2023
R. Thomas McCoy, Shunyu Yao, Dan Friedman, Matthew Hardy, Thomas L. Griffiths

Viaarxiv icon

Cognitive Architectures for Language Agents

Add code
Bookmark button
Alert button
Sep 05, 2023
Theodore Sumers, Shunyu Yao, Karthik Narasimhan, Thomas L. Griffiths

Viaarxiv icon

COLLIE: Systematic Construction of Constrained Text Generation Tasks

Add code
Bookmark button
Alert button
Jul 17, 2023
Shunyu Yao, Howard Chen, Austin W. Hanjie, Runzhe Yang, Karthik Narasimhan

Figure 1 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 2 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 3 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Figure 4 for COLLIE: Systematic Construction of Constrained Text Generation Tasks
Viaarxiv icon