Alert button
Picture for Shunyu Yao

Shunyu Yao

Alert button

NTIRE 2024 Challenge on Short-form UGC Video Quality Assessment: Methods and Results

Add code
Bookmark button
Alert button
Apr 17, 2024
Xin Li, Kun Yuan, Yajing Pei, Yiting Lu, Ming Sun, Chao Zhou, Zhibo Chen, Radu Timofte, Wei Sun, Haoning Wu, Zicheng Zhang, Jun Jia, Zhichao Zhang, Linhan Cao, Qiubo Chen, Xiongkuo Min, Weisi Lin, Guangtao Zhai, Jianhui Sun, Tianyi Wang, Lei Li, Han Kong, Wenxuan Wang, Bing Li, Cheng Luo, Haiqiang Wang, Xiangguang Chen, Wenhui Meng, Xiang Pan, Huiying Shi, Han Zhu, Xiaozhong Xu, Lei Sun, Zhenzhong Chen, Shan Liu, Fangyuan Kong, Haotian Fan, Yifang Xu, Haoran Xu, Mengduo Yang, Jie Zhou, Jiaze Li, Shijie Wen, Mai Xu, Da Li, Shunyu Yao, Jiazhi Du, Wangmeng Zuo, Zhibo Li, Shuai He, Anlong Ming, Huiyuan Fu, Huadong Ma, Yong Wu, Fie Xue, Guozhi Zhao, Lina Du, Jie Guo, Yu Zhang, Huimin Zheng, Junhao Chen, Yue Liu, Dulan Zhou, Kele Xu, Qisheng Xu, Tao Sun, Zhixiang Ding, Yuhang Hu

Viaarxiv icon

Can Language Models Solve Olympiad Programming?

Add code
Bookmark button
Alert button
Apr 16, 2024
Quan Shi, Michael Tang, Karthik Narasimhan, Shunyu Yao

Viaarxiv icon

DevBench: A Comprehensive Benchmark for Software Development

Add code
Bookmark button
Alert button
Mar 15, 2024
Bowen Li, Wenhan Wu, Ziwei Tang, Lin Shi, John Yang, Jinyang Li, Shunyu Yao, Chen Qian, Binyuan Hui, Qicheng Zhang, Zhiyin Yu, He Du, Ping Yang, Dahua Lin, Chao Peng, Kai Chen

Figure 1 for DevBench: A Comprehensive Benchmark for Software Development
Figure 2 for DevBench: A Comprehensive Benchmark for Software Development
Figure 3 for DevBench: A Comprehensive Benchmark for Software Development
Figure 4 for DevBench: A Comprehensive Benchmark for Software Development
Viaarxiv icon

OS-Copilot: Towards Generalist Computer Agents with Self-Improvement

Add code
Bookmark button
Alert button
Feb 15, 2024
Zhiyong Wu, Chengcheng Han, Zichen Ding, Zhenmin Weng, Zhoumianze Liu, Shunyu Yao, Tao Yu, Lingpeng Kong

Viaarxiv icon

Large Language Model for Multi-objective Evolutionary Optimization

Add code
Bookmark button
Alert button
Oct 25, 2023
Fei Liu, Xi Lin, Zhenkun Wang, Shunyu Yao, Xialiang Tong, Mingxuan Yuan, Qingfu Zhang

Figure 1 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 2 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 3 for Large Language Model for Multi-objective Evolutionary Optimization
Figure 4 for Large Language Model for Multi-objective Evolutionary Optimization
Viaarxiv icon

SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

Add code
Bookmark button
Alert button
Oct 10, 2023
Carlos E. Jimenez, John Yang, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan

Viaarxiv icon

FireAct: Toward Language Agent Fine-tuning

Add code
Bookmark button
Alert button
Oct 09, 2023
Baian Chen, Chang Shu, Ehsan Shareghi, Nigel Collier, Karthik Narasimhan, Shunyu Yao

Viaarxiv icon

Embers of Autoregression: Understanding Large Language Models Through the Problem They are Trained to Solve

Add code
Bookmark button
Alert button
Sep 24, 2023
R. Thomas McCoy, Shunyu Yao, Dan Friedman, Matthew Hardy, Thomas L. Griffiths

Viaarxiv icon