Alert button
Picture for Yeganeh Kordi

Yeganeh Kordi

Alert button

Tur[k]ingBench: A Challenge Benchmark for Web Agents

Add code
Bookmark button
Alert button
Mar 21, 2024
Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi

Figure 1 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 2 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 3 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 4 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Viaarxiv icon

Self-Instruct: Aligning Language Model with Self Generated Instructions

Add code
Bookmark button
Alert button
Dec 20, 2022
Yizhong Wang, Yeganeh Kordi, Swaroop Mishra, Alisa Liu, Noah A. Smith, Daniel Khashabi, Hannaneh Hajishirzi

Figure 1 for Self-Instruct: Aligning Language Model with Self Generated Instructions
Figure 2 for Self-Instruct: Aligning Language Model with Self Generated Instructions
Figure 3 for Self-Instruct: Aligning Language Model with Self Generated Instructions
Figure 4 for Self-Instruct: Aligning Language Model with Self Generated Instructions
Viaarxiv icon

Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks

Add code
Bookmark button
Alert button
Apr 16, 2022
Yizhong Wang, Swaroop Mishra, Pegah Alipoormolabashi, Yeganeh Kordi, Amirreza Mirzaei, Anjana Arunkumar, Arjun Ashok, Arut Selvan Dhanasekaran, Atharva Naik, David Stap, Eshaan Pathak, Giannis Karamanolakis, Haizhi Gary Lai, Ishan Purohit, Ishani Mondal, Jacob Anderson, Kirby Kuznia, Krima Doshi, Maitreya Patel, Kuntal Kumar Pal, Mehrad Moradshahi, Mihir Parmar, Mirali Purohit, Neeraj Varshney, Phani Rohitha Kaza, Pulkit Verma, Ravsehaj Singh Puri, Rushang Karia, Shailaja Keyur Sampat, Savan Doshi, Siddhartha Mishra, Sujan Reddy, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Hannaneh Hajishirzi, Noah A. Smith, Daniel Khashabi

Figure 1 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 2 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 3 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Figure 4 for Benchmarking Generalization via In-Context Instructions on 1,600+ Language Tasks
Viaarxiv icon

UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training

Add code
Bookmark button
Alert button
Feb 23, 2022
Daniel Khashabi, Yeganeh Kordi, Hannaneh Hajishirzi

Figure 1 for UnifiedQA-v2: Stronger Generalization via Broader Cross-Format Training
Viaarxiv icon