Alert button
Picture for Yizhong Wang

Yizhong Wang

Alert button

Tur[k]ingBench: A Challenge Benchmark for Web Agents

Add code
Bookmark button
Alert button
Mar 21, 2024
Kevin Xu, Yeganeh Kordi, Kate Sanders, Yizhong Wang, Adam Byerly, Jack Zhang, Benjamin Van Durme, Daniel Khashabi

Figure 1 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 2 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 3 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Figure 4 for Tur[k]ingBench: A Challenge Benchmark for Web Agents
Viaarxiv icon

Third-Party Language Model Performance Prediction from Instruction

Add code
Bookmark button
Alert button
Mar 19, 2024
Rahul Nadkarni, Yizhong Wang, Noah A. Smith

Figure 1 for Third-Party Language Model Performance Prediction from Instruction
Figure 2 for Third-Party Language Model Performance Prediction from Instruction
Figure 3 for Third-Party Language Model Performance Prediction from Instruction
Figure 4 for Third-Party Language Model Performance Prediction from Instruction
Viaarxiv icon

Set the Clock: Temporal Alignment of Pretrained Language Models

Add code
Bookmark button
Alert button
Feb 26, 2024
Bowen Zhao, Zander Brumbaugh, Yizhong Wang, Hannaneh Hajishirzi, Noah A. Smith

Viaarxiv icon

Can Language Models Act as Knowledge Bases at Scale?

Add code
Bookmark button
Alert button
Feb 22, 2024
Qiyuan He, Yizhong Wang, Wenya Wang

Viaarxiv icon

OLMo: Accelerating the Science of Language Models

Add code
Bookmark button
Alert button
Feb 07, 2024
Dirk Groeneveld, Iz Beltagy, Pete Walsh, Akshita Bhagia, Rodney Kinney, Oyvind Tafjord, Ananya Harsh Jha, Hamish Ivison, Ian Magnusson, Yizhong Wang, Shane Arora, David Atkinson, Russell Authur, Khyathi Raghavi Chandu, Arman Cohan, Jennifer Dumas, Yanai Elazar, Yuling Gu, Jack Hessel, Tushar Khot, William Merrill, Jacob Morrison, Niklas Muennighoff, Aakanksha Naik, Crystal Nam, Matthew E. Peters, Valentina Pyatkin, Abhilasha Ravichander, Dustin Schwenk, Saurabh Shah, Will Smith, Emma Strubell, Nishant Subramani, Mitchell Wortsman, Pradeep Dasigi, Nathan Lambert, Kyle Richardson, Luke Zettlemoyer, Jesse Dodge, Kyle Lo, Luca Soldaini, Noah A. Smith, Hannaneh Hajishirzi

Viaarxiv icon

Fine-grained Hallucination Detection and Editing for Language Models

Add code
Bookmark button
Alert button
Jan 17, 2024
Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi

Viaarxiv icon

Tuning Language Models by Proxy

Add code
Bookmark button
Alert button
Jan 16, 2024
Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith

Viaarxiv icon

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Add code
Bookmark button
Alert button
Nov 20, 2023
Hamish Ivison, Yizhong Wang, Valentina Pyatkin, Nathan Lambert, Matthew Peters, Pradeep Dasigi, Joel Jang, David Wadden, Noah A. Smith, Iz Beltagy, Hannaneh Hajishirzi

Figure 1 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 2 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 3 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Figure 4 for Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Viaarxiv icon