Alert button
Picture for Martin Riddell

Martin Riddell

Alert button

Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models

Add code
Bookmark button
Alert button
Mar 06, 2024
Martin Riddell, Ansong Ni, Arman Cohan

Figure 1 for Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Figure 2 for Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Figure 3 for Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Figure 4 for Quantifying Contamination in Evaluating Code Generation Capabilities of Language Models
Viaarxiv icon

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Add code
Bookmark button
Alert button
Oct 02, 2023
Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan

Figure 1 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 2 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 3 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 4 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Viaarxiv icon

FOLIO: Natural Language Reasoning with First-Order Logic

Add code
Bookmark button
Alert button
Sep 02, 2022
Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, Dragomir Radev

Figure 1 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 2 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 3 for FOLIO: Natural Language Reasoning with First-Order Logic
Figure 4 for FOLIO: Natural Language Reasoning with First-Order Logic
Viaarxiv icon