Arman Cohan

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Oct 02, 2023
Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan

Via arXiv

Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?

Sep 19, 2023
Xiangru Tang, Yiming Zong, Jason Phang, Yilun Zhao, Wangchunshu Zhou, Arman Cohan, Mark Gerstein

Via arXiv

ODSum: New Benchmarks for Open Domain Multi-Document Summarization

Sep 16, 2023
Yijie Zhou, Kejian Shi, Wencai Zhang, Yixin Liu, Yilun Zhao, Arman Cohan

Via arXiv

When do Generative Query and Document Expansions Fail? A Comprehensive Study Across Methods, Retrievers, and Datasets

Sep 15, 2023
Orion Weller, Kyle Lo, David Wadden, Dawn Lawrie, Benjamin Van Durme, Arman Cohan, Luca Soldaini

Via arXiv

Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering

May 24, 2023
Avi Caciularu, Matthew E. Peters, Jacob Goldberger, Ido Dagan, Arman Cohan

Via arXiv

Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers

May 24, 2023
Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan

Via arXiv

A Controllable QA-based Framework for Decontextualization

May 24, 2023
Benjamin Newman, Luca Soldaini, Raymond Fok, Arman Cohan, Kyle Lo

Via arXiv

QTSumm: A New Benchmark for Query-Focused Table Summarization

May 23, 2023
Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Xiangru Tang, Yumo Xu, Arman Cohan, Dragomir Radev

Via arXiv

On Learning to Summarize with Large Language Models as References

May 23, 2023
Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Dragomir Radev, Arman Cohan

Via arXiv

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies

May 21, 2023
Linyong Nan, Yilun Zhao, Weijin Zou, Narutatsu Ri, Jaesung Tae, Ellen Zhang, Arman Cohan, Dragomir Radev

Via arXiv