Linyong Nan

DocMath-Eval: Evaluating Numerical Reasoning Capabilities of LLMs in Understanding Long Documents with Tabular Data

Nov 16, 2023
Yilun Zhao, Yitao Long, Hongjun Liu, Linyong Nan, Lyuhao Chen, Ryo Kamoi, Yixin Liu, Xiangru Tang, Rui Zhang, Arman Cohan

On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering

Nov 16, 2023
Linyong Nan, Ellen Zhang, Weijin Zou, Yilun Zhao, Wenfei Zhou, Arman Cohan

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations

Jun 25, 2023
Yilun Zhao, Chen Zhao, Linyong Nan, Zhenting Qi, Wenlin Zhang, Xiangru Tang, Boyu Mi, Dragomir Radev

Large Language Models are Effective Table-to-Text Generators, Evaluators, and Feedback Providers

May 24, 2023
Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan

QTSumm: A New Benchmark for Query-Focused Table Summarization

May 23, 2023
Yilun Zhao, Zhenting Qi, Linyong Nan, Boyu Mi, Yixin Liu, Weijin Zou, Simeng Han, Xiangru Tang, Yumo Xu, Arman Cohan, Dragomir Radev

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies

May 21, 2023
Linyong Nan, Yilun Zhao, Weijin Zou, Narutatsu Ri, Jaesung Tae, Ellen Zhang, Arman Cohan, Dragomir Radev

LoFT: Enhancing Faithfulness and Diversity for Table-to-Text Generation via Logic Form Control

Feb 06, 2023
Yilun Zhao, Zhenting Qi, Linyong Nan, Lorenzo Jaime Yu Flores, Dragomir Radev

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation

Dec 15, 2022
Yixin Liu, Alexander R. Fabbri, Pengfei Liu, Yilun Zhao, Linyong Nan, Ruilin Han, Simeng Han, Shafiq Joty, Chien-Sheng Wu, Caiming Xiong, Dragomir Radev

FOLIO: Natural Language Reasoning with First-Order Logic

Sep 02, 2022
Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Luke Benson, Lucy Sun, Ekaterina Zubova, Yujie Qiao, Matthew Burtell, David Peng, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Shafiq Joty, Alexander R. Fabbri, Wojciech Kryscinski, Xi Victoria Lin, Caiming Xiong, Dragomir Radev

Leveraging Locality in Abstractive Text Summarization

May 25, 2022
Yixin Liu, Ansong Ni, Linyong Nan, Budhaditya Deb, Chenguang Zhu, Ahmed H. Awadallah, Dragomir Radev
