Picture for Dragomir Radev

Dragomir Radev

modeLing: A Novel Dataset for Testing Linguistic Reasoning in Language Models

Add code
Jun 24, 2024
Viaarxiv icon

MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing

Add code
Nov 28, 2023
Figure 1 for MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing
Figure 2 for MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing
Figure 3 for MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing
Figure 4 for MedGen: A Python Natural Language Processing Toolkit for Medical Text Processing
Viaarxiv icon

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization

Add code
Nov 15, 2023
Figure 1 for Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Figure 2 for Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Figure 3 for Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Figure 4 for Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization
Viaarxiv icon

Fair Abstractive Summarization of Diverse Perspectives

Add code
Nov 14, 2023
Figure 1 for Fair Abstractive Summarization of Diverse Perspectives
Figure 2 for Fair Abstractive Summarization of Diverse Perspectives
Figure 3 for Fair Abstractive Summarization of Diverse Perspectives
Figure 4 for Fair Abstractive Summarization of Diverse Perspectives
Viaarxiv icon

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models

Add code
Oct 02, 2023
Figure 1 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 2 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 3 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Figure 4 for L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models
Viaarxiv icon

RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations

Add code
Jun 25, 2023
Figure 1 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 2 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 3 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Figure 4 for RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Viaarxiv icon

bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark

Add code
Jun 07, 2023
Figure 1 for bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Figure 2 for bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Figure 3 for bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Figure 4 for bgGLUE: A Bulgarian General Language Understanding Evaluation Benchmark
Viaarxiv icon

QTSumm: A New Benchmark for Query-Focused Table Summarization

Add code
May 23, 2023
Figure 1 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 2 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 3 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Figure 4 for QTSumm: A New Benchmark for Query-Focused Table Summarization
Viaarxiv icon

On Learning to Summarize with Large Language Models as References

Add code
May 23, 2023
Figure 1 for On Learning to Summarize with Large Language Models as References
Figure 2 for On Learning to Summarize with Large Language Models as References
Figure 3 for On Learning to Summarize with Large Language Models as References
Figure 4 for On Learning to Summarize with Large Language Models as References
Viaarxiv icon

Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies

Add code
May 21, 2023
Figure 1 for Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Figure 2 for Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Figure 3 for Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Figure 4 for Enhancing Few-shot Text-to-SQL Capabilities of Large Language Models: A Study on Prompt Design Strategies
Viaarxiv icon