Picture for Anya Belz

Anya Belz

The QCET Taxonomy of Standard Quality Criterion Names and Definitions for the Evaluation of NLP Systems

Add code
Sep 26, 2025
Viaarxiv icon

Enhancing Study-Level Inference from Clinical Trial Papers via RL-based Numeric Reasoning

Add code
May 28, 2025
Viaarxiv icon

Query-driven Document-level Scientific Evidence Extraction from Biomedical Studies

Add code
May 09, 2025
Viaarxiv icon

HEDS 3.0: The Human Evaluation Data Sheet Version 3.0

Add code
Dec 10, 2024
Viaarxiv icon

Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques

Add code
May 13, 2024
Figure 1 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 2 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 3 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Figure 4 for Reproducing the Metric-Based Evaluation of a Set of Controllable Text Generation Techniques
Viaarxiv icon

High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models

Add code
Feb 19, 2024
Figure 1 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 2 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 3 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Figure 4 for High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language Models
Viaarxiv icon

Assessing the Portability of Parameter Matrices Trained by Parameter-Efficient Finetuning Methods

Add code
Jan 25, 2024
Viaarxiv icon

Data-to-text Generation for Severely Under-Resourced Languages with GPT-3.5: A Bit of Help Needed from Google Translate

Add code
Aug 19, 2023
Viaarxiv icon

Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

Add code
May 02, 2023
Figure 1 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 2 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 3 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Figure 4 for Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP
Viaarxiv icon

PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques

Add code
Apr 24, 2023
Figure 1 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 2 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 3 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Figure 4 for PEFT-Ref: A Modular Reference Architecture and Typology for Parameter-Efficient Finetuning Techniques
Viaarxiv icon