Picture for Iryna Gurevych

Iryna Gurevych

A Course Shared Task on Evaluating LLM Output for Clinical Questions

Add code
Jul 31, 2024
Figure 1 for A Course Shared Task on Evaluating LLM Output for Clinical Questions
Figure 2 for A Course Shared Task on Evaluating LLM Output for Clinical Questions
Figure 3 for A Course Shared Task on Evaluating LLM Output for Clinical Questions
Viaarxiv icon

Overview of PerpectiveArg2024: The First Shared Task on Perspective Argument Retrieval

Add code
Jul 29, 2024
Figure 1 for Overview of PerpectiveArg2024: The First Shared Task on Perspective Argument Retrieval
Figure 2 for Overview of PerpectiveArg2024: The First Shared Task on Perspective Argument Retrieval
Figure 3 for Overview of PerpectiveArg2024: The First Shared Task on Perspective Argument Retrieval
Figure 4 for Overview of PerpectiveArg2024: The First Shared Task on Perspective Argument Retrieval
Viaarxiv icon

Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment

Add code
Jul 20, 2024
Figure 1 for Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Figure 2 for Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Figure 3 for Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Figure 4 for Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
Viaarxiv icon

$\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics

Add code
Jul 17, 2024
Figure 1 for $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Figure 2 for $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Figure 3 for $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Figure 4 for $\textit{GeoHard}$: Towards Measuring Class-wise Hardness through Modelling Class Semantics
Viaarxiv icon

InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback

Add code
Jul 16, 2024
Figure 1 for InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback
Figure 2 for InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback
Figure 3 for InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback
Figure 4 for InferAct: Inferring Safe Actions for LLM-Based Agents Through Preemptive Evaluation and Human Feedback
Viaarxiv icon

Robust Utility-Preserving Text Anonymization Based on Large Language Models

Add code
Jul 16, 2024
Viaarxiv icon

$\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity

Add code
Jul 15, 2024
Figure 1 for $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity
Figure 2 for $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity
Figure 3 for $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity
Figure 4 for $\texttt{MixGR}$: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity
Viaarxiv icon

Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors

Add code
Jul 12, 2024
Viaarxiv icon

HDT: Hierarchical Document Transformer

Add code
Jul 11, 2024
Figure 1 for HDT: Hierarchical Document Transformer
Figure 2 for HDT: Hierarchical Document Transformer
Figure 3 for HDT: Hierarchical Document Transformer
Figure 4 for HDT: Hierarchical Document Transformer
Viaarxiv icon

Attribute or Abstain: Large Language Models as Long Document Assistants

Add code
Jul 10, 2024
Figure 1 for Attribute or Abstain: Large Language Models as Long Document Assistants
Figure 2 for Attribute or Abstain: Large Language Models as Long Document Assistants
Figure 3 for Attribute or Abstain: Large Language Models as Long Document Assistants
Figure 4 for Attribute or Abstain: Large Language Models as Long Document Assistants
Viaarxiv icon