Picture for Yulia Tsvetkov

Yulia Tsvetkov

Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

Add code
Jun 22, 2024
Viaarxiv icon

Teaching LLMs to Abstain across Languages via Multilingual Feedback

Add code
Jun 22, 2024
Figure 1 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 2 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 3 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Figure 4 for Teaching LLMs to Abstain across Languages via Multilingual Feedback
Viaarxiv icon

MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning

Add code
Jun 04, 2024
Figure 1 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 2 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 3 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Figure 4 for MEDIQ: Question-Asking LLMs for Adaptive and Reliable Clinical Reasoning
Viaarxiv icon

Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically

Add code
Apr 25, 2024
Viaarxiv icon

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Add code
Apr 10, 2024
Figure 1 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 2 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 3 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Figure 4 for CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
Viaarxiv icon

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Add code
Mar 16, 2024
Figure 1 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 2 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 3 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Figure 4 for DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages
Viaarxiv icon

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Add code
Mar 05, 2024
Figure 1 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 2 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 3 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 4 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Viaarxiv icon

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Add code
Feb 27, 2024
Figure 1 for Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Figure 2 for Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Figure 3 for Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Figure 4 for Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers
Viaarxiv icon

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Add code
Feb 18, 2024
Viaarxiv icon

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Add code
Feb 16, 2024
Figure 1 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 2 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 3 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Figure 4 for DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection
Viaarxiv icon