Alert button
Picture for Yulia Tsvetkov

Yulia Tsvetkov

Alert button

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Mar 16, 2024
Fahim Faisal, Orevaoghene Ahia, Aarohi Srivastava, Kabir Ahuja, David Chiang, Yulia Tsvetkov, Antonios Anastasopoulos

Viaarxiv icon

Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Mar 05, 2024
Aly M. Kassem, Omar Mahmoud, Niloofar Mireshghallah, Hyunwoo Kim, Yulia Tsvetkov, Yejin Choi, Sherif Saad, Santu Rana

Figure 1 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 2 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 3 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Figure 4 for Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs
Viaarxiv icon

Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Feb 27, 2024
Roy Xie, Orevaoghene Ahia, Yulia Tsvetkov, Antonios Anastasopoulos

Viaarxiv icon

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Feb 18, 2024
Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, Tianxing He

Viaarxiv icon

DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Feb 16, 2024
Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo

Viaarxiv icon

Do Membership Inference Attacks Work on Large Language Models?

Feb 12, 2024
Michael Duan, Anshuman Suri, Niloofar Mireshghallah, Sewon Min, Weijia Shi, Luke Zettlemoyer, Yulia Tsvetkov, Yejin Choi, David Evans, Hannaneh Hajishirzi

Viaarxiv icon

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Feb 01, 2024
Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

Viaarxiv icon

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Feb 01, 2024
Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov

Viaarxiv icon

Fine-grained Hallucination Detection and Editing for Language Models

Jan 17, 2024
Abhika Mishra, Akari Asai, Vidhisha Balachandran, Yizhong Wang, Graham Neubig, Yulia Tsvetkov, Hannaneh Hajishirzi

Viaarxiv icon

Tuning Language Models by Proxy

Jan 16, 2024
Alisa Liu, Xiaochuang Han, Yizhong Wang, Yulia Tsvetkov, Yejin Choi, Noah A. Smith

Viaarxiv icon