Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kathleen McKeown

Columbia University

Forecasting Communication Derailments Through Conversation Generation

Apr 11, 2025

Yunfan Zhang, Kathleen McKeown, Smaranda Muresan

Figure 1 for Forecasting Communication Derailments Through Conversation Generation

Figure 2 for Forecasting Communication Derailments Through Conversation Generation

Figure 3 for Forecasting Communication Derailments Through Conversation Generation

Figure 4 for Forecasting Communication Derailments Through Conversation Generation

Abstract:Forecasting communication derailment can be useful in real-world settings such as online content moderation, conflict resolution, and business negotiations. However, despite language models' success at identifying offensive speech present in conversations, they struggle to forecast future communication derailments. In contrast to prior work that predicts conversation outcomes solely based on the past conversation history, our approach samples multiple future conversation trajectories conditioned on existing conversation history using a fine-tuned LLM. It predicts the communication outcome based on the consensus of these trajectories. We also experimented with leveraging socio-linguistic attributes, which reflect turn-level conversation dynamics, as guidance when generating future conversations. Our method of future conversation trajectories surpasses state-of-the-art results on English communication derailment prediction benchmarks and demonstrates significant accuracy gains in ablation studies.

Via

Access Paper or Ask Questions

Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Apr 01, 2025

Melanie Subbiah, Akankshya Mishra, Grace Kim, Liyan Tang, Greg Durrett, Kathleen McKeown

Figure 1 for Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Figure 2 for Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Figure 3 for Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Figure 4 for Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Abstract:Determining faithfulness of a claim to a source document is an important problem across many domains. This task is generally treated as a binary judgment of whether the claim is supported or unsupported in relation to the source. In many cases, though, whether a claim is supported can be ambiguous. For instance, it may depend on making inferences from given evidence, and different people can reasonably interpret the claim as either supported or unsupported based on their agreement with those inferences. Forcing binary labels upon such claims lowers the reliability of evaluation. In this work, we reframe the task to manage the subjectivity involved with factuality judgments of ambiguous claims. We introduce LLM-generated edits of summaries as a method of providing a nuanced evaluation of claims: how much does a summary need to be edited to be unambiguous? Whether a claim gets rewritten and how much it changes can be used as an automatic evaluation metric, the Ambiguity Rewrite Metric (ARM), with a much richer feedback signal than a binary judgment of faithfulness. We focus on the area of narrative summarization as it is particularly rife with ambiguity and subjective interpretation. We show that ARM produces a 21% absolute improvement in annotator agreement on claim faithfulness, indicating that subjectivity is reduced.

* Preprint

Via

Access Paper or Ask Questions

Data Caricatures: On the Representation of African American Language in Pretraining Corpora

Mar 13, 2025

Nicholas Deas, Blake Vente, Amith Ananthram, Jessica A. Grieser, Desmond Patton, Shana Kleiner, James Shepard, Kathleen McKeown

Abstract:With a combination of quantitative experiments, human judgments, and qualitative analyses, we evaluate the quantity and quality of African American Language (AAL) representation in 12 predominantly English, open-source pretraining corpora. We specifically focus on the sources, variation, and naturalness of included AAL texts representing the AAL-speaking community. We find that AAL is underrepresented in all evaluated pretraining corpora compared to US demographics, constituting as little as 0.007% of documents. We also find that more than 25% of AAL texts in C4 may be inappropriate for LLMs to generate and reinforce harmful stereotypes. Finally, we find that most automated language, toxicity, and quality filters are more likely to conserve White Mainstream English (WME) texts over AAL in pretraining corpora.

* Preprint

Via

Access Paper or Ask Questions

Layered Insights: Generalizable Analysis of Authorial Style by Leveraging All Transformer Layers

Mar 02, 2025

Milad Alshomary, Nikhil Reddy Varimalla, Vishal Anand, Kathleen McKeown

Abstract:We propose a new approach for the authorship attribution task that leverages the various linguistic representations learned at different layers of pre-trained transformer-based models. We evaluate our approach on three datasets, comparing it to a state-of-the-art baseline in in-domain and out-of-domain scenarios. We found that utilizing various transformer layers improves the robustness of authorship attribution models when tested on out-of-domain data, resulting in new state-of-the-art results. Our analysis gives further insights into how our model's different layers get specialized in representing certain stylistic features that benefit the model when tested out of the domain.

Via

Access Paper or Ask Questions

The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Feb 22, 2025

Yuji Zhang, Sha Li, Cheng Qian, Jiateng Liu, Pengfei Yu, Chi Han, Yi R. Fung, Kathleen McKeown, Chengxiang Zhai, Manling Li(+1 more)

Figure 1 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Figure 2 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Figure 3 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Figure 4 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Abstract:Hallucination is a persistent challenge in large language models (LLMs), where even with rigorous quality control, models often generate distorted facts. This paradox, in which error generation continues despite high-quality training data, calls for a deeper understanding of the underlying LLM mechanisms. To address it, we propose a novel concept: knowledge overshadowing, where model's dominant knowledge can obscure less prominent knowledge during text generation, causing the model to fabricate inaccurate details. Building on this idea, we introduce a novel framework to quantify factual hallucinations by modeling knowledge overshadowing. Central to our approach is the log-linear law, which predicts that the rate of factual hallucination increases linearly with the logarithmic scale of (1) Knowledge Popularity, (2) Knowledge Length, and (3) Model Size. The law provides a means to preemptively quantify hallucinations, offering foresight into their occurrence even before model training or inference. Built on overshadowing effect, we propose a new decoding strategy CoDa, to mitigate hallucinations, which notably enhance model factuality on Overshadow (27.9%), MemoTrap (13.1%) and NQ-Swap (18.3%). Our findings not only deepen understandings of the underlying mechanisms behind hallucinations but also provide actionable insights for developing more predictable and controllable language models.

* 19 pages, 5 figures

Via

Access Paper or Ask Questions

A General Framework for Inference-time Scaling and Steering of Diffusion Models

Jan 16, 2025

Raghav Singhal, Zachary Horvitz, Ryan Teehan, Mengye Ren, Zhou Yu, Kathleen McKeown, Rajesh Ranganath

Figure 1 for A General Framework for Inference-time Scaling and Steering of Diffusion Models

Figure 2 for A General Framework for Inference-time Scaling and Steering of Diffusion Models

Figure 3 for A General Framework for Inference-time Scaling and Steering of Diffusion Models

Figure 4 for A General Framework for Inference-time Scaling and Steering of Diffusion Models

Abstract:Diffusion models produce impressive results in modalities ranging from images and video to protein design and text. However, generating samples with user-specified properties remains a challenge. Recent research proposes fine-tuning models to maximize rewards that capture desired properties, but these methods require expensive training and are prone to mode collapse. In this work, we propose Feynman Kac (FK) steering, an inference-time framework for steering diffusion models with reward functions. FK steering works by sampling a system of multiple interacting diffusion processes, called particles, and resampling particles at intermediate steps based on scores computed using functions called potentials. Potentials are defined using rewards for intermediate states and are selected such that a high value indicates that the particle will yield a high-reward sample. We explore various choices of potentials, intermediate rewards, and samplers. We evaluate FK steering on text-to-image and text diffusion models. For steering text-to-image models with a human preference reward, we find that FK steering a 0.8B parameter model outperforms a 2.6B parameter fine-tuned model on prompt fidelity, with faster sampling and no training. For steering text diffusion models with rewards for text quality and specific text attributes, we find that FK steering generates lower perplexity, more linguistically acceptable outputs and enables gradient-free control of attributes like toxicity. Our results demonstrate that inference-time scaling and steering of diffusion models, even with off-the-shelf rewards, can provide significant sample quality gains and controllability benefits. Code is available at https://github.com/zacharyhorvitz/Fk-Diffusion-Steering .

Via

Access Paper or Ask Questions

Summarization of Opinionated Political Documents with Varied Perspectives

Nov 06, 2024

Nicholas Deas, Kathleen McKeown

Abstract:Global partisan hostility and polarization has increased, and this polarization is heightened around presidential elections. Models capable of generating accurate summaries of diverse perspectives can help reduce such polarization by exposing users to alternative perspectives. In this work, we introduce a novel dataset and task for independently summarizing each political perspective in a set of passages from opinionated news articles. For this task, we propose a framework for evaluating different dimensions of perspective summary performance. We benchmark 10 models of varying sizes and architectures through both automatic and human evaluation. While recent models like GPT-4o perform well on this task, we find that all models struggle to generate summaries faithful to the intended perspective. Our analysis of summaries focuses on how extraction behavior depends on the features of the input documents.

Via

Access Paper or Ask Questions

Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Oct 21, 2024

Zhaoyuan Deng, Amith Ananthram, Kathleen McKeown

Figure 1 for Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Figure 2 for Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Figure 3 for Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Figure 4 for Enhancing Multimodal Affective Analysis with Learned Live Comment Features

Abstract:Live comments, also known as Danmaku, are user-generated messages that are synchronized with video content. These comments overlay directly onto streaming videos, capturing viewer emotions and reactions in real-time. While prior work has leveraged live comments in affective analysis, its use has been limited due to the relative rarity of live comments across different video platforms. To address this, we first construct the Live Comment for Affective Analysis (LCAffect) dataset which contains live comments for English and Chinese videos spanning diverse genres that elicit a wide spectrum of emotions. Then, using this dataset, we use contrastive learning to train a video encoder to produce synthetic live comment features for enhanced multimodal affective content analysis. Through comprehensive experimentation on a wide range of affective analysis tasks (sentiment, emotion recognition, and sarcasm detection) in both English and Chinese, we demonstrate that these synthetic live comment features significantly improve performance over state-of-the-art methods.

Via

Access Paper or Ask Questions

StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Oct 16, 2024

Ajay Patel, Jiacheng Zhu, Justin Qiu, Zachary Horvitz, Marianna Apidianaki, Kathleen McKeown, Chris Callison-Burch

Figure 1 for StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Figure 2 for StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Figure 3 for StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Figure 4 for StyleDistance: Stronger Content-Independent Style Embeddings with Synthetic Parallel Examples

Abstract:Style representations aim to embed texts with similar writing styles closely and texts with different styles far apart, regardless of content. However, the contrastive triplets often used for training these representations may vary in both style and content, leading to potential content leakage in the representations. We introduce StyleDistance, a novel approach to training stronger content-independent style embeddings. We use a large language model to create a synthetic dataset of near-exact paraphrases with controlled style variations, and produce positive and negative examples across 40 distinct style features for precise contrastive learning. We assess the quality of our synthetic data and embeddings through human and automatic evaluations. StyleDistance enhances the content-independence of style embeddings, which generalize to real-world benchmarks and outperform leading style representations in downstream applications. Our model can be found at https://huggingface.co/StyleDistance/styledistance .

Via

Access Paper or Ask Questions

Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Sep 11, 2024

Milad Alshomary, Narutatsu Ri, Marianna Apidianaki, Ajay Patel, Smaranda Muresan, Kathleen McKeown

Figure 1 for Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Figure 2 for Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Figure 3 for Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Figure 4 for Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Abstract:Recent state-of-the-art authorship attribution methods learn authorship representations of texts in a latent, non-interpretable space, hindering their usability in real-world applications. Our work proposes a novel approach to interpreting these learned embeddings by identifying representative points in the latent space and utilizing LLMs to generate informative natural language descriptions of the writing style of each point. We evaluate the alignment of our interpretable space with the latent one and find that it achieves the best prediction agreement compared to other baselines. Additionally, we conduct a human evaluation to assess the quality of these style descriptions, validating their utility as explanations for the latent space. Finally, we investigate whether human performance on the challenging AA task improves when aided by our system's explanations, finding an average improvement of around +20% in accuracy.

* 8 pages, 8 figures, under review

Via

Access Paper or Ask Questions