Advait Deshmukh

A Structured Clustering Approach for Inducing Media Narratives

Apr 11, 2026

Rohan Das, Advait Deshmukh, Alexandria Leto, Zohar Naaman, I-Ta Lee, Maria Leonor Pacheco

Abstract: Media narratives wield tremendous power in shaping public opinion, yet computational approaches struggle to capture the nuanced storytelling structures that communication theory emphasizes as central to how meaning is constructed. Existing approaches either miss subtle narrative patterns through coarse-grained analysis or require domain-specific taxonomies that limit scalability. To bridge this gap, we present a framework for inducing rich narrative schemas by jointly modeling events and characters via structured clustering. Our approach produces explainable narrative schemas that align with established framing theory while scaling to large corpora without exhaustive manual annotation.

* Accepted to the Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing

Oct 22, 2024

Advait Deshmukh, Ashwin Umadi, Dananjay Srinivas, Maria Leonor Pacheco

Abstract: Pre-trained language models (PLMs) are trained on large amounts of data, which helps them capture world knowledge alongside linguistic competence. As a result, they are used extensively for ultra-fine entity typing tasks, where they supply the entity knowledge held in their parameter space. Given that PLMs learn from co-occurrence patterns, they likely contain more or less knowledge about entities depending on how frequent those entities are in the pre-training data. In this work, we probe PLMs to elicit encoded entity probabilities and demonstrate that these probabilities correlate highly with entity frequency in large-scale internet data. Then, we demonstrate that entity-typing approaches that rely on PLMs struggle with entities in the long tail of the distribution. Our findings suggest that we need to go beyond PLMs to produce solutions that perform well for rare, new, or infrequent entities.
