Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Samuel Dahan

What Am I Missing? Question-Answering as Hidden State Probing

May 29, 2026

Chu Fei Luo, Samuel Dahan, Xiaodan Zhu

Abstract:Test-time reasoning has become a significant field of study since the introduction of chain-of-thought reasoning in large language models (LLMs). However, the mechanisms of this reasoning process are still under-explored -- from the same input prompt, and even the same partial solution, LLMs can produce varied answers if sampled multiple times. We propose to leverage question-asking as an inference-time intervention that articulates information about the model's hidden state. To achieve that, we present a student-teacher setting where a student asks questions to a teacher. We train a probe on the student's hidden state before and after asking a question and find it is predictive of the trajectory's final correctness, even before generating the teacher's answer. This suggests there is a meaningful signal from the self-diagnosis that occurs during question generation rather than information transfer from the teacher. We then frame question-asking as a sequential decision problem, using this probe as a quality score, and define a gating policy to ask questions that maximize likelihood of correctness. We find that the success of question-asking as an intervention is largely dependent on the model's self-consistency. Our empirical results show a gap between detection and recovery; while our gating policy captures model correctness and uncertainty, interventions are equally likely to harm correct trajectories as they are to recover incorrect ones. This gap between diagnosis and correction has broader implications on language models' capacity for self-refinement under uncertainty.

Via

Access Paper or Ask Questions

Misinformation with Legal Consequences (MisLC): A New Task Towards Harnessing Societal Harm of Misinformation

Oct 04, 2024

Chu Fei Luo, Radin Shayanfar, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu

Abstract:Misinformation, defined as false or inaccurate information, can result in significant societal harm when it is spread with malicious or even innocuous intent. The rapid online information exchange necessitates advanced detection mechanisms to mitigate misinformation-induced harm. Existing research, however, has predominantly focused on assessing veracity, overlooking the legal implications and social consequences of misinformation. In this work, we take a novel angle to consolidate the definition of misinformation detection using legal issues as a measurement of societal ramifications, aiming to bring interdisciplinary efforts to tackle misinformation and its consequence. We introduce a new task: Misinformation with Legal Consequence (MisLC), which leverages definitions from a wide range of legal domains covering 4 broader legal topics and 11 fine-grained legal issues, including hate speech, election laws, and privacy regulations. For this task, we advocate a two-step dataset curation approach that utilizes crowd-sourced checkworthiness and expert evaluations of misinformation. We provide insights about the MisLC task through empirical evidence, from the problem definition to experiments and expert involvement. While the latest large language models and retrieval-augmented generation are effective baselines for the task, we find they are still far from replicating expert performance.

* 8.5 pages of main body, 20 pages total; Accepted to Findings of EMNLP 2024

Via

Access Paper or Ask Questions

Experimenting with Legal AI Solutions: The Case of Question-Answering for Access to Justice

Sep 12, 2024

Jonathan Li, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu

Abstract:Generative AI models, such as the GPT and Llama series, have significant potential to assist laypeople in answering legal questions. However, little prior work focuses on the data sourcing, inference, and evaluation of these models in the context of laypersons. To this end, we propose a human-centric legal NLP pipeline, covering data sourcing, inference, and evaluation. We introduce and release a dataset, LegalQA, with real and specific legal questions spanning from employment law to criminal law, corresponding answers written by legal experts, and citations for each answer. We develop an automatic evaluation protocol for this dataset, then show that retrieval-augmented generation from only 850 citations in the train set can match or outperform internet-wide retrieval, despite containing 9 orders of magnitude less data. Finally, we propose future directions for open-sourced efforts, which fall behind closed-sourced models.

* Accepted into GenLaw '24 (ICML 2024 workshop)

Via

Access Paper or Ask Questions

Evaluating AI for Law: Bridging the Gap with Open-Source Solutions

Apr 18, 2024

Rohan Bhambhoria, Samuel Dahan, Jonathan Li, Xiaodan Zhu

Figure 1 for Evaluating AI for Law: Bridging the Gap with Open-Source Solutions

Figure 2 for Evaluating AI for Law: Bridging the Gap with Open-Source Solutions

Figure 3 for Evaluating AI for Law: Bridging the Gap with Open-Source Solutions

Figure 4 for Evaluating AI for Law: Bridging the Gap with Open-Source Solutions

Abstract:This study evaluates the performance of general-purpose AI, like ChatGPT, in legal question-answering tasks, highlighting significant risks to legal professionals and clients. It suggests leveraging foundational models enhanced by domain-specific knowledge to overcome these issues. The paper advocates for creating open-source legal AI systems to improve accuracy, transparency, and narrative diversity, addressing general AI's shortcomings in legal contexts.

Via

Access Paper or Ask Questions

Prototype-Based Interpretability for Legal Citation Prediction

May 25, 2023

Chu Fei Luo, Rohan Bhambhoria, Samuel Dahan, Xiaodan Zhu

Figure 1 for Prototype-Based Interpretability for Legal Citation Prediction

Figure 2 for Prototype-Based Interpretability for Legal Citation Prediction

Figure 3 for Prototype-Based Interpretability for Legal Citation Prediction

Figure 4 for Prototype-Based Interpretability for Legal Citation Prediction

Abstract:Deep learning has made significant progress in the past decade, and demonstrates potential to solve problems with extensive social impact. In high-stakes decision making areas such as law, experts often require interpretability for automatic systems to be utilized in practical settings. In this work, we attempt to address these requirements applied to the important problem of legal citation prediction (LCP). We design the task with parallels to the thought-process of lawyers, i.e., with reference to both precedents and legislative provisions. After initial experimental results, we refine the target citation predictions with the feedback of legal experts. Additionally, we introduce a prototype architecture to add interpretability, achieving strong performance while adhering to decision parameters used by lawyers. Our study builds on and leverages the state-of-the-art language processing models for law, while addressing vital considerations for high-stakes tasks with practical societal impact.

* 8.5 pages, 4 figures. To be published in Findings of ACL 2023

Via

Access Paper or Ask Questions

Towards Legally Enforceable Hate Speech Detection for Public Forums

May 23, 2023

Chu Fei Luo, Rohan Bhambhoria, Xiaodan Zhu, Samuel Dahan

Figure 1 for Towards Legally Enforceable Hate Speech Detection for Public Forums

Figure 2 for Towards Legally Enforceable Hate Speech Detection for Public Forums

Figure 3 for Towards Legally Enforceable Hate Speech Detection for Public Forums

Figure 4 for Towards Legally Enforceable Hate Speech Detection for Public Forums

Abstract:Hate speech is a serious issue on public forums, and proper enforcement of hate speech laws is key for protecting groups of people against harmful and discriminatory language. However, determining what constitutes hate speech is a complex task that is highly open to subjective interpretations. Existing works do not align their systems with enforceable definitions of hate speech, which can make their outputs inconsistent with the goals of regulators. Our work introduces a new task for enforceable hate speech detection centred around legal definitions, and a dataset annotated on violations of eleven possible definitions by legal experts. Given the challenge of identifying clear, legally enforceable instances of hate speech, we augment the dataset with expert-generated samples and an automatically mined challenge set. We experiment with grounding the model decision in these definitions using zero-shot and few-shot prompting. We then report results on several large language models (LLMs). With this task definition, automatic hate speech detection can be more closely aligned to enforceable laws, and hence assist in more rigorous enforcement of legal protections against harmful speech in public forums.

* 4 pages

Via

Access Paper or Ask Questions

Interpretable Low-Resource Legal Decision Making

Jan 01, 2022

Rohan Bhambhoria, Hui Liu, Samuel Dahan, Xiaodan Zhu

Figure 1 for Interpretable Low-Resource Legal Decision Making

Figure 2 for Interpretable Low-Resource Legal Decision Making

Figure 3 for Interpretable Low-Resource Legal Decision Making

Figure 4 for Interpretable Low-Resource Legal Decision Making

Abstract:Over the past several years, legal applications of deep learning have been on the rise. However, as with other high-stakes decision making areas, the requirement for interpretability is of crucial importance. Current models utilized by legal practitioners are more of the conventional machine learning type, wherein they are inherently interpretable, yet unable to harness the performance capabilities of data-driven deep learning models. In this work, we utilize deep learning models in the area of trademark law to shed light on the issue of likelihood of confusion between trademarks. Specifically, we introduce a model-agnostic interpretable intermediate layer, a technique which proves to be effective for legal documents. Furthermore, we utilize weakly supervised learning by means of a curriculum learning strategy, effectively demonstrating the improved performance of a deep learning model. This is in contrast to the conventional models which are only able to utilize the limited number of expensive manually-annotated samples by legal experts. Although the methods presented in this work tackles the task of risk of confusion for trademarks, it is straightforward to extend them to other fields of law, or more generally, to other similar high-stakes application scenarios.

* Accepted at AAAI 2022

Via

Access Paper or Ask Questions