Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sehee Lim

On the Similarities of Embeddings in Contrastive Learning

Jun 11, 2025

Chungpa Lee, Sehee Lim, Kibok Lee, Jy-yong Sohn

Abstract:Contrastive learning (CL) operates on a simple yet effective principle: embeddings of positive pairs are pulled together, while those of negative pairs are pushed apart. Although various forms of contrastive loss have been proposed and analyzed from different perspectives, prior works lack a comprehensive framework that systematically explains a broad class of these objectives. In this paper, we present a unified framework for understanding CL, which is based on analyzing the cosine similarity between embeddings of positive and negative pairs. In full-batch settings, we show that perfect alignment of positive pairs is unattainable when similarities of negative pairs fall below a certain threshold, and that this misalignment can be alleviated by incorporating within-view negative pairs. In mini-batch settings, we demonstrate that smaller batch sizes incur stronger separation among negative pairs within batches, which leads to higher variance in similarities of negative pairs. To address this limitation of mini-batch CL, we introduce an auxiliary loss term that reduces the variance of similarities of negative pairs in CL. Empirical results demonstrate that incorporating the proposed loss consistently improves the performance of CL methods in small-batch training.

* International Conference on Machine Learning (ICML) 2025
* contrastive learning, representation learning, embedding, similarity, negative pair, positive pair

Via

Access Paper or Ask Questions

ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification

Mar 21, 2024

Sehee Lim, Yejin Kim, Chi-Hyun Choi, Jy-yong Sohn, Byung-Hoon Kim

Figure 1 for ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification

Figure 2 for ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification

Figure 3 for ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification

Figure 4 for ERD: A Framework for Improving LLM Reasoning for Cognitive Distortion Classification

Abstract:Improving the accessibility of psychotherapy with the aid of Large Language Models (LLMs) is garnering a significant attention in recent years. Recognizing cognitive distortions from the interviewee's utterances can be an essential part of psychotherapy, especially for cognitive behavioral therapy. In this paper, we propose ERD, which improves LLM-based cognitive distortion classification performance with the aid of additional modules of (1) extracting the parts related to cognitive distortion, and (2) debating the reasoning steps by multiple agents. Our experimental results on a public dataset show that ERD improves the multi-class F1 score as well as binary specificity score. Regarding the latter score, it turns out that our method is effective in debiasing the baseline method which has high false positive rate, especially when the summary of multi-agent debate is provided to LLMs.

Via

Access Paper or Ask Questions