Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rishi Jha

Harnessing the Universal Geometry of Embeddings

May 18, 2025

Rishi Jha, Collin Zhang, Vitaly Shmatikov, John X. Morris

Abstract:We introduce the first method for translating text embeddings from one vector space to another without any paired data, encoders, or predefined sets of matches. Our unsupervised approach translates any embedding to and from a universal latent representation (i.e., a universal semantic structure conjectured by the Platonic Representation Hypothesis). Our translations achieve high cosine similarity across model pairs with different architectures, parameter counts, and training datasets. The ability to translate unknown embeddings into a different space while preserving their geometry has serious implications for the security of vector databases. An adversary with access only to embedding vectors can extract sensitive information about the underlying documents, sufficient for classification and attribute inference.

Via

Access Paper or Ask Questions

Multi-Agent Systems Execute Arbitrary Malicious Code

Mar 15, 2025

Harold Triedman, Rishi Jha, Vitaly Shmatikov

Figure 1 for Multi-Agent Systems Execute Arbitrary Malicious Code

Figure 2 for Multi-Agent Systems Execute Arbitrary Malicious Code

Figure 3 for Multi-Agent Systems Execute Arbitrary Malicious Code

Figure 4 for Multi-Agent Systems Execute Arbitrary Malicious Code

Abstract:Multi-agent systems coordinate LLM-based agents to perform tasks on users' behalf. In real-world applications, multi-agent systems will inevitably interact with untrusted inputs, such as malicious Web content, files, email attachments, etc. Using several recently proposed multi-agent frameworks as concrete examples, we demonstrate that adversarial content can hijack control and communication within the system to invoke unsafe agents and functionalities. This results in a complete security breach, up to execution of arbitrary malicious code on the user's device and/or exfiltration of sensitive data from the user's containerized environment. We show that control-flow hijacking attacks succeed even if the individual agents are not susceptible to direct or indirect prompt injection, and even if they refuse to perform harmful actions.

* 30 pages, 5 figures, 8 tables

Via

Access Paper or Ask Questions

Adversarial Hubness in Multi-Modal Retrieval

Dec 18, 2024

Tingwei Zhang, Fnu Suya, Rishi Jha, Collin Zhang, Vitaly Shmatikov

Abstract:Hubness is a phenomenon in high-dimensional vector spaces where a single point from the natural distribution is unusually close to many other points. This is a well-known problem in information retrieval that causes some items to accidentally (and incorrectly) appear relevant to many queries. In this paper, we investigate how attackers can exploit hubness to turn any image or audio input in a multi-modal retrieval system into an adversarial hub. Adversarial hubs can be used to inject universal adversarial content (e.g., spam) that will be retrieved in response to thousands of different queries, as well as for targeted attacks on queries related to specific, attacker-chosen concepts. We present a method for creating adversarial hubs and evaluate the resulting hubs on benchmark multi-modal retrieval datasets and an image-to-image retrieval system based on a tutorial from Pinecone, a popular vector database. For example, in text-caption-to-image retrieval, a single adversarial hub is retrieved as the top-1 most relevant image for more than 21,000 out of 25,000 test queries (by contrast, the most common natural hub is the top-1 response to only 102 queries). We also investigate whether techniques for mitigating natural hubness are an effective defense against adversarial hubs, and show that they are not effective against hubs that target queries related to specific concepts.

Via

Access Paper or Ask Questions

Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

Jul 07, 2022

Dimitrios C. Gklezakos, Rishi Jha, Rajesh P. N. Rao

Figure 1 for Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

Figure 2 for Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

Figure 3 for Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

Figure 4 for Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

Abstract:Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action mappings that generalize not only to new goals but most importantly to novel, unseen environments. Specifically, we consider the problem of efficiently learning such policies for agents with limited computational and communication capacity, constraints that are frequently encountered in edge devices. We propose the Hyper-Universal Policy Approximator (HUPA), a hypernetwork-based model to generate small task- and environment-conditional policy networks from a single image, with good generalization properties. Our results show that HUPAs significantly outperform an embedding-based alternative for generated policies that are size-constrained. Although this work is restricted to a simple map-based navigation task, future work includes applying the principles behind HUPAs to learning more general affordances for objects and environments.

Via

Access Paper or Ask Questions

On Geodesic Distances and Contextual Embedding Compression for Text Classification

Apr 22, 2021

Rishi Jha, Kai Mihata

Figure 1 for On Geodesic Distances and Contextual Embedding Compression for Text Classification

Figure 2 for On Geodesic Distances and Contextual Embedding Compression for Text Classification

Figure 3 for On Geodesic Distances and Contextual Embedding Compression for Text Classification

Abstract:In some memory-constrained settings like IoT devices and over-the-network data pipelines, it can be advantageous to have smaller contextual embeddings. We investigate the efficacy of projecting contextual embedding data (BERT) onto a manifold, and using nonlinear dimensionality reduction techniques to compress these embeddings. In particular, we propose a novel post-processing approach, applying a combination of Isomap and PCA. We find that the geodesic distance estimations, estimates of the shortest path on a Riemannian manifold, from Isomap's k-Nearest Neighbors graph bolstered the performance of the compressed embeddings to be comparable to the original BERT embeddings. On one dataset, we find that despite a 12-fold dimensionality reduction, the compressed embeddings performed within 0.1% of the original BERT embeddings on a downstream classification task. In addition, we find that this approach works particularly well on tasks reliant on syntactic data, when compared with linear dimensionality reduction. These results show promise for a novel geometric approach to achieve lower dimensional text embeddings from existing transformers and pave the way for data-specific and application-specific embedding compressions.

* 6 pages, 2 figures

Via

Access Paper or Ask Questions