Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ko-Jen Hsiao

Orthogonal Low Rank Embedding Stabilization

Aug 11, 2025

Kevin Zielnicki, Ko-Jen Hsiao

Abstract:The instability of embedding spaces across model retraining cycles presents significant challenges to downstream applications using user or item embeddings derived from recommendation systems as input features. This paper introduces a novel orthogonal low-rank transformation methodology designed to stabilize the user/item embedding space, ensuring consistent embedding dimensions across retraining sessions. Our approach leverages a combination of efficient low-rank singular value decomposition and orthogonal Procrustes transformation to map embeddings into a standardized space. This transformation is computationally efficient, lossless, and lightweight, preserving the dot product and inference quality while reducing operational burdens. Unlike existing methods that modify training objectives or embedding structures, our approach maintains the integrity of the primary model application and can be seamlessly integrated with other stabilization techniques.

Via

Access Paper or Ask Questions

Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Aug 20, 2015

Ko-Jen Hsiao, Kevin S. Xu, Jeff Calder, Alfred O. Hero III

Figure 1 for Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Figure 2 for Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Figure 3 for Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Figure 4 for Multi-criteria Similarity-based Anomaly Detection using Pareto Depth Analysis

Abstract:We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. Similarity-based anomaly detection algorithms detect abnormally large amounts of similarity or dissimilarity, e.g.~as measured by nearest neighbor Euclidean distances between a test sample and the training samples. In many application domains there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In such cases, multiple dissimilarity measures can be defined, including non-metric measures, and one can test for anomalies by scalarizing using a non-negative linear combination of them. If the relative importance of the different dissimilarity measures are not known in advance, as in many anomaly detection applications, the anomaly detection algorithm may need to be executed multiple times with different choices of weights in the linear combination. In this paper, we propose a method for similarity-based anomaly detection using a novel multi-criteria dissimilarity measure, the Pareto depth. The proposed Pareto depth analysis (PDA) anomaly detection algorithm uses the concept of Pareto optimality to detect anomalies under multiple criteria without having to run an algorithm multiple times with different choices of weights. The proposed PDA approach is provably better than using linear combinations of the criteria and shows superior performance on experiments with synthetic and real data sets.

* IEEE Transactions on Neural Networks and Learning Systems 27 (2016) 1307-1321
* The work is submitted to IEEE TNNLS Special Issue on Learning in Non-(geo)metric Spaces for review on October 28, 2013, revised on July 26, 2015 and accepted on July 30, 2015. A preliminary version of this work is reported in the conference Advances in Neural Information Processing Systems (NIPS) 2012

Via

Access Paper or Ask Questions

Pareto-depth for Multiple-query Image Retrieval

Feb 21, 2014

Ko-Jen Hsiao, Jeff Calder, Alfred O. Hero III

Figure 1 for Pareto-depth for Multiple-query Image Retrieval

Figure 2 for Pareto-depth for Multiple-query Image Retrieval

Figure 3 for Pareto-depth for Multiple-query Image Retrieval

Figure 4 for Pareto-depth for Multiple-query Image Retrieval

Abstract:Most content-based image retrieval systems consider either one single query, or multiple queries that include the same object or represent the same semantic information. In this paper we consider the content-based image retrieval problem for multiple query images corresponding to different image semantics. We propose a novel multiple-query information retrieval algorithm that combines the Pareto front method (PFM) with efficient manifold ranking (EMR). We show that our proposed algorithm outperforms state of the art multiple-query retrieval algorithms on real-world image databases. We attribute this performance improvement to concavity properties of the Pareto fronts, and prove a theoretical result that characterizes the asymptotic concavity of the fronts.

Via

Access Paper or Ask Questions

Multi-criteria Anomaly Detection using Pareto Depth Analysis

Jan 07, 2013

Ko-Jen Hsiao, Kevin S. Xu, Jeff Calder, Alfred O. Hero III

Figure 1 for Multi-criteria Anomaly Detection using Pareto Depth Analysis

Figure 2 for Multi-criteria Anomaly Detection using Pareto Depth Analysis

Figure 3 for Multi-criteria Anomaly Detection using Pareto Depth Analysis

Figure 4 for Multi-criteria Anomaly Detection using Pareto Depth Analysis

Abstract:We consider the problem of identifying patterns in a data set that exhibit anomalous behavior, often referred to as anomaly detection. In most anomaly detection algorithms, the dissimilarity between data samples is calculated by a single criterion, such as Euclidean distance. However, in many cases there may not exist a single dissimilarity measure that captures all possible anomalous patterns. In such a case, multiple criteria can be defined, and one can test for anomalies by scalarizing the multiple criteria using a linear combination of them. If the importance of the different criteria are not known in advance, the algorithm may need to be executed multiple times with different choices of weights in the linear combination. In this paper, we introduce a novel non-parametric multi-criteria anomaly detection method using Pareto depth analysis (PDA). PDA uses the concept of Pareto optimality to detect anomalies under multiple criteria without having to run an algorithm multiple times with different choices of weights. The proposed PDA approach scales linearly in the number of criteria and is provably better than linear combinations of the criteria.

* Advances in Neural Information Processing Systems 25 (2012) 854-862
* Removed an unnecessary line from Algorithm 1

Via

Access Paper or Ask Questions