Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Rong Pan

Active Learning for Multiple Change Point Detection in Non-stationary Time Series with Deep Gaussian Processes

May 26, 2025

Hao Zhao, Rong Pan

Abstract:Multiple change point (MCP) detection in non-stationary time series is challenging due to the variety of underlying patterns. To address these challenges, we propose a novel algorithm that integrates Active Learning (AL) with Deep Gaussian Processes (DGPs) for robust MCP detection. Our method leverages spectral analysis to identify potential changes and employs AL to strategically select new sampling points for improved efficiency. By incorporating the modeling flexibility of DGPs with the change-identification capabilities of spectral methods, our approach adapts to diverse spectral change behaviors and effectively localizes multiple change points. Experiments on both simulated and real-world data demonstrate that our method outperforms existing techniques in terms of detection accuracy and sampling efficiency for non-stationary time series.

Via

Access Paper or Ask Questions

From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design

Apr 10, 2025

Abdelmonem Elrefaey, Rong Pan

Figure 1 for From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design

Figure 2 for From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design

Figure 3 for From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design

Figure 4 for From Observation to Orientation: an Adaptive Integer Programming Approach to Intervention Design

Abstract:Using both observational and experimental data, a causal discovery process can identify the causal relationships between variables. A unique adaptive intervention design paradigm is presented in this work, where causal directed acyclic graphs (DAGs) are for effectively recovered with practical budgetary considerations. In order to choose treatments that optimize information gain under these considerations, an iterative integer programming (IP) approach is proposed, which drastically reduces the number of experiments required. Simulations over a broad range of graph sizes and edge densities are used to assess the effectiveness of the suggested approach. Results show that the proposed adaptive IP approach achieves full causal graph recovery with fewer intervention iterations and variable manipulations than random intervention baselines, and it is also flexible enough to accommodate a variety of practical constraints.

Via

Access Paper or Ask Questions

SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Dec 17, 2024

Zehao Chen, Rong Pan

Figure 1 for SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Figure 2 for SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Figure 3 for SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Figure 4 for SVGBuilder: Component-Based Colored SVG Generation with Text-Guided Autoregressive Transformers

Abstract:Scalable Vector Graphics (SVG) are essential XML-based formats for versatile graphics, offering resolution independence and scalability. Unlike raster images, SVGs use geometric shapes and support interactivity, animation, and manipulation via CSS and JavaScript. Current SVG generation methods face challenges related to high computational costs and complexity. In contrast, human designers use component-based tools for efficient SVG creation. Inspired by this, SVGBuilder introduces a component-based, autoregressive model for generating high-quality colored SVGs from textual input. It significantly reduces computational overhead and improves efficiency compared to traditional methods. Our model generates SVGs up to 604 times faster than optimization-based approaches. To address the limitations of existing SVG datasets and support our research, we introduce ColorSVG-100K, the first large-scale dataset of colored SVGs, comprising 100,000 graphics. This dataset fills the gap in color information for SVG generation models and enhances diversity in model training. Evaluation against state-of-the-art models demonstrates SVGBuilder's superior performance in practical applications, highlighting its efficiency and quality in generating complex SVG graphics.

* Project: https://svgbuilder.github.io

Via

Access Paper or Ask Questions

Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation

Dec 03, 2024

Mohammad Sadeq Abolhasani, Rong Pan

Figure 1 for Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation

Figure 2 for Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation

Figure 3 for Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation

Figure 4 for Leveraging LLM for Automated Ontology Extraction and Knowledge Graph Generation

Abstract:Extracting relevant and structured knowledge from large, complex technical documents within the Reliability and Maintainability (RAM) domain is labor-intensive and prone to errors. Our work addresses this challenge by presenting OntoKGen, a genuine pipeline for ontology extraction and Knowledge Graph (KG) generation. OntoKGen leverages Large Language Models (LLMs) through an interactive user interface guided by our adaptive iterative Chain of Thought (CoT) algorithm to ensure that the ontology extraction process and, thus, KG generation align with user-specific requirements. Although KG generation follows a clear, structured path based on the confirmed ontology, there is no universally correct ontology as it is inherently based on the user's preferences. OntoKGen recommends an ontology grounded in best practices, minimizing user effort and providing valuable insights that may have been overlooked, all while giving the user complete control over the final ontology. Having generated the KG based on the confirmed ontology, OntoKGen enables seamless integration into schemeless, non-relational databases like Neo4j. This integration allows for flexible storage and retrieval of knowledge from diverse, unstructured sources, facilitating advanced querying, analysis, and decision-making. Moreover, the generated KG serves as a robust foundation for future integration into Retrieval Augmented Generation (RAG) systems, offering enhanced capabilities for developing domain-specific intelligent applications.

Via

Access Paper or Ask Questions

Causal Discovery by Interventions via Integer Programming

Dec 02, 2024

Abdelmonem Elrefaey, Rong Pan

Figure 1 for Causal Discovery by Interventions via Integer Programming

Figure 2 for Causal Discovery by Interventions via Integer Programming

Figure 3 for Causal Discovery by Interventions via Integer Programming

Figure 4 for Causal Discovery by Interventions via Integer Programming

Abstract:Causal discovery is essential across various scientific fields to uncover causal structures within data. Traditional methods relying on observational data have limitations due to confounding variables. This paper presents an optimization-based approach using integer programming (IP) to design minimal intervention sets that ensure causal structure identifiability. Our method provides exact and modular solutions that can be adjusted to different experimental settings and constraints. We demonstrate its effectiveness through comparative analysis across different settings, demonstrating its applicability and robustness.

Via

Access Paper or Ask Questions

Gaussian Derivative Change-point Detection for Early Warnings of Industrial System Failures

Oct 29, 2024

Hao Zhao, Rong Pan

Abstract:An early warning of future system failure is essential for conducting predictive maintenance and enhancing system availability. This paper introduces a three-step framework for assessing system health to predict imminent system breakdowns. First, the Gaussian Derivative Change-Point Detection (GDCPD) algorithm is proposed for detecting changes in the high-dimensional feature space. GDCPD conducts a multivariate Change-Point Detection (CPD) by implementing Gaussian derivative processes for identifying change locations on critical system features, as these changes eventually will lead to system failure. To assess the significance of these changes, Weighted Mahalanobis Distance (WMD) is applied in both offline and online analyses. In the offline setting, WMD helps establish a threshold that determines significant system variations, while in the online setting, it facilitates real-time monitoring, issuing alarms for potential future system breakdowns. Utilizing the insights gained from the GDCPD and monitoring scheme, Long Short-Term Memory (LSTM) network is then employed to estimate the Remaining Useful Life (RUL) of the system. The experimental study of a real-world system demonstrates the effectiveness of the proposed methodology in accurately forecasting system failures well before they occur. By integrating CPD with real-time monitoring and RUL prediction, this methodology significantly advances system health monitoring and early warning capabilities.

Via

Access Paper or Ask Questions

MEC-IP: Efficient Discovery of Markov Equivalent Classes via Integer Programming

Oct 22, 2024

Abdelmonem Elrefaey, Rong Pan

Figure 1 for MEC-IP: Efficient Discovery of Markov Equivalent Classes via Integer Programming

Figure 2 for MEC-IP: Efficient Discovery of Markov Equivalent Classes via Integer Programming

Figure 3 for MEC-IP: Efficient Discovery of Markov Equivalent Classes via Integer Programming

Figure 4 for MEC-IP: Efficient Discovery of Markov Equivalent Classes via Integer Programming

Abstract:This paper presents a novel Integer Programming (IP) approach for discovering the Markov Equivalent Class (MEC) of Bayesian Networks (BNs) through observational data. The MEC-IP algorithm utilizes a unique clique-focusing strategy and Extended Maximal Spanning Graphs (EMSG) to streamline the search for MEC, thus overcoming the computational limitations inherent in other existing algorithms. Our numerical results show that not only a remarkable reduction in computational time is achieved by our algorithm but also an improvement in causal discovery accuracy is seen across diverse datasets. These findings underscore this new algorithm's potential as a powerful tool for researchers and practitioners in causal discovery and BNSL, offering a significant leap forward toward the efficient and accurate analysis of complex data structures.

Via

Access Paper or Ask Questions

How Does Data Diversity Shape the Weight Landscape of Neural Networks?

Oct 18, 2024

Yang Ba, Michelle V. Mancenido, Rong Pan

Figure 1 for How Does Data Diversity Shape the Weight Landscape of Neural Networks?

Figure 2 for How Does Data Diversity Shape the Weight Landscape of Neural Networks?

Figure 3 for How Does Data Diversity Shape the Weight Landscape of Neural Networks?

Figure 4 for How Does Data Diversity Shape the Weight Landscape of Neural Networks?

Abstract:To enhance the generalization of machine learning models to unseen data, techniques such as dropout, weight decay ($L_2$ regularization), and noise augmentation are commonly employed. While regularization methods (i.e., dropout and weight decay) are geared toward adjusting model parameters to prevent overfitting, data augmentation increases the diversity of the input training set, a method purported to improve accuracy and calibration error. In this paper, we investigate the impact of each of these techniques on the parameter space of neural networks, with the goal of understanding how they alter the weight landscape in transfer learning scenarios. To accomplish this, we employ Random Matrix Theory to analyze the eigenvalue distributions of pre-trained models, fine-tuned using these techniques but using different levels of data diversity, for the same downstream tasks. We observe that diverse data influences the weight landscape in a similar fashion as dropout. Additionally, we compare commonly used data augmentation methods with synthetic data created by generative models. We conclude that synthetic data can bring more diversity into real input data, resulting in a better performance on out-of-distribution test instances.

Via

Access Paper or Ask Questions

Fill In The Gaps: Model Calibration and Generalization with Synthetic Data

Oct 07, 2024

Yang Ba, Michelle V. Mancenido, Rong Pan

Figure 1 for Fill In The Gaps: Model Calibration and Generalization with Synthetic Data

Figure 2 for Fill In The Gaps: Model Calibration and Generalization with Synthetic Data

Figure 3 for Fill In The Gaps: Model Calibration and Generalization with Synthetic Data

Figure 4 for Fill In The Gaps: Model Calibration and Generalization with Synthetic Data

Abstract:As machine learning models continue to swiftly advance, calibrating their performance has become a major concern prior to practical and widespread implementation. Most existing calibration methods often negatively impact model accuracy due to the lack of diversity of validation data, resulting in reduced generalizability. To address this, we propose a calibration method that incorporates synthetic data without compromising accuracy. We derive the expected calibration error (ECE) bound using the Probably Approximately Correct (PAC) learning framework. Large language models (LLMs), known for their ability to mimic real data and generate text with mixed class labels, are utilized as a synthetic data generation strategy to lower the ECE bound and improve model accuracy on real test data. Additionally, we propose data generation mechanisms for efficient calibration. Testing our method on four different natural language processing tasks, we observed an average up to 34\% increase in accuracy and 33\% decrease in ECE.

* Accepted to EMNLP 2024 Main Conference (Long paper)

Via

Access Paper or Ask Questions

Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes

Dec 05, 2023

Hao Zhao, Rong Pan

Figure 1 for Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes

Figure 2 for Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes

Figure 3 for Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes

Figure 4 for Active Learning for Abrupt Shifts Change-point Detection via Derivative-Aware Gaussian Processes

Abstract:Change-point detection (CPD) is crucial for identifying abrupt shifts in data, which influence decision-making and efficient resource allocation across various domains. To address the challenges posed by the costly and time-intensive data acquisition in CPD, we introduce the Derivative-Aware Change Detection (DACD) method. It leverages the derivative process of a Gaussian process (GP) for Active Learning (AL), aiming to pinpoint change-point locations effectively. DACD balances the exploitation and exploration of derivative processes through multiple data acquisition functions (AFs). By utilizing GP derivative mean and variance as criteria, DACD sequentially selects the next sampling data point, thus enhancing algorithmic efficiency and ensuring reliable and accurate results. We investigate the effectiveness of DACD method in diverse scenarios and show it outperforms other active learning change-point detection approaches.

Via

Access Paper or Ask Questions