Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Heri Ramampiaro

CINDI: Conditional Imputation and Noisy Data Integrity with Flows in Power Grid Data

Mar 12, 2026

David Baumgartner, Helge Langseth, Heri Ramampiaro

Abstract:Real-world multivariate time series, particularly in critical infrastructure such as electrical power grids, are often corrupted by noise and anomalies that degrade the performance of downstream tasks. Standard data cleaning approaches often rely on disjoint strategies, which involve detecting errors with one model and imputing them with another. Such approaches can fail to capture the full joint distribution of the data and ignore prediction uncertainty. This work introduces Conditional Imputation and Noisy Data Integrity (CINDI), an unsupervised probabilistic framework designed to restore data integrity in complex time series. Unlike fragmented approaches, CINDI unifies anomaly detection and imputation into a single end-to-end system built on conditional normalizing flows. By modeling the exact conditional likelihood of the data, the framework identifies low-probability segments and iteratively samples statistically consistent replacements. This allows CINDI to efficiently reuse learned information while preserving the underlying physical and statistical properties of the system. We evaluate the framework using real-world grid loss data from a Norwegian power distribution operator, though the methodology is designed to generalize to any multivariate time series domain. The results demonstrate that CINDI yields robust performance compared to competitive baselines, offering a scalable solution for maintaining reliability in noisy environments.

Via

Access Paper or Ask Questions

Temporal-Conditioned Normalizing Flows for Multivariate Time Series Anomaly Detection

Mar 10, 2026

David Baumgartner, Helge Langseth, Kenth Engø-Monsen, Heri Ramampiaro

Abstract:This paper introduces temporal-conditioned normalizing flows (tcNF), a novel framework that addresses anomaly detection in time series data with accurate modeling of temporal dependencies and uncertainty. By conditioning normalizing flows on previous observations, tcNF effectively captures complex temporal dynamics and generates accurate probability distributions of expected behavior. This autoregressive approach enables robust anomaly detection by identifying low-probability events within the learned distribution. We evaluate tcNF on diverse datasets, demonstrating good accuracy and robustness compared to existing methods. A comprehensive analysis of strengths and limitations and open-source code is provided to facilitate reproducibility and future research.

Via

Access Paper or Ask Questions

PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Oct 04, 2024

Lemei Zhang, Peng Liu, Marcus Tiedemann Oekland Henriksboe, Even W. Lauvrak, Jon Atle Gulla, Heri Ramampiaro

Figure 1 for PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Figure 2 for PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Figure 3 for PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Figure 4 for PersonalSum: A User-Subjective Guided Personalized Summarization Dataset for Large Language Models

Abstract:With the rapid advancement of Natural Language Processing in recent years, numerous studies have shown that generic summaries generated by Large Language Models (LLMs) can sometimes surpass those annotated by experts, such as journalists, according to human evaluations. However, there is limited research on whether these generic summaries meet the individual needs of ordinary people. The biggest obstacle is the lack of human-annotated datasets from the general public. Existing work on personalized summarization often relies on pseudo datasets created from generic summarization datasets or controllable tasks that focus on specific named entities or other aspects, such as the length and specificity of generated summaries, collected from hypothetical tasks without the annotators' initiative. To bridge this gap, we propose a high-quality, personalized, manually annotated abstractive summarization dataset called PersonalSum. This dataset is the first to investigate whether the focus of public readers differs from the generic summaries generated by LLMs. It includes user profiles, personalized summaries accompanied by source sentences from given articles, and machine-generated generic summaries along with their sources. We investigate several personal signals - entities/topics, plot, and structure of articles - that may affect the generation of personalized summaries using LLMs in a few-shot in-context learning scenario. Our preliminary results and analysis indicate that entities/topics are merely one of the key factors that impact the diverse preferences of users, and personalized summarization remains a significant challenge for existing LLMs.

* Accepted at NeurIPS 2024 Track on Datasets and Benchmarks. Code available at https://github.com/SmartmediaAI/PersonalSum

Via

Access Paper or Ask Questions

Towards human performance on automatic motion tracking of infant spontaneous movements

Oct 22, 2020

Daniel Groos, Lars Adde, Ragnhild Støen, Heri Ramampiaro, Espen A. F. Ihlen

Figure 1 for Towards human performance on automatic motion tracking of infant spontaneous movements

Figure 2 for Towards human performance on automatic motion tracking of infant spontaneous movements

Figure 3 for Towards human performance on automatic motion tracking of infant spontaneous movements

Figure 4 for Towards human performance on automatic motion tracking of infant spontaneous movements

Abstract:Assessment of spontaneous movements can predict the long-term developmental outcomes in high-risk infants. In order to develop algorithms for automated prediction of later function based on early motor repertoire, high-precision tracking of segments and joints are required. Four types of convolutional neural networks were investigated on a novel infant pose dataset, covering the large variation in 1 424 videos from a clinical international community. The precision level of the networks was evaluated as the deviation between the estimated keypoint positions and human expert annotations. The computational efficiency was also assessed to determine the feasibility of the neural networks in clinical practice. The study shows that the precision of the best performing infant motion tracker is similar to the inter-rater error of human experts, while still operating efficiently. In conclusion, the proposed tracking of infant movements can pave the way for early detection of motor disorders in children with perinatal brain injuries by quantifying infant movements from video recordings with human precision.

* This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

Via

Access Paper or Ask Questions

Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Jul 19, 2020

Tárik S. Salem, Helge Langseth, Heri Ramampiaro

Figure 1 for Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Figure 2 for Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Figure 3 for Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Figure 4 for Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Abstract:Prediction intervals are a machine- and human-interpretable way to represent predictive uncertainty in a regression analysis. In this paper, we present a method for generating prediction intervals along with point estimates from an ensemble of neural networks. We propose a multi-objective loss function fusing quality measures related to prediction intervals and point estimates, and a penalty function, which enforces semantic integrity of the results and stabilizes the training process of the neural networks. The ensembled prediction intervals are aggregated as a split normal mixture accounting for possible multimodality and asymmetricity of the posterior predictive distribution, and resulting in prediction intervals that capture aleatoric and epistemic uncertainty. Our results show that both our quality-driven loss function and our aggregation method contribute to well-calibrated prediction intervals and point estimates.

* Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), PMLR volume 124, 2020

Via

Access Paper or Ask Questions

EfficientPose: Scalable single-person pose estimation

Apr 25, 2020

Daniel Groos, Heri Ramampiaro, Espen Ihlen

Figure 1 for EfficientPose: Scalable single-person pose estimation

Figure 2 for EfficientPose: Scalable single-person pose estimation

Figure 3 for EfficientPose: Scalable single-person pose estimation

Figure 4 for EfficientPose: Scalable single-person pose estimation

Abstract:Human pose estimation facilitates markerless movement analysis in sports, as well as in clinical applications. Still, state-of-the-art models for human pose estimation generally do not meet the requirements for real-life deployment. The main reason for this is that the more the field progresses, the more expensive the approaches become, with high computational demands. To cope with the challenges caused by this trend, we propose a convolutional neural network architecture that benefits from the recently proposed EfficientNets to deliver scalable single-person pose estimation. To this end, we introduce EfficientPose, which is a family of models harnessing an effective multi-scale feature extractor, computation efficient detection blocks utilizing mobile inverted bottleneck convolutions, and upscaling improving precision of pose configurations. EfficientPose enables real-world deployment on edge devices through 500K parameter model consuming less than one GFLOP. The results from our experiments, using the challenging MPII single-person benchmark, show that the proposed EfficientPose models substantially outperform the widely-used OpenPose model in terms of accuracy, while being at the same time up to 15 times smaller and 20 times more computationally efficient than its counterpart.

* Submitted to Applied Intelligence Journal (APIN)

Via

Access Paper or Ask Questions

Forecasting Intra-Hour Imbalances in Electric Power Systems

Feb 01, 2019

Tárik S. Salem, Karan Kathuria, Heri Ramampiaro, Helge Langseth

Figure 1 for Forecasting Intra-Hour Imbalances in Electric Power Systems

Figure 2 for Forecasting Intra-Hour Imbalances in Electric Power Systems

Figure 3 for Forecasting Intra-Hour Imbalances in Electric Power Systems

Abstract:Keeping the electricity production in balance with the actual demand is becoming a difficult and expensive task in spite of an involvement of experienced human operators. This is due to the increasing complexity of the electric power grid system with the intermittent renewable production as one of the contributors. A beforehand information about an occurring imbalance can help the transmission system operator to adjust the production plans, and thus ensure a high security of supply by reducing the use of costly balancing reserves, and consequently reduce undesirable fluctuations of the 50 Hz power system frequency. In this paper, we introduce the relatively new problem of an intra-hour imbalance forecasting for the transmission system operator (TSO). We focus on the use case of the Norwegian TSO, Statnett. We present a complementary imbalance forecasting tool that is able to support the TSO in determining the trend of future imbalances, and show the potential to proactively alleviate imbalances with a higher accuracy compared to the contemporary solution.

* Accepted at the Thirty-First Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-19)

Via

Access Paper or Ask Questions

Securing Tag-based recommender systems against profile injection attacks: A comparative study. (Extended Report)

Jan 24, 2019

Georgios K. Pitsilis, Heri Ramampiaro, Helge Langseth

Abstract:This work addresses the challenges related to attacks on collaborative tagging systems, which often comes in a form of malicious annotations or profile injection attacks. In particular, we study various countermeasures against two types of such attacks for social tagging systems, the Overload attack and the Piggyback attack. The countermeasure schemes studied here include baseline classifiers such as, Naive Bayes filter and Support Vector Machine, as well as a Deep Learning approach. Our evaluation performed over synthetic spam data generated from del.icio.us dataset, shows that in most cases, Deep Learning can outperform the classical solutions, providing high-level protection against threats.

* 20 pages, 5 figures, 4 tables, Extended report of paper presented at "Late Breaking Results" poster session, RecSys 2018, October 2-7, Vancouver, BC, Canada

Via

Access Paper or Ask Questions

Extracting News Events from Microblogs

Jun 20, 2018

Øystein Repp, Heri Ramampiaro

Figure 1 for Extracting News Events from Microblogs

Figure 2 for Extracting News Events from Microblogs

Figure 3 for Extracting News Events from Microblogs

Figure 4 for Extracting News Events from Microblogs

Abstract:Twitter stream has become a large source of information for many people, but the magnitude of tweets and the noisy nature of its content have made harvesting the knowledge from Twitter a challenging task for researchers for a long time. Aiming at overcoming some of the main challenges of extracting the hidden information from tweet streams, this work proposes a new approach for real-time detection of news events from the Twitter stream. We divide our approach into three steps. The first step is to use a neural network or deep learning to detect news-relevant tweets from the stream. The second step is to apply a novel streaming data clustering algorithm to the detected news tweets to form news events. The third and final step is to rank the detected events based on the size of the event clusters and growth speed of the tweet frequencies. We evaluate the proposed system on a large, publicly available corpus of annotated news events from Twitter. As part of the evaluation, we compare our approach with a related state-of-the-art solution. Overall, our experiments and user-based evaluation show that our approach on detecting current (real) news events delivers a state-of-the-art performance.

* In Journal of Statistics and Management Systems, 21(4), pp. 695-723. Taylor & Francis

Via

Access Paper or Ask Questions

Context-Aware Sequence-to-Sequence Models for Conversational Systems

May 22, 2018

Silje Christensen, Simen Johnsrud, Massimiliano Ruocco, Heri Ramampiaro

Figure 1 for Context-Aware Sequence-to-Sequence Models for Conversational Systems

Abstract:This work proposes a novel approach based on sequence-to-sequence (seq2seq) models for context-aware conversational systems. Exist- ing seq2seq models have been shown to be good for generating natural responses in a data-driven conversational system. However, they still lack mechanisms to incorporate previous conversation turns. We investigate RNN-based methods that efficiently integrate previous turns as a context for generating responses. Overall, our experimental results based on human judgment demonstrate the feasibility and effectiveness of the proposed approach.

Via

Access Paper or Ask Questions