Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Marianne Clausel

IECL

Identifiability in Causal Abstractions: A Hierarchy of Criteria

Jul 08, 2025

Clément Yvernes, Emilie Devijver, Marianne Clausel, Eric Gaussier

Abstract:Identifying the effect of a treatment from observational data typically requires assuming a fully specified causal diagram. However, such diagrams are rarely known in practice, especially in complex or high-dimensional settings. To overcome this limitation, recent works have explored the use of causal abstractions-simplified representations that retain partial causal information. In this paper, we consider causal abstractions formalized as collections of causal diagrams, and focus on the identifiability of causal queries within such collections. We introduce and formalize several identifiability criteria under this setting. Our main contribution is to organize these criteria into a structured hierarchy, highlighting their relationships. This hierarchical view enables a clearer understanding of what can be identified under varying levels of causal knowledge. We illustrate our framework through examples from the literature and provide tools to reason about identifiability when full causal knowledge is unavailable.

* Accepted at the CAR Workshop at UAI2025

Via

Access Paper or Ask Questions

ImmunoFOMO: Are Language Models missing what oncologists see?

Jun 13, 2025

Aman Sinha, Bogdan-Valentin Popescu, Xavier Coubez, Marianne Clausel, Mathieu Constant

Abstract:Language models (LMs) capabilities have grown with a fast pace over the past decade leading researchers in various disciplines, such as biomedical research, to increasingly explore the utility of LMs in their day-to-day applications. Domain specific language models have already been in use for biomedical natural language processing (NLP) applications. Recently however, the interest has grown towards medical language models and their understanding capabilities. In this paper, we investigate the medical conceptual grounding of various language models against expert clinicians for identification of hallmarks of immunotherapy in breast cancer abstracts. Our results show that pre-trained language models have potential to outperform large language models in identifying very specific (low-level) concepts.

Via

Access Paper or Ask Questions

Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Jul 17, 2024

Aman Sinha, Timothee Mickus, Marianne Clausel, Mathieu Constant, Xavier Coubez

Figure 1 for Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Figure 2 for Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Figure 3 for Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Figure 4 for Domain-specific or Uncertainty-aware models: Does it really make a difference for biomedical text classification?

Abstract:The success of pretrained language models (PLMs) across a spate of use-cases has led to significant investment from the NLP community towards building domain-specific foundational models. On the other hand, in mission critical settings such as biomedical applications, other aspects also factor in-chief of which is a model's ability to produce reasonable estimates of its own uncertainty. In the present study, we discuss these two desiderata through the lens of how they shape the entropy of a model's output probability distribution. We find that domain specificity and uncertainty awareness can often be successfully combined, but the exact task at hand weighs in much more strongly.

* BioNLP 2024

Via

Access Paper or Ask Questions

Ensembles of Probabilistic Regression Trees

Jun 20, 2024

Alexandre Seiller, Éric Gaussier, Emilie Devijver, Marianne Clausel, Sami Alkhoury

Figure 1 for Ensembles of Probabilistic Regression Trees

Figure 2 for Ensembles of Probabilistic Regression Trees

Figure 3 for Ensembles of Probabilistic Regression Trees

Figure 4 for Ensembles of Probabilistic Regression Trees

Abstract:Tree-based ensemble methods such as random forests, gradient-boosted trees, and Bayesianadditive regression trees have been successfully used for regression problems in many applicationsand research studies. In this paper, we study ensemble versions of probabilisticregression trees that provide smooth approximations of the objective function by assigningeach observation to each region with respect to a probability distribution. We prove thatthe ensemble versions of probabilistic regression trees considered are consistent, and experimentallystudy their bias-variance trade-off and compare them with the state-of-the-art interms of performance prediction.

Via

Access Paper or Ask Questions

Risk prediction of pathological gambling on social media

Mar 28, 2024

Angelina Parfenova, Marianne Clausel

Abstract:This paper addresses the problem of risk prediction on social media data, specifically focusing on the classification of Reddit users as having a pathological gambling disorder. To tackle this problem, this paper focuses on incorporating temporal and emotional features into the model. The preprocessing phase involves dealing with the time irregularity of posts by padding sequences. Two baseline architectures are used for preliminary evaluation: BERT classifier on concatenated posts per user and GRU with LSTM on sequential data. Experimental results demonstrate that the sequential models outperform the concatenation-based model. The results of the experiments conclude that the incorporation of a time decay layer (TD) and passing the emotion classification layer (EmoBERTa) through LSTM improves the performance significantly. Experiments concluded that the addition of a self-attention layer didn't significantly improve the performance of the model, however provided easily interpretable attention scores. The developed architecture with the inclusion of EmoBERTa and TD layers achieved a high F1 score, beating existing benchmarks on pathological gambling dataset. Future work may involve the early prediction of risk factors associated with pathological gambling disorder and testing models on other datasets. Overall, this research highlights the significance of the sequential processing of posts including temporal and emotional features to boost the predictive power, as well as adding an attention layer for interpretability.

Via

Access Paper or Ask Questions

Modelling Irregularly Sampled Time Series Without Imputation

Sep 15, 2023

Rohit Agarwal, Aman Sinha, Dilip K. Prasad, Marianne Clausel, Alexander Horsch, Mathieu Constant, Xavier Coubez

Figure 1 for Modelling Irregularly Sampled Time Series Without Imputation

Figure 2 for Modelling Irregularly Sampled Time Series Without Imputation

Figure 3 for Modelling Irregularly Sampled Time Series Without Imputation

Figure 4 for Modelling Irregularly Sampled Time Series Without Imputation

Abstract:Modelling irregularly-sampled time series (ISTS) is challenging because of missing values. Most existing methods focus on handling ISTS by converting irregularly sampled data into regularly sampled data via imputation. These models assume an underlying missing mechanism leading to unwanted bias and sub-optimal performance. We present SLAN (Switch LSTM Aggregate Network), which utilizes a pack of LSTMs to model ISTS without imputation, eliminating the assumption of any underlying process. It dynamically adapts its architecture on the fly based on the measured sensors. SLAN exploits the irregularity information to capture each sensor's local summary explicitly and maintains a global summary state throughout the observational period. We demonstrate the efficacy of SLAN on publicly available datasets, namely, MIMIC-III, Physionet 2012 and Physionet 2019. The code is available at https://github.com/Rohit102497/SLAN.

Via

Access Paper or Ask Questions

Polarimetric phase retrieval: uniqueness and algorithms

Jun 26, 2022

Julien Flamant, Konstantin Usevich, Marianne Clausel, David Brie

Figure 1 for Polarimetric phase retrieval: uniqueness and algorithms

Figure 2 for Polarimetric phase retrieval: uniqueness and algorithms

Figure 3 for Polarimetric phase retrieval: uniqueness and algorithms

Figure 4 for Polarimetric phase retrieval: uniqueness and algorithms

Abstract:This work introduces a novel Fourier phase retrieval model, called polarimetric phase retrieval that enables a systematic use of polarization information in Fourier phase retrieval problems. We provide a complete characterization of uniqueness properties of this new model by unraveling equivalencies with a peculiar polynomial factorization problem. We introduce two different but complementary categories of reconstruction methods. The first one is algebraic and relies on the use of approximate greatest common divisor computations using Sylvester matrices. The second one carefully adapts existing algorithms for Fourier phase retrieval, namely semidefinite positive relaxation and Wirtinger-Flow, to solve the polarimetric phase retrieval problem. Finally, a set of numerical experiments permits a detailed assessment of the numerical behavior and relative performances of each proposed reconstruction strategy. We further highlight a reconstruction strategy that combines both approaches for scalable, computationally efficient and asymptotically MSE optimal performance.

* 37 pages, 10 figures

Via

Access Paper or Ask Questions

Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

Apr 20, 2022

Dimitri Bouche, Rémi Flamary, Florence d'Alché-Buc, Riwal Plougonven, Marianne Clausel, Jordi Badosa, Philippe Drobinski

Figure 1 for Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

Figure 2 for Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

Figure 3 for Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

Figure 4 for Wind power predictions from nowcasts to 4-hour forecasts: a learning approach with variable selection

Abstract:We study the prediction of short term wind speed and wind power (every 10 minutes up to 4 hours ahead). Accurate forecasts for those quantities are crucial to mitigate the negative effects of wind farms' intermittent production on energy systems and markets. For those time scales, outputs of numerical weather prediction models are usually overlooked even though they should provide valuable information on higher scales dynamics. In this work, we combine those outputs with local observations using machine learning. So as to make the results usable for practitioners, we focus on simple and well known methods which can handle a high volume of data. We study first variable selection through two simple techniques, a linear one and a nonlinear one. Then we exploit those results to forecast wind speed and wind power still with an emphasis on linear models versus nonlinear ones. For the wind power prediction, we also compare the indirect approach (wind speed predictions passed through a power curve) and the indirect one (directly predict wind power).

Via

Access Paper or Ask Questions

Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)

Feb 26, 2022

Aleksandra Burashnikova, Yury Maximov, Marianne Clausel, Charlotte Laclau, Franck Iutzeler, Massih-Reza Amini

Figure 1 for Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)

Figure 2 for Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)

Figure 3 for Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)

Figure 4 for Learning over No-Preferred and Preferred Sequence of Items for Robust Recommendation (Extended Abstract)

Abstract:This paper is an extended version of [Burashnikova et al., 2021, arXiv: 2012.06910], where we proposed a theoretically supported sequential strategy for training a large-scale Recommender System (RS) over implicit feedback, mainly in the form of clicks. The proposed approach consists in minimizing pairwise ranking loss over blocks of consecutive items constituted by a sequence of non-clicked items followed by a clicked one for each user. We present two variants of this strategy where model parameters are updated using either the momentum method or a gradient-based approach. To prevent updating the parameters for an abnormally high number of clicks over some targeted items (mainly due to bots), we introduce an upper and a lower threshold on the number of updates for each user. These thresholds are estimated over the distribution of the number of blocks in the training set. They affect the decision of RS by shifting the distribution of items that are shown to the users. Furthermore, we provide a convergence analysis of both algorithms and demonstrate their practical efficiency over six large-scale collections with respect to various ranking measures.

* 7 pages, 2 tables; extended abstract accepted to IJCAI 2022. arXiv admin note: substantial text overlap with arXiv:2012.06910, arXiv:1902.08495

Via

Access Paper or Ask Questions

Recommender systems: when memory matters

Dec 04, 2021

Aleksandra Burashnikova, Marianne Clausel, Massih-Reza Amini, Yury Maximov, Nicolas Dante

Figure 1 for Recommender systems: when memory matters

Figure 2 for Recommender systems: when memory matters

Figure 3 for Recommender systems: when memory matters

Figure 4 for Recommender systems: when memory matters

Abstract:In this paper, we study the effect of long memory in the learnability of a sequential recommender system including users' implicit feedback. We propose an online algorithm, where model parameters are updated user per user over blocks of items constituted by a sequence of unclicked items followed by a clicked one. We illustrate through thorough empirical evaluations that filtering users with respect to the degree of long memory contained in their interactions with the system allows to substantially gain in performance with respect to MAP and NDCG, especially in the context of training large-scale Recommender Systems.

* Accepted to the 44-th European Conference on Information Retrieval (ECIR), 2022. arXiv admin note: text overlap with arXiv:2012.06910

Via

Access Paper or Ask Questions