Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Kevin Michalewicz

Protein generation with embedding learning for motif diversification

Oct 21, 2025

Kevin Michalewicz, Chen Jin, Philip Alexander Teare, Tom Diethe, Mauricio Barahona, Barbara Bravi, Asher Mullokandov

Abstract:A fundamental challenge in protein design is the trade-off between generating structural diversity while preserving motif biological function. Current state-of-the-art methods, such as partial diffusion in RFdiffusion, often fail to resolve this trade-off: small perturbations yield motifs nearly identical to the native structure, whereas larger perturbations violate the geometric constraints necessary for biological function. We introduce Protein Generation with Embedding Learning (PGEL), a general framework that learns high-dimensional embeddings encoding sequence and structural features of a target motif in the representation space of a diffusion model's frozen denoiser, and then enhances motif diversity by introducing controlled perturbations in the embedding space. PGEL is thus able to loosen geometric constraints while satisfying typical design metrics, leading to more diverse yet viable structures. We demonstrate PGEL on three representative cases: a monomer, a protein-protein interface, and a cancer-related transcription factor complex. In all cases, PGEL achieves greater structural diversity, better designability, and improved self-consistency, as compared to partial diffusion. Our results establish PGEL as a general strategy for embedding-driven protein generation allowing for systematic, viable diversification of functional motifs.

Via

Access Paper or Ask Questions

Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison

Oct 11, 2024

Daniel Salnikov, Kevin Michalewicz, Dan Leonte

Figure 1 for Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison

Figure 2 for Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison

Figure 3 for Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison

Figure 4 for Lifted Coefficient of Determination: Fast model-free prediction intervals and likelihood-free model comparison

Abstract:We propose the $\textit{lifted linear model}$, and derive model-free prediction intervals that become tighter as the correlation between predictions and observations increases. These intervals motivate the $\textit{Lifted Coefficient of Determination}$, a model comparison criterion for arbitrary loss functions in prediction-based settings, e.g., regression, classification or counts. We extend the prediction intervals to more general error distributions, and propose a fast model-free outlier detection algorithm for regression. Finally, we illustrate the framework via numerical experiments.

* 14 pages, 5 figures

Via

Access Paper or Ask Questions

Deep Learning-based galaxy image deconvolution

Nov 17, 2022

Utsav Akhaury, Jean-Luc Starck, Pascale Jablonka, Frédéric Courbin, Kevin Michalewicz

Figure 1 for Deep Learning-based galaxy image deconvolution

Figure 2 for Deep Learning-based galaxy image deconvolution

Figure 3 for Deep Learning-based galaxy image deconvolution

Figure 4 for Deep Learning-based galaxy image deconvolution

Abstract:With the onset of large-scale astronomical surveys capturing millions of images, there is an increasing need to develop fast and accurate deconvolution algorithms that generalize well to different images. A powerful and accessible deconvolution method would allow for the reconstruction of a cleaner estimation of the sky. The deconvolved images would be helpful to perform photometric measurements to help make progress in the fields of galaxy formation and evolution. We propose a new deconvolution method based on the Learnlet transform. Eventually, we investigate and compare the performance of different Unet architectures and Learnlet for image deconvolution in the astrophysical domain by following a two-step approach: a Tikhonov deconvolution with a closed-form solution, followed by post-processing with a neural network. To generate our training dataset, we extract HST cutouts from the CANDELS survey in the F606W filter (V-band) and corrupt these images to simulate their blurred-noisy versions. Our numerical results based on these simulations show a detailed comparison between the considered methods for different noise levels.

* 15 pages, 5 figures

Via

Access Paper or Ask Questions