Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexandros G. Dimakis

Multiresolution Textual Inversion

Nov 30, 2022
Giannis Daras, Alexandros G. Dimakis

Figure 1 for Multiresolution Textual Inversion

Figure 2 for Multiresolution Textual Inversion

Figure 3 for Multiresolution Textual Inversion

Figure 4 for Multiresolution Textual Inversion

We extend Textual Inversion to learn pseudo-words that represent a concept at different resolutions. This allows us to generate images that use the concept with different levels of detail and also to manipulate different resolutions using language. Once learned, the user can generate images at different levels of agreement to the original concept; "A photo of $S^*(0)$" produces the exact object while the prompt "A photo of $S^*(0.8)$" only matches the rough outlines and colors. Our framework allows us to generate images that use different resolutions of an image (e.g. details, textures, styles) as separate pseudo-words that can be composed in various ways. We open-soure our code in the following URL: https://github.com/giannisdaras/multires_textual_inversion

* Accepted at NeurIPS 2022 Workshop on Score-Based Methods. 5 pages, 4 Figures, work in progress

Via

Access Paper or Ask Questions

Zonotope Domains for Lagrangian Neural Network Verification

Oct 14, 2022
Matt Jordan, Jonathan Hayase, Alexandros G. Dimakis, Sewoong Oh

Figure 1 for Zonotope Domains for Lagrangian Neural Network Verification

Figure 2 for Zonotope Domains for Lagrangian Neural Network Verification

Figure 3 for Zonotope Domains for Lagrangian Neural Network Verification

Figure 4 for Zonotope Domains for Lagrangian Neural Network Verification

Neural network verification aims to provide provable bounds for the output of a neural network for a given input range. Notable prior works in this domain have either generated bounds using abstract domains, which preserve some dependency between intermediate neurons in the network; or framed verification as an optimization problem and solved a relaxation using Lagrangian methods. A key drawback of the latter technique is that each neuron is treated independently, thereby ignoring important neuron interactions. We provide an approach that merges these two threads and uses zonotopes within a Lagrangian decomposition. Crucially, we can decompose the problem of verifying a deep neural network into the verification of many 2-layer neural networks. While each of these problems is provably hard, we provide efficient relaxation methods that are amenable to efficient dual ascent procedures. Our technique yields bounds that improve upon both linear programming and Lagrangian-based verification techniques in both time and bound tightness.

* Accepted into NeurIPS 2022. Code: https://github.com/revbucket/dual-verification

Via

Access Paper or Ask Questions

Soft Diffusion: Score Matching for General Corruptions

Sep 12, 2022
Giannis Daras, Mauricio Delbracio, Hossein Talebi, Alexandros G. Dimakis, Peyman Milanfar

Figure 1 for Soft Diffusion: Score Matching for General Corruptions

Figure 2 for Soft Diffusion: Score Matching for General Corruptions

Figure 3 for Soft Diffusion: Score Matching for General Corruptions

Figure 4 for Soft Diffusion: Score Matching for General Corruptions

We define a broader family of corruption processes that generalizes previously known diffusion models. To reverse these general diffusions, we propose a new objective called Soft Score Matching that provably learns the score function for any linear corruption process and yields state of the art results for CelebA. Soft Score Matching incorporates the degradation process in the network and trains the model to predict a clean image that after corruption matches the diffused observation. We show that our objective learns the gradient of the likelihood under suitable regularity conditions for the family of corruption processes. We further develop a principled way to select the corruption levels for general diffusion processes and a novel sampling method that we call Momentum Sampler. We evaluate our framework with the corruption being Gaussian Blur and low magnitude additive noise. Our method achieves state-of-the-art FID score $1.85$ on CelebA-64, outperforming all previous linear diffusion models. We also show significant computational benefits compared to vanilla denoising diffusion.

* 17 pages, 8 figures, work in progress

Via

Access Paper or Ask Questions

Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

Jun 22, 2022
Giannis Daras, Yuval Dagan, Alexandros G. Dimakis, Constantinos Daskalakis

Figure 1 for Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

Figure 2 for Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

Figure 3 for Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

Figure 4 for Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems

We prove fast mixing and characterize the stationary distribution of the Langevin Algorithm for inverting random weighted DNN generators. This result extends the work of Hand and Voroninski from efficient inversion to efficient posterior sampling. In practice, to allow for increased expressivity, we propose to do posterior sampling in the latent space of a pre-trained generative model. To achieve that, we train a score-based model in the latent space of a StyleGAN-2 and we use it to solve inverse problems. Our framework, Score-Guided Intermediate Layer Optimization (SGILO), extends prior work by replacing the sparsity regularization with a generative prior in the intermediate layer. Experimentally, we obtain significant improvements over the previous state-of-the-art, especially in the low measurement regime.

* Accepted to ICML 2022. 32 pages, 9 Figures

Via

Access Paper or Ask Questions

Discovering the Hidden Vocabulary of DALLE-2

Jun 01, 2022
Giannis Daras, Alexandros G. Dimakis

Figure 1 for Discovering the Hidden Vocabulary of DALLE-2

Figure 2 for Discovering the Hidden Vocabulary of DALLE-2

Figure 3 for Discovering the Hidden Vocabulary of DALLE-2

Figure 4 for Discovering the Hidden Vocabulary of DALLE-2

We discover that DALLE-2 seems to have a hidden vocabulary that can be used to generate images with absurd prompts. For example, it seems that \texttt{Apoploe vesrreaitais} means birds and \texttt{Contarra ccetnxniams luryca tanniounons} (sometimes) means bugs or pests. We find that these prompts are often consistent in isolation but also sometimes in combinations. We present our black-box method to discover words that seem random but have some correspondence to visual concepts. This creates important security and interpretability challenges.

* 6 pages, 4 figures

Via

Access Paper or Ask Questions

Deblurring via Stochastic Refinement

Dec 28, 2021
Jay Whang, Mauricio Delbracio, Hossein Talebi, Chitwan Saharia, Alexandros G. Dimakis, Peyman Milanfar

Figure 1 for Deblurring via Stochastic Refinement

Figure 2 for Deblurring via Stochastic Refinement

Figure 3 for Deblurring via Stochastic Refinement

Figure 4 for Deblurring via Stochastic Refinement

Image deblurring is an ill-posed problem with multiple plausible solutions for a given input image. However, most existing methods produce a deterministic estimate of the clean image and are trained to minimize pixel-level distortion. These metrics are known to be poorly correlated with human perception, and often lead to unrealistic reconstructions. We present an alternative framework for blind deblurring based on conditional diffusion models. Unlike existing techniques, we train a stochastic sampler that refines the output of a deterministic predictor and is capable of producing a diverse set of plausible reconstructions for a given input. This leads to a significant improvement in perceptual quality over existing state-of-the-art methods across multiple standard benchmarks. Our predict-and-refine approach also enables much more efficient sampling compared to typical diffusion models. Combined with a carefully tuned network architecture and inference procedure, our method is competitive in terms of distortion metrics such as PSNR. These results show clear benefits of our diffusion-based method for deblurring and challenge the widely used strategy of producing a single, deterministic reconstruction.

Via

Access Paper or Ask Questions

Solving Inverse Problems with NerfGANs

Dec 16, 2021
Giannis Daras, Wen-Sheng Chu, Abhishek Kumar, Dmitry Lagun, Alexandros G. Dimakis

Figure 1 for Solving Inverse Problems with NerfGANs

Figure 2 for Solving Inverse Problems with NerfGANs

Figure 3 for Solving Inverse Problems with NerfGANs

Figure 4 for Solving Inverse Problems with NerfGANs

We introduce a novel framework for solving inverse problems using NeRF-style generative models. We are interested in the problem of 3-D scene reconstruction given a single 2-D image and known camera parameters. We show that naively optimizing the latent space leads to artifacts and poor novel view rendering. We attribute this problem to volume obstructions that are clear in the 3-D geometry and become visible in the renderings of novel views. We propose a novel radiance field regularization method to obtain better 3-D surfaces and improved novel views given single view observations. Our method naturally extends to general inverse problems including inpainting where one observes only partially a single view. We experimentally evaluate our method, achieving visual improvements and performance boosts over the baselines in a wide range of tasks. Our method achieves $30-40\%$ MSE reduction and $15-25\%$ reduction in LPIPS loss compared to the previous state of the art.

* 16 pages, 18 figures

Via

Access Paper or Ask Questions

Inverse Problems Leveraging Pre-trained Contrastive Representations

Oct 26, 2021
Sriram Ravula, Georgios Smyrnis, Matt Jordan, Alexandros G. Dimakis

Figure 1 for Inverse Problems Leveraging Pre-trained Contrastive Representations

Figure 2 for Inverse Problems Leveraging Pre-trained Contrastive Representations

Figure 3 for Inverse Problems Leveraging Pre-trained Contrastive Representations

Figure 4 for Inverse Problems Leveraging Pre-trained Contrastive Representations

We study a new family of inverse problems for recovering representations of corrupted data. We assume access to a pre-trained representation learning network R(x) that operates on clean images, like CLIP. The problem is to recover the representation of an image R(x), if we are only given a corrupted version A(x), for some known forward operator A. We propose a supervised inversion method that uses a contrastive objective to obtain excellent representations for highly corrupted images. Using a linear probe on our robust representations, we achieve a higher accuracy than end-to-end supervised baselines when classifying images with various types of distortions, including blurring, additive noise, and random pixel masking. We evaluate on a subset of ImageNet and observe that our method is robust to varying levels of distortion. Our method outperforms end-to-end baselines even with a fraction of the labeled data in a wide range of forward operators.

* Initial version. Final version to appear in Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021)

Via

Access Paper or Ask Questions