Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Raja Giryes

School of Electrical Engineering, Tel Aviv University, Tel Aviv, Israel

ProCST: Boosting Semantic Segmentation using Progressive Cyclic Style-Transfer

Apr 25, 2022

Shahaf Ettedgui, Shady Abu-Hussein, Raja Giryes

Figure 1 for ProCST: Boosting Semantic Segmentation using Progressive Cyclic Style-Transfer

Figure 2 for ProCST: Boosting Semantic Segmentation using Progressive Cyclic Style-Transfer

Figure 3 for ProCST: Boosting Semantic Segmentation using Progressive Cyclic Style-Transfer

Figure 4 for ProCST: Boosting Semantic Segmentation using Progressive Cyclic Style-Transfer

Abstract:Using synthetic data for training neural networks that achieve good performance on real-world data is an important task as it has the potential to reduce the need for costly data annotation. Yet, a network that is trained on synthetic data alone does not perform well on real data due to the domain gap between the two. Reducing this gap, also known as domain adaptation, has been widely studied in recent years. In the unsupervised domain adaptation (UDA) framework, unlabeled real data is used during training with labeled synthetic data to obtain a neural network that performs well on real data. In this work, we focus on image data. For the semantic segmentation task, it has been shown that performing image-to-image translation from source to target, and then training a network for segmentation on source annotations - leads to poor results. Therefore a joint training of both is essential, which has been a common practice in many techniques. Yet, closing the large domain gap between the source and the target by directly performing the adaptation between the two is challenging. In this work, we propose a novel two-stage framework for improving domain adaptation techniques. In the first step, we progressively train a multi-scale neural network to perform an initial transfer between the source data to the target data. We denote the new transformed data as "Source in Target" (SiT). Then, we use the generated SiT data as the input to any standard UDA approach. This new data has a reduced domain gap from the desired target domain, and the applied UDA approach further closes the gap. We demonstrate the improvement achieved by our framework with two state-of-the-art methods for semantic segmentation, DAFormer and ProDA, on two UDA tasks, GTA5 to Cityscapes and Synthia to Cityscapes. Code and state-of-the-art checkpoints of ProCST+DAFormer are provided.

* Code available at https://github.com/shahaf1313/ProCST

Via

Access Paper or Ask Questions

Stress-Testing LiDAR Registration

Apr 16, 2022

Amnon Drory, Shai Avidan, Raja Giryes

Figure 1 for Stress-Testing LiDAR Registration

Figure 2 for Stress-Testing LiDAR Registration

Figure 3 for Stress-Testing LiDAR Registration

Figure 4 for Stress-Testing LiDAR Registration

Abstract:Point cloud registration (PCR) is an important task in many fields including autonomous driving with LiDAR sensors. PCR algorithms have improved significantly in recent years, by combining deep-learned features with robust estimation methods. These algorithms succeed in scenarios such as indoor scenes and object models registration. However, testing in the automotive LiDAR setting, which presents its own challenges, has been limited. The standard benchmark for this setting, KITTI-10m, has essentially been saturated by recent algorithms: many of them achieve near-perfect recall. In this work, we stress-test recent PCR techniques with LiDAR data. We propose a method for selecting balanced registration sets, which are challenging sets of frame-pairs from LiDAR datasets. They contain a balanced representation of the different relative motions that appear in a dataset, i.e. small and large rotations, small and large offsets in space and time, and various combinations of these. We perform a thorough comparison of accuracy and run-time on these benchmarks. Perhaps unexpectedly, we find that the fastest and simultaneously most accurate approach is a version of advanced RANSAC. We further improve results with a novel pre-filtering method.

Via

Access Paper or Ask Questions

Denoiser-based projections for 2-D super-resolution multi-reference alignment

Apr 10, 2022

Jonathan Shani, Raja Giryes, Tamir Bendory

Figure 1 for Denoiser-based projections for 2-D super-resolution multi-reference alignment

Figure 2 for Denoiser-based projections for 2-D super-resolution multi-reference alignment

Figure 3 for Denoiser-based projections for 2-D super-resolution multi-reference alignment

Figure 4 for Denoiser-based projections for 2-D super-resolution multi-reference alignment

Abstract:We study the 2-D super-resolution multi-reference alignment (SR-MRA) problem: estimating an image from its down-sampled, circularly-translated, and noisy copies. The SR-MRA problem serves as a mathematical abstraction of the structure determination problem for biological molecules. Since the SR-MRA problem is ill-posed without prior knowledge, accurate image estimation relies on designing priors that well-describe the statistics of the images of interest. In this work, we build on recent advances in image processing, and harness the power of denoisers as priors of images. In particular, we suggest to use denoisers as projections, and design two computational frameworks to estimate the image: projected expectation-maximization and projected method of moments. We provide an efficient GPU implementation, and demonstrate the effectiveness of these algorithms by extensive numerical experiments on a wide range of parameters and images.

Via

Access Paper or Ask Questions

Shallow Transits -- Deep Learning II: Identify Individual Exoplanetary Transits in Red Noise using Deep Learning

Mar 15, 2022

Elad Dvash, Yam Peleg, Shay Zucker, Raja Giryes

Figure 1 for Shallow Transits -- Deep Learning II: Identify Individual Exoplanetary Transits in Red Noise using Deep Learning

Figure 2 for Shallow Transits -- Deep Learning II: Identify Individual Exoplanetary Transits in Red Noise using Deep Learning

Figure 3 for Shallow Transits -- Deep Learning II: Identify Individual Exoplanetary Transits in Red Noise using Deep Learning

Figure 4 for Shallow Transits -- Deep Learning II: Identify Individual Exoplanetary Transits in Red Noise using Deep Learning

Abstract:In a previous paper, we have introduced a deep learning neural network that should be able to detect the existence of very shallow periodic planetary transits in the presence of red noise. The network in that feasibility study would not provide any further details about the detected transits. The current paper completes this missing part. We present a neural network that tags samples that were obtained during transits. This is essentially similar to the task of identifying the semantic context of each pixel in an image -- an important task in computer vision, called `semantic segmentation', which is often performed by deep neural networks. The neural network we present makes use of novel deep learning concepts such as U-Nets, Generative Adversarial Networks (GAN), and adversarial loss. The resulting segmentation should allow further studies of the light curves which are tagged as containing transits. This approach towards the detection and study of very shallow transits is bound to play a significant role in future space-based transit surveys such as PLATO, which are specifically aimed to detect those extremely difficult cases of long-period shallow transits. Our segmentation network also adds to the growing toolbox of deep learning approaches which are being increasingly used in the study of exoplanets, but so far mainly for vetting transits, rather than their initial detection.

* 16 pages, 14 figures, accepted for publication in the Astronomical Journal

Via

Access Paper or Ask Questions

Generative Adversarial Networks

Mar 01, 2022

Gilad Cohen, Raja Giryes

Figure 1 for Generative Adversarial Networks

Figure 2 for Generative Adversarial Networks

Figure 3 for Generative Adversarial Networks

Figure 4 for Generative Adversarial Networks

Abstract:Generative Adversarial Networks (GANs) are very popular frameworks for generating high-quality data, and are immensely used in both the academia and industry in many domains. Arguably, their most substantial impact has been in the area of computer vision, where they achieve state-of-the-art image generation. This chapter gives an introduction to GANs, by discussing their principle mechanism and presenting some of their inherent problems during training and evaluation. We focus on these three issues: (1) mode collapse, (2) vanishing gradients, and (3) generation of low-quality images. We then list some architecture-variant and loss-variant GANs that remedy the above challenges. Lastly, we present two utilization examples of GANs for real-world applications: Data augmentation and face images generation.

Via

Access Paper or Ask Questions

SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Jan 31, 2022

Amir Hertz, Or Perel, Raja Giryes, Olga Sorkine-Hornung, Daniel Cohen-Or

Figure 1 for SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Figure 2 for SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Figure 3 for SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Figure 4 for SPAGHETTI: Editing Implicit Shapes Through Part Aware Generation

Abstract:Neural implicit fields are quickly emerging as an attractive representation for learning based techniques. However, adopting them for 3D shape modeling and editing is challenging. We introduce a method for $\mathbf{E}$diting $\mathbf{I}$mplicit $\mathbf{S}$hapes $\mathbf{T}$hrough $\mathbf{P}$art $\mathbf{A}$ware $\mathbf{G}$enera$\mathbf{T}$ion, permuted in short as SPAGHETTI. Our architecture allows for manipulation of implicit shapes by means of transforming, interpolating and combining shape segments together, without requiring explicit part supervision. SPAGHETTI disentangles shape part representation into extrinsic and intrinsic geometric information. This characteristic enables a generative framework with part-level control. The modeling capabilities of SPAGHETTI are demonstrated using an interactive graphical interface, where users can directly edit neural implicit shapes.

Via

Access Paper or Ask Questions

Extending the Vocabulary of Fictional Languages using Neural Networks

Jan 18, 2022

Thomas Zacharias, Ashutosh Taklikar, Raja Giryes

Abstract:Fictional languages have become increasingly popular over the recent years appearing in novels, movies, TV shows, comics, and video games. While some of these fictional languages have a complete vocabulary, most do not. We propose a deep learning solution to the problem. Using style transfer and machine translation tools, we generate new words for a given target fictional language, while maintaining the style of its creator, hence extending this language vocabulary.

* 10 pages, 1 figure, NeurIPS Workshop on Machine Learning for Creativity and Design 2021

Via

Access Paper or Ask Questions

DeepMLS: Geometry-Aware Control Point Deformation

Jan 05, 2022

Meitar Shechter, Rana Hanocka, Gal Metzer, Raja Giryes, Daniel Cohen-Or

Figure 1 for DeepMLS: Geometry-Aware Control Point Deformation

Figure 2 for DeepMLS: Geometry-Aware Control Point Deformation

Figure 3 for DeepMLS: Geometry-Aware Control Point Deformation

Figure 4 for DeepMLS: Geometry-Aware Control Point Deformation

Abstract:We introduce DeepMLS, a space-based deformation technique, guided by a set of displaced control points. We leverage the power of neural networks to inject the underlying shape geometry into the deformation parameters. The goal of our technique is to enable a realistic and intuitive shape deformation. Our method is built upon moving least-squares (MLS), since it minimizes a weighted sum of the given control point displacements. Traditionally, the influence of each control point on every point in space (i.e., the weighting function) is defined using inverse distance heuristics. In this work, we opt to learn the weighting function, by training a neural network on the control points from a single input shape, and exploit the innate smoothness of neural networks. Our geometry-aware control point deformation is agnostic to the surface representation and quality; it can be applied to point clouds or meshes, including non-manifold and disconnected surface soups. We show that our technique facilitates intuitive piecewise smooth deformations, which are well suited for manufactured objects. We show the advantages of our approach compared to existing surface and space-based deformation techniques, both quantitatively and qualitatively.

Via

Access Paper or Ask Questions

Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Dec 28, 2021

Erez Yosef, Shay Elmalem, Raja Giryes

Figure 1 for Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Figure 2 for Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Figure 3 for Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Figure 4 for Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Abstract:Video reconstruction from a single motion-blurred image is a challenging problem, which can enhance existing cameras' capabilities. Recently, several works addressed this task using conventional imaging and deep learning. Yet, such purely-digital methods are inherently limited, due to direction ambiguity and noise sensitivity. Some works proposed to address these limitations using non-conventional image sensors, however, such sensors are extremely rare and expensive. To circumvent these limitations with simpler means, we propose a hybrid optical-digital method for video reconstruction that requires only simple modifications to existing optical systems. We use a learned dynamic phase-coding in the lens aperture during the image acquisition to encode the motion trajectories, which serve as prior information for the video reconstruction process. The proposed computational camera generates a sharp frame burst of the scene at various frame rates from a single coded motion-blurred image, using an image-to-video convolutional neural network. We present advantages and improved performance compared to existing methods, using both simulations and a real-world camera prototype.

Via

Access Paper or Ask Questions

Unsupervised Domain Generalization by Learning a Bridge Across Domains

Dec 04, 2021

Sivan Harary, Eli Schwartz, Assaf Arbelle, Peter Staar, Shady Abu-Hussein, Elad Amrani, Roei Herzig, Amit Alfassy, Raja Giryes, Hilde Kuehne(+4 more)

Figure 1 for Unsupervised Domain Generalization by Learning a Bridge Across Domains

Figure 2 for Unsupervised Domain Generalization by Learning a Bridge Across Domains

Figure 3 for Unsupervised Domain Generalization by Learning a Bridge Across Domains

Figure 4 for Unsupervised Domain Generalization by Learning a Bridge Across Domains

Abstract:The ability to generalize learned representations across significantly different visual domains, such as between real photos, clipart, paintings, and sketches, is a fundamental capacity of the human visual system. In this paper, different from most cross-domain works that utilize some (or full) source domain supervision, we approach a relatively new and very practical Unsupervised Domain Generalization (UDG) setup of having no training supervision in neither source nor target domains. Our approach is based on self-supervised learning of a Bridge Across Domains (BrAD) - an auxiliary bridge domain accompanied by a set of semantics preserving visual (image-to-image) mappings to BrAD from each of the training domains. The BrAD and mappings to it are learned jointly (end-to-end) with a contrastive self-supervised representation model that semantically aligns each of the domains to its BrAD-projection, and hence implicitly drives all the domains (seen or unseen) to semantically align to each other. In this work, we show how using an edge-regularized BrAD our approach achieves significant gains across multiple benchmarks and a range of tasks, including UDG, Few-shot UDA, and unsupervised generalization across multi-domain datasets (including generalization to unseen domains and classes).

Via

Access Paper or Ask Questions