Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

C. -C. Jay Kuo

TGHop: An Explainable, Efficient and Lightweight Method for Texture Generation

Jul 08, 2021
Xuejing Lei, Ganning Zhao, Kaitai Zhang, C. -C. Jay Kuo

Figure 1 for TGHop: An Explainable, Efficient and Lightweight Method for Texture Generation

Figure 2 for TGHop: An Explainable, Efficient and Lightweight Method for Texture Generation

Figure 3 for TGHop: An Explainable, Efficient and Lightweight Method for Texture Generation

Figure 4 for TGHop: An Explainable, Efficient and Lightweight Method for Texture Generation

An explainable, efficient and lightweight method for texture generation, called TGHop (an acronym of Texture Generation PixelHop), is proposed in this work. Although synthesis of visually pleasant texture can be achieved by deep neural networks, the associated models are large in size, difficult to explain in theory, and computationally expensive in training. In contrast, TGHop is small in its model size, mathematically transparent, efficient in training and inference, and able to generate high quality texture. Given an exemplary texture, TGHop first crops many sample patches out of it to form a collection of sample patches called the source. Then, it analyzes pixel statistics of samples from the source and obtains a sequence of fine-to-coarse subspaces for these patches by using the PixelHop++ framework. To generate texture patches with TGHop, we begin with the coarsest subspace, which is called the core, and attempt to generate samples in each subspace by following the distribution of real samples. Finally, texture patches are stitched to form texture images of a large size. It is demonstrated by experimental results that TGHop can generate texture images of superior quality with a small model size and at a fast speed.

* arXiv admin note: substantial text overlap with arXiv:2009.01376

Via

Access Paper or Ask Questions

E-PixelHop: An Enhanced PixelHop Method for Object Classification

Jul 07, 2021
Yijing Yang, Vasileios Magoulianitis, C. -C. Jay Kuo

Figure 1 for E-PixelHop: An Enhanced PixelHop Method for Object Classification

Figure 2 for E-PixelHop: An Enhanced PixelHop Method for Object Classification

Figure 3 for E-PixelHop: An Enhanced PixelHop Method for Object Classification

Figure 4 for E-PixelHop: An Enhanced PixelHop Method for Object Classification

Based on PixelHop and PixelHop++, which are recently developed using the successive subspace learning (SSL) framework, we propose an enhanced solution for object classification, called E-PixelHop, in this work. E-PixelHop consists of the following steps. First, to decouple the color channels for a color image, we apply principle component analysis and project RGB three color channels onto two principle subspaces which are processed separately for classification. Second, to address the importance of multi-scale features, we conduct pixel-level classification at each hop with various receptive fields. Third, to further improve pixel-level classification accuracy, we develop a supervised label smoothing (SLS) scheme to ensure prediction consistency. Forth, pixel-level decisions from each hop and from each color subspace are fused together for image-level decision. Fifth, to resolve confusing classes for further performance boosting, we formulate E-PixelHop as a two-stage pipeline. In the first stage, multi-class classification is performed to get a soft decision for each class, where the top 2 classes with the highest probabilities are called confusing classes. Then,we conduct a binary classification in the second stage. The main contributions lie in Steps 1, 3 and 5.We use the classification of the CIFAR-10 dataset as an example to demonstrate the effectiveness of the above-mentioned key components of E-PixelHop.

* 12 pages, 7 figures

Via

Access Paper or Ask Questions

AnomalyHop: An SSL-based Image Anomaly Localization Method

May 08, 2021
Kaitai Zhang, Bin Wang, Wei Wang, Fahad Sohrab, Moncef Gabbouj, C. -C. Jay Kuo

Figure 1 for AnomalyHop: An SSL-based Image Anomaly Localization Method

Figure 2 for AnomalyHop: An SSL-based Image Anomaly Localization Method

Figure 3 for AnomalyHop: An SSL-based Image Anomaly Localization Method

Figure 4 for AnomalyHop: An SSL-based Image Anomaly Localization Method

An image anomaly localization method based on the successive subspace learning (SSL) framework, called AnomalyHop, is proposed in this work. AnomalyHop consists of three modules: 1) feature extraction via successive subspace learning (SSL), 2) normality feature distributions modeling via Gaussian models, and 3) anomaly map generation and fusion. Comparing with state-of-the-art image anomaly localization methods based on deep neural networks (DNNs), AnomalyHop is mathematically transparent, easy to train, and fast in its inference speed. Besides, its area under the ROC curve (ROC-AUC) performance on the MVTec AD dataset is 95.9%, which is among the best of several benchmarking methods. Our codes are publicly available at Github.

* 5 pages, 3 figures

Via

Access Paper or Ask Questions

Dynamic Texture Synthesis by Incorporating Long-range Spatial and Temporal Correlations

Apr 14, 2021
Kaitai Zhang, Bin Wang, Hong-Shuo Chen, Ye Wang, Shiyu Mou, C. -C. Jay Kuo

Figure 1 for Dynamic Texture Synthesis by Incorporating Long-range Spatial and Temporal Correlations

Figure 2 for Dynamic Texture Synthesis by Incorporating Long-range Spatial and Temporal Correlations

Figure 3 for Dynamic Texture Synthesis by Incorporating Long-range Spatial and Temporal Correlations

Figure 4 for Dynamic Texture Synthesis by Incorporating Long-range Spatial and Temporal Correlations

The main challenge of dynamic texture synthesis lies in how to maintain spatial and temporal consistency in synthesized videos. The major drawback of existing dynamic texture synthesis models comes from poor treatment of the long-range texture correlation and motion information. To address this problem, we incorporate a new loss term, called the Shifted Gram loss, to capture the structural and long-range correlation of the reference texture video. Furthermore, we introduce a frame sampling strategy to exploit long-period motion across multiple frames. With these two new techniques, the application scope of existing texture synthesis models can be extended. That is, they can synthesize not only homogeneous but also structured dynamic texture patterns. Thorough experimental results are provided to demonstrate that our proposed dynamic texture synthesis model offers state-of-the-art visual performance.

* 7 pages, 6 figures

Via

Access Paper or Ask Questions

Dynamic Texture Synthesis By Incorporating Long-range Spatial and Temporal Correlations

Apr 13, 2021
Kaitai Zhang, Bin Wang, Hong-Shuo Chen, Ye Wang, Shiyu Mou, C. -C. Jay Kuo

* 7 pages, 6 figures

Via

Access Paper or Ask Questions

CalibDNN: Multimodal Sensor Calibration for Perception Using Deep Neural Networks

Mar 27, 2021
Ganning Zhao, Jiesi Hu, Suya You, C. -C. Jay Kuo

Current perception systems often carry multimodal imagers and sensors such as 2D cameras and 3D LiDAR sensors. To fuse and utilize the data for downstream perception tasks, robust and accurate calibration of the multimodal sensor data is essential. We propose a novel deep learning-driven technique (CalibDNN) for accurate calibration among multimodal sensor, specifically LiDAR-Camera pairs. The key innovation of the proposed work is that it does not require any specific calibration targets or hardware assistants, and the entire processing is fully automatic with a single model and single iteration. Results comparison among different methods and extensive experiments on different datasets demonstrates the state-of-the-art performance.

Via

Access Paper or Ask Questions

R-PointHop: A Green, Accurate and Unsupervised Point Cloud Registration Method

Mar 15, 2021
Pranav Kadam, Min Zhang, Shan Liu, C. -C. Jay Kuo

Figure 1 for R-PointHop: A Green, Accurate and Unsupervised Point Cloud Registration Method

Figure 2 for R-PointHop: A Green, Accurate and Unsupervised Point Cloud Registration Method

Figure 3 for R-PointHop: A Green, Accurate and Unsupervised Point Cloud Registration Method

Figure 4 for R-PointHop: A Green, Accurate and Unsupervised Point Cloud Registration Method

Inspired by the recent PointHop classification method, an unsupervised 3D point cloud registration method, called R-PointHop, is proposed in this work. R-PointHop first determines a local reference frame (LRF) for every point using its nearest neighbors and finds its local attributes. Next, R-PointHop obtains local-to-global hierarchical features by point downsampling, neighborhood expansion, attribute construction and dimensionality reduction steps. Thus, we can build the correspondence of points in the hierarchical feature space using the nearest neighbor rule. Afterwards, a subset of salient points of good correspondence is selected to estimate the 3D transformation. The use of LRF allows for hierarchical features of points to be invariant with respect to rotation and translation, thus making R-PointHop more robust in building point correspondence even when rotation angles are large. Experiments are conducted on the ModelNet40 and the Stanford Bunny dataset, which demonstrate the effectiveness of R-PointHop on the 3D point cloud registration task. R-PointHop is a green and accurate solution since its model size and training time are smaller than those of deep learning methods by an order of magnitude while its registration errors are smaller. Our codes are available on GitHub.

* 13 pages, 11 figures

Via

Access Paper or Ask Questions

DefakeHop: A Light-Weight High-Performance Deepfake Detector

Mar 11, 2021
Hong-Shuo Chen, Mozhdeh Rouhsedaghat, Hamza Ghani, Shuowen Hu, Suya You, C. -C. Jay Kuo

Figure 1 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 2 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 3 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

Figure 4 for DefakeHop: A Light-Weight High-Performance Deepfake Detector

A light-weight high-performance Deepfake detection method, called DefakeHop, is proposed in this work. State-of-the-art Deepfake detection methods are built upon deep neural networks. DefakeHop extracts features automatically using the successive subspace learning (SSL) principle from various parts of face images. The features are extracted by c/w Saab transform and further processed by our feature distillation module using spatial dimension reduction and soft classification for each channel to get a more concise description of the face. Extensive experiments are conducted to demonstrate the effectiveness of the proposed DefakeHop method. With a small model size of 42,845 parameters, DefakeHop achieves state-of-the-art performance with the area under the ROC curve (AUC) of 100%, 94.95%, and 90.56% on UADFV, Celeb-DF v1 and Celeb-DF v2 datasets, respectively.

* Accepted at ICME 2021

Via

Access Paper or Ask Questions

Successive Subspace Learning: An Overview

Feb 27, 2021
Mozhdeh Rouhsedaghat, Masoud Monajatipoor, Zohreh Azizi, C. -C. Jay Kuo

Figure 1 for Successive Subspace Learning: An Overview

Successive Subspace Learning (SSL) offers a light-weight unsupervised feature learning method based on inherent statistical properties of data units (e.g. image pixels and points in point cloud sets). It has shown promising results, especially on small datasets. In this paper, we intuitively explain this method, provide an overview of its development, and point out some open questions and challenges for future research.

* 4 pages, 1 figure

Via

Access Paper or Ask Questions

A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

Jan 31, 2021
Yifan Wang, Zhanxuan Mei, Chia-Yang Tsai, Ioannis Katsavounidis, C. -C. Jay Kuo

Figure 1 for A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

Figure 2 for A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

Figure 3 for A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

Figure 4 for A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video coding standards without taking the quantization effect into account. Yet, the distribution of quantized DCT coefficients deviate from that of original DCT coefficients. This is particularly obvious when the quality factor of JPEG compressed images is small. To address this problem, we first use a set of training images to learn the compound effect of forward DCT, quantization and dequantization in cascade. Then, a new IDCT kernel is learned to reverse the effect of such a pipeline. Experiments are conducted to demonstrate that the advantage of the new method, which has a gain of 0.11-0.30dB over the standard JPEG over a wide range of quality factors.

* conference

Via

Access Paper or Ask Questions