Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhiyi Chen

Deep Koopman-based Control of Quality Variation in Multistage Manufacturing Systems

Jul 24, 2024

Zhiyi Chen, Harshal Maske, Devesh Upadhyay, Huanyi Shui, Xun Huan, Jun Ni

Abstract:This paper presents a modeling-control synthesis to address the quality control challenges in multistage manufacturing systems (MMSs). A new feedforward control scheme is developed to minimize the quality variations caused by process disturbances in MMSs. Notably, the control framework leverages a stochastic deep Koopman (SDK) model to capture the quality propagation mechanism in the MMSs, highlighted by its ability to transform the nonlinear propagation dynamics into a linear one. Two roll-to-roll case studies are presented to validate the proposed method and demonstrate its effectiveness. The overall method is suitable for nonlinear MMSs and does not require extensive expert knowledge.

* The paper was in the proceeding of 2024 American Control Conference. This submitted version addresses a minor correction to one equation (Eq. 14), while the results and conclusions remain the same

Via

Access Paper or Ask Questions

Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Apr 02, 2024

Xu He, Qiaochu Huang, Zhensong Zhang, Zhiwei Lin, Zhiyong Wu, Sicheng Yang, Minglei Li, Zhiyi Chen, Songcen Xu, Xiaofei Wu

Figure 1 for Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Figure 2 for Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Figure 3 for Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Figure 4 for Co-Speech Gesture Video Generation via Motion-Decoupled Diffusion Model

Abstract:Co-speech gestures, if presented in the lively form of videos, can achieve superior visual effects in human-machine interaction. While previous works mostly generate structural human skeletons, resulting in the omission of appearance information, we focus on the direct generation of audio-driven co-speech gesture videos in this work. There are two main challenges: 1) A suitable motion feature is needed to describe complex human movements with crucial appearance information. 2) Gestures and speech exhibit inherent dependencies and should be temporally aligned even of arbitrary length. To solve these problems, we present a novel motion-decoupled framework to generate co-speech gesture videos. Specifically, we first introduce a well-designed nonlinear TPS transformation to obtain latent motion features preserving essential appearance information. Then a transformer-based diffusion model is proposed to learn the temporal correlation between gestures and speech, and performs generation in the latent motion space, followed by an optimal motion selection module to produce long-term coherent and consistent gesture videos. For better visual perception, we further design a refinement network focusing on missing details of certain areas. Extensive experimental results show that our proposed framework significantly outperforms existing approaches in both motion and video-related evaluations. Our code, demos, and more resources are available at https://github.com/thuhcsi/S2G-MDDiffusion.

* 22 pages, 8 figures, CVPR 2024

Via

Access Paper or Ask Questions

Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

Sep 18, 2023

Zhiyi Chen, Harshal Maske, Huanyi Shui, Devesh Upadhyay, Michael Hopka, Joseph Cohen, Xingjian Lai, Xun Huan, Jun Ni

Figure 1 for Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

Figure 2 for Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

Figure 3 for Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

Figure 4 for Stochastic Deep Koopman Model for Quality Propagation Analysis in Multistage Manufacturing Systems

Abstract:The modeling of multistage manufacturing systems (MMSs) has attracted increased attention from both academia and industry. Recent advancements in deep learning methods provide an opportunity to accomplish this task with reduced cost and expertise. This study introduces a stochastic deep Koopman (SDK) framework to model the complex behavior of MMSs. Specifically, we present a novel application of Koopman operators to propagate critical quality information extracted by variational autoencoders. Through this framework, we can effectively capture the general nonlinear evolution of product quality using a transferred linear representation, thus enhancing the interpretability of the data-driven model. To evaluate the performance of the SDK framework, we carried out a comparative study on an open-source dataset. The main findings of this paper are as follows. Our results indicate that SDK surpasses other popular data-driven models in accuracy when predicting stagewise product quality within the MMS. Furthermore, the unique linear propagation property in the stochastic latent space of SDK enables traceability for quality evolution throughout the process, thereby facilitating the design of root cause analysis schemes. Notably, the proposed framework requires minimal knowledge of the underlying physics of production lines. It serves as a virtual metrology tool that can be applied to various MMSs, contributing to the ultimate goal of Zero Defect Manufacturing.

Via

Access Paper or Ask Questions

DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data

Aug 20, 2023

Peng Li, Zhiyi Chen, Xu Chu, Kexin Rong

Abstract:Data preprocessing is a crucial step in the machine learning process that transforms raw data into a more usable format for downstream ML models. However, it can be costly and time-consuming, often requiring the expertise of domain experts. Existing automated machine learning (AutoML) frameworks claim to automate data preprocessing. However, they often use a restricted search space of data preprocessing pipelines which limits the potential performance gains, and they are often too slow as they require training the ML model multiple times. In this paper, we propose DiffPrep, a method that can automatically and efficiently search for a data preprocessing pipeline for a given tabular dataset and a differentiable ML model such that the performance of the ML model is maximized. We formalize the problem of data preprocessing pipeline search as a bi-level optimization problem. To solve this problem efficiently, we transform and relax the discrete, non-differential search space into a continuous and differentiable one, which allows us to perform the pipeline search using gradient descent with training the ML model only once. Our experiments show that DiffPrep achieves the best test accuracy on 15 out of the 18 real-world datasets evaluated and improves the model's test accuracy by up to 6.6 percentage points.

* ACM SIGMOD 2023
* Published at SIGMOD 2023

Via

Access Paper or Ask Questions

Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning

Aug 21, 2021

Zhiyi Chen, Tong Lin

Figure 1 for Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning

Figure 2 for Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning

Figure 3 for Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning

Figure 4 for Principal Gradient Direction and Confidence Reservoir Sampling for Continual Learning

Abstract:Task-free online continual learning aims to alleviate catastrophic forgetting of the learner on a non-iid data stream. Experience Replay (ER) is a SOTA continual learning method, which is broadly used as the backbone algorithm for other replay-based methods. However, the training strategy of ER is too simple to take full advantage of replayed examples and its reservoir sampling strategy is also suboptimal. In this work, we propose a general proximal gradient framework so that ER can be viewed as a special case. We further propose two improvements accordingly: Principal Gradient Direction (PGD) and Confidence Reservoir Sampling (CRS). In Principal Gradient Direction, we optimize a target gradient that not only represents the major contribution of past gradients, but also retains the new knowledge of the current gradient. We then present Confidence Reservoir Sampling for maintaining a more informative memory buffer based on a margin-based metric that measures the value of stored examples. Experiments substantiate the effectiveness of both our improvements and our new algorithm consistently boosts the performance of MIR-replay, a SOTA ER-based method: our algorithm increases the average accuracy up to 7.9% and reduces forgetting up to 15.4% on four datasets.

Via

Access Paper or Ask Questions

Contrastive Rendering for Ultrasound Image Segmentation

Oct 10, 2020

Haoming Li, Xin Yang, Jiamin Liang, Wenlong Shi, Chaoyu Chen, Haoran Dou, Rui Li, Rui Gao, Guangquan Zhou, Jinghui Fang(+5 more)

Figure 1 for Contrastive Rendering for Ultrasound Image Segmentation

Figure 2 for Contrastive Rendering for Ultrasound Image Segmentation

Figure 3 for Contrastive Rendering for Ultrasound Image Segmentation

Figure 4 for Contrastive Rendering for Ultrasound Image Segmentation

Abstract:Ultrasound (US) image segmentation embraced its significant improvement in deep learning era. However, the lack of sharp boundaries in US images still remains an inherent challenge for segmentation. Previous methods often resort to global context, multi-scale cues or auxiliary guidance to estimate the boundaries. It is hard for these methods to approach pixel-level learning for fine-grained boundary generating. In this paper, we propose a novel and effective framework to improve boundary estimation in US images. Our work has three highlights. First, we propose to formulate the boundary estimation as a rendering task, which can recognize ambiguous points (pixels/voxels) and calibrate the boundary prediction via enriched feature representation learning. Second, we introduce point-wise contrastive learning to enhance the similarity of points from the same class and contrastively decrease the similarity of points from different classes. Boundary ambiguities are therefore further addressed. Third, both rendering and contrastive learning tasks contribute to consistent improvement while reducing network parameters. As a proof-of-concept, we performed validation experiments on a challenging dataset of 86 ovarian US volumes. Results show that our proposed method outperforms state-of-the-art methods and has the potential to be used in clinical practice.

* 10 pages, 5 figures, 2 tables, 13 references

Via

Access Paper or Ask Questions

Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Apr 01, 2020

Jiamin Liang, Xin Yang, Haoming Li, Yi Wang, Manh The Van, Haoran Dou, Chaoyu Chen, Jinghui Fang, Xiaowen Liang, Zixin Mai(+3 more)

Figure 1 for Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Figure 2 for Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Figure 3 for Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Figure 4 for Synthesis and Edition of Ultrasound Images via Sketch Guided Progressive Growing GANs

Abstract:Ultrasound (US) is widely accepted in clinic for anatomical structure inspection. However, lacking in resources to practice US scan, novices often struggle to learn the operation skills. Also, in the deep learning era, automated US image analysis is limited by the lack of annotated samples. Efficiently synthesizing realistic, editable and high resolution US images can solve the problems. The task is challenging and previous methods can only partially complete it. In this paper, we devise a new framework for US image synthesis. Particularly, we firstly adopt a sketch generative adversarial networks (Sgan) to introduce background sketch upon object mask in a conditioned generative adversarial network. With enriched sketch cues, Sgan can generate realistic US images with editable and fine-grained structure details. Although effective, Sgan is hard to generate high resolution US images. To achieve this, we further implant the Sgan into a progressive growing scheme (PGSgan). By smoothly growing both generator and discriminator, PGSgan can gradually synthesize US images from low to high resolution. By synthesizing ovary and follicle US images, our extensive perceptual evaluation, user study and segmentation results prove the promising efficacy and efficiency of the proposed PGSgan.

* IEEE International Symposium on Biomedical Imaging (IEEE ISBI 2020)

Via

Access Paper or Ask Questions

Accelerating System Log Processing by Semi-supervised Learning: A Technical Report

Oct 29, 2018

Guofu Li, Pengjia Zhu, Zhiyi Chen

Figure 1 for Accelerating System Log Processing by Semi-supervised Learning: A Technical Report

Figure 2 for Accelerating System Log Processing by Semi-supervised Learning: A Technical Report

Figure 3 for Accelerating System Log Processing by Semi-supervised Learning: A Technical Report

Figure 4 for Accelerating System Log Processing by Semi-supervised Learning: A Technical Report

Abstract:There is an increasing need for more automated system-log analysis tools for large scale online system in a timely manner. However, conventional way to monitor and classify the log output based on keyword list does not scale well for complex system in which codes contributed by a large group of developers, with diverse ways of encoding the error messages, often with misleading pre-set labels. In this paper, we propose that the design of a large scale online log analysis should follow the "Least Prior Knowledge Principle", in which unsupervised or semi-supervised solution with the minimal prior knowledge of the log should be encoded directly. Thereby, we report our experience in designing a two-stage machine learning based method, in which the system logs are regarded as the output of a quasi-natural language, pre-filtered by a perplexity score threshold, and then undergo a fine-grained classification procedure. Tests on empirical data show that our method has obvious advantage regarding to the processing speed and classification accuracy.

Via

Access Paper or Ask Questions

Security Matters: A Survey on Adversarial Machine Learning

Oct 23, 2018

Guofu Li, Pengjia Zhu, Jin Li, Zhemin Yang, Ning Cao, Zhiyi Chen

Figure 1 for Security Matters: A Survey on Adversarial Machine Learning

Figure 2 for Security Matters: A Survey on Adversarial Machine Learning

Figure 3 for Security Matters: A Survey on Adversarial Machine Learning

Abstract:Adversarial machine learning is a fast growing research area, which considers the scenarios when machine learning systems may face potential adversarial attackers, who intentionally synthesize input data to make a well-trained model to make mistake. It always involves a defending side, usually a classifier, and an attacking side that aims to cause incorrect output. The earliest studies on the adversarial examples for machine learning algorithms start from the information security area, which considers a much wider varieties of attacking methods. But recent research focus that popularized by the deep learning community places strong emphasis on how the "imperceivable" perturbations on the normal inputs may cause dramatic mistakes by the deep learning with supposed super-human accuracy. This paper serves to give a comprehensive introduction to a range of aspects of the adversarial deep learning topic, including its foundations, typical attacking and defending strategies, and some extended studies.

Via

Access Paper or Ask Questions