Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yichen Zhang

Central South University

Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Apr 02, 2022

Yichen Zhang, Yifang Yin, Ying Zhang, Roger Zimmermann

Figure 1 for Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Figure 2 for Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Figure 3 for Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Figure 4 for Mix-up Self-Supervised Learning for Contrast-agnostic Applications

Abstract:Contrastive self-supervised learning has attracted significant research attention recently. It learns effective visual representations from unlabeled data by embedding augmented views of the same image close to each other while pushing away embeddings of different images. Despite its great success on ImageNet classification, COCO object detection, etc., its performance degrades on contrast-agnostic applications, e.g., medical image classification, where all images are visually similar to each other. This creates difficulties in optimizing the embedding space as the distance between images is rather small. To solve this issue, we present the first mix-up self-supervised learning framework for contrast-agnostic applications. We address the low variance across images based on cross-domain mix-up and build the pretext task based on two synergistic objectives: image reconstruction and transparency prediction. Experimental results on two benchmark datasets validate the effectiveness of our method, where an improvement of 2.5% ~ 7.4% in top-1 accuracy was obtained compared to existing self-supervised learning methods.

* Accepted by ICME 2021

Via

Access Paper or Ask Questions

Neither Fast Nor Slow: How to Fly Through Narrow Tunnels

Jan 10, 2022

Luqi Wang, Hao Xu, Yichen Zhang, Shaojie Shen

Figure 1 for Neither Fast Nor Slow: How to Fly Through Narrow Tunnels

Figure 2 for Neither Fast Nor Slow: How to Fly Through Narrow Tunnels

Figure 3 for Neither Fast Nor Slow: How to Fly Through Narrow Tunnels

Figure 4 for Neither Fast Nor Slow: How to Fly Through Narrow Tunnels

Abstract:Nowadays, multirotors are playing important roles in abundant types of missions. During these missions, entering confined and narrow tunnels that are barely accessible to humans is desirable yet extremely challenging for multirotors. The restricted space and significant ego airflow disturbances induce control issues at both fast and slow flight speeds, meanwhile bringing about problems in state estimation and perception. Thus, a smooth trajectory at a proper speed is necessary for safe tunnel flights. To address these challenges, in this letter, a complete autonomous aerial system that can fly smoothly through tunnels with dimensions narrow to 0.6 m is presented. The system contains a motion planner that generates smooth mini-jerk trajectories along the tunnel center lines, which are extracted according to the map and Euclidean Distance Field (EDF), and its practical speed range is obtained through computational fluid dynamics (CFD) and flight data analyses. Extensive flight experiments on the quadrotor are conducted inside multiple narrow tunnels to validate the planning framework as well as the robustness of the whole system.

* 8 pages, 9 figures, submitted to IEEE RA-L

Via

Access Paper or Ask Questions

Unsupervised data augmentation for object detection

Apr 30, 2021

Yichen Zhang, Zeyang Song, Wenbo Li

Figure 1 for Unsupervised data augmentation for object detection

Figure 2 for Unsupervised data augmentation for object detection

Figure 3 for Unsupervised data augmentation for object detection

Figure 4 for Unsupervised data augmentation for object detection

Abstract:Data augmentation has always been an effective way to overcome overfitting issue when the dataset is small. There are already lots of augmentation operations such as horizontal flip, random crop or even Mixup. However, unlike image classification task, we cannot simply perform these operations for object detection task because of the lack of labeled bounding boxes information for corresponding generated images. To address this challenge, we propose a framework making use of Generative Adversarial Networks(GAN) to perform unsupervised data augmentation. To be specific, based on the recently supreme performance of YOLOv4, we propose a two-step pipeline that enables us to generate an image where the object lies in a certain position. In this way, we can accomplish the goal that generating an image with bounding box label.

Via

Access Paper or Ask Questions

Omni-swarm: A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm

Apr 04, 2021

Hao Xu, Yichen Zhang, Boyu Zhou, Luqi Wang, Xinjie Yao, Guotao Meng, Shaojie Shen

Figure 1 for Omni-swarm: A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm

Figure 2 for Omni-swarm: A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm

Figure 3 for Omni-swarm: A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm

Figure 4 for Omni-swarm: A Decentralized Omnidirectional Visual-Inertial-UWB State Estimation System for Aerial Swarm

Abstract:The decentralized state estimation is one of the most fundamental components for autonomous aerial swarm systems in GPS-denied areas, which still remains a highly challenging research topic. To address this research niche, the Omni-swarm, a decentralized omnidirectional visual-inertial-UWB state estimation system for the aerial swarm is proposed in this paper. In order to solve the issues of observability, complicated initialization, insufficient accuracy and lack of global consistency, we introduce an omnidirectional perception system as the front-end of the Omni-swarm, consisting of omnidirectional sensors, which includes stereo fisheye cameras and ultra-wideband (UWB) sensors, and algorithms, which includes fisheye visual inertial odometry (VIO), multi-drone map-based localization and visual object detector. A graph-based optimization and forward propagation working as the back-end of the Omni-swarm to fuse the measurements from the front-end. According to the experiment result, the proposed decentralized state estimation method on the swarm system achieves centimeter-level relative state estimation accuracy while ensuring global consistency. Moreover, supported by the Omni-swarm, inter-drone collision avoidance can be accomplished in a whole decentralized scheme without any external device, demonstrating the potential of Omni-swarm to be the foundation of autonomous aerial swarm flights in different scenarios.

Via

Access Paper or Ask Questions

Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

Mar 26, 2021

Zhe Xu, Yichen Zhang

Figure 1 for Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

Figure 2 for Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

Figure 3 for Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

Figure 4 for Provably Correct Controller Synthesis of Switched Stochastic Systems with Metric Temporal Logic Specifications: A Case Study on Power Systems

Abstract:In this paper, we present a provably correct controller synthesis approach for switched stochastic control systems with metric temporal logic (MTL) specifications with provable probabilistic guarantees. We first present the stochastic control bisimulation function for switched stochastic control systems, which bounds the trajectory divergence between the switched stochastic control system and its nominal deterministic control system in a probabilistic fashion. We then develop a method to compute optimal control inputs by solving an optimization problem for the nominal trajectory of the deterministic control system with robustness against initial state variations and stochastic uncertainties. We implement our robust stochastic controller synthesis approach on both a four-bus power system and a nine-bus power system under generation loss disturbances, with MTL specifications expressing requirements for the grid frequency deviations, wind turbine generator rotor speed variations and the power flow constraints at different power lines.

* arXiv admin note: substantial text overlap with arXiv:1911.11347

Via

Access Paper or Ask Questions

Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Feb 18, 2021

Yichen Zhang, Hantao Cui, Jianzhe Liu, Feng Qiu, Tianqi Hong, Rui Yao, Fangxing Li

Figure 1 for Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Figure 2 for Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Figure 3 for Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Figure 4 for Encoding Frequency Constraints in Preventive Unit Commitment Using Deep Learning with Region-of-Interest Active Sampling

Abstract:With the increasing penetration of renewable energy, frequency response and its security are of significant concerns for reliable power system operations. Frequency-constrained unit commitment (FCUC) is proposed to address this challenge. Despite existing efforts in modeling frequency characteristics in unit commitment (UC), current strategies can only handle oversimplified low-order frequency response models and do not consider wide-range operating conditions. This paper presents a generic data-driven framework for FCUC under high renewable penetration. Deep neural networks (DNNs) are trained to predict the frequency response using real data or high-fidelity simulation data. Next, the DNN is reformulated as a set of mixed-integer linear constraints to be incorporated into the ordinary UC formulation. In the data generation phase, all possible power injections are considered, and a region-of-interests active sampling is proposed to include power injection samples with frequency nadirs closer to the UFLC threshold, which significantly enhances the accuracy of frequency constraints in FCUC. The proposed FCUC is verified on the the IEEE 39-bus system. Then, a full-order dynamic model simulation using PSS/E verifies the effectiveness of FCUC in frequency-secure generator commitments.

Via

Access Paper or Ask Questions

Online Statistical Inference for Gradient-free Stochastic Optimization

Feb 05, 2021

Xi Chen, Zehua Lai, He Li, Yichen Zhang

Figure 1 for Online Statistical Inference for Gradient-free Stochastic Optimization

Figure 2 for Online Statistical Inference for Gradient-free Stochastic Optimization

Abstract:As gradient-free stochastic optimization gains emerging attention for a wide range of applications recently, the demand for uncertainty quantification of parameters obtained from such approaches arises. In this paper, we investigate the problem of statistical inference for model parameters based on gradient-free stochastic optimization methods that use only function values rather than gradients. We first present central limit theorem results for Polyak-Ruppert-averaging type gradient-free estimators. The asymptotic distribution reflects the trade-off between the rate of convergence and function query complexity. We next construct valid confidence intervals for model parameters through the estimation of the covariance matrix in a fully online fashion. We further give a general gradient-free framework for covariance estimation and analyze the role of function query complexity in the convergence rate of the covariance estimator. This provides a one-pass computationally efficient procedure for simultaneously obtaining an estimator of model parameters and conducting statistical inference. Finally, we provide numerical experiments to verify our theoretical results and illustrate some extensions of our method for various machine learning and deep learning applications.

Via

Access Paper or Ask Questions

Variance Reduction on Adaptive Stochastic Mirror Descent

Dec 26, 2020

Wenjie Li, Zhanyu Wang, Yichen Zhang, Guang Cheng

Figure 1 for Variance Reduction on Adaptive Stochastic Mirror Descent

Figure 2 for Variance Reduction on Adaptive Stochastic Mirror Descent

Figure 3 for Variance Reduction on Adaptive Stochastic Mirror Descent

Figure 4 for Variance Reduction on Adaptive Stochastic Mirror Descent

Abstract:We study the idea of variance reduction applied to adaptive stochastic mirror descent algorithms in nonsmooth nonconvex finite-sum optimization problems. We propose a simple yet generalized adaptive mirror descent algorithm with variance reduction named SVRAMD and provide its convergence analysis in different settings. We prove that variance reduction reduces the gradient complexity of most adaptive mirror descent algorithms and boost their convergence. In particular, our general theory implies variance reduction can be applied to algorithms using time-varying step sizes and self-adaptive algorithms such as AdaGrad and RMSProp. Moreover, our convergence rates recover the best existing rates of non-adaptive algorithms. We check the validity of our claims using experiments in deep learning.

* NeurIPS 2020 OPT workshop

Via

Access Paper or Ask Questions

Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Dec 04, 2020

Yichen Zhang, Feng Qiu, Tianqi Hong, Zhaoyu Wang, Fangxing Li

Figure 1 for Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Figure 2 for Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Figure 3 for Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Figure 4 for Hybrid Imitation Learning for Real-Time Service Restoration in Resilient Distribution Systems

Abstract:Self-healing capability is one of the most critical factors for a resilient distribution system, which requires intelligent agents to automatically perform restorative actions online, including network reconfiguration and reactive power dispatch. These agents should be equipped with a predesigned decision policy to meet real-time requirements and handle highly complex $N-k$ scenarios. The disturbance randomness hampers the application of exploration-dominant algorithms like traditional reinforcement learning (RL), and the agent training problem under $N-k$ scenarios has not been thoroughly solved. In this paper, we propose the imitation learning (IL) framework to train such policies, where the agent will interact with an expert to learn its optimal policy, and therefore significantly improve the training efficiency compared with the RL methods. To handle tie-line operations and reactive power dispatch simultaneously, we design a hybrid policy network for such a discrete-continuous hybrid action space. We employ the 33-node system under $N-k$ disturbances to verify the proposed framework.

Via

Access Paper or Ask Questions

FUEL: Fast UAV Exploration using Incremental Frontier Structure and Hierarchical Planning

Oct 22, 2020

Boyu Zhou, Yichen Zhang, Xinyi Chen, Shaojie Shen

Figure 1 for FUEL: Fast UAV Exploration using Incremental Frontier Structure and Hierarchical Planning

Figure 2 for FUEL: Fast UAV Exploration using Incremental Frontier Structure and Hierarchical Planning

Figure 3 for FUEL: Fast UAV Exploration using Incremental Frontier Structure and Hierarchical Planning

Figure 4 for FUEL: Fast UAV Exploration using Incremental Frontier Structure and Hierarchical Planning

Abstract:Autonomous exploration is a fundamental problem for various applications of unmanned aerial vehicles. Existing methods, however, were demonstrated to have low efficiency, due to the lack of optimality consideration, conservative motion plans and low decision frequencies. In this paper, we propose FUEL, a hierarchical framework that can support Fast UAV Exploration in complex unknown environments. We maintain crucial information in the entire space required by exploration planning by a frontier information structure (FIS), which can be updated incrementally when the space is explored. Supported by the FIS, a hierarchical planner plan exploration motions in three steps, which find efficient global coverage paths, refine a local set of viewpoints and generate minimum-time trajectories in sequence. We present extensive benchmark and real-world tests, in which our method completes the exploration tasks with unprecedented efficiency (3-8 times faster) compared to state-of-the-art approaches. Our method will be made open source to benefit the community.

* Video: https://www.youtube.com/watch?v=_dGgZUrWk-8; Demo: https://github.com/HKUST-Aerial-Robotics/FUEL

Via

Access Paper or Ask Questions