Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Chen Liang

TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis

Apr 07, 2024

Zhiyu Liang, Chen Liang, Zheng Liang, Hongzhi Wang, Bo Zheng

Figure 1 for TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis

Figure 2 for TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis

Figure 3 for TimeCSL: Unsupervised Contrastive Learning of General Shapelets for Explorable Time Series Analysis

Abstract:Unsupervised (a.k.a. Self-supervised) representation learning (URL) has emerged as a new paradigm for time series analysis, because it has the ability to learn generalizable time series representation beneficial for many downstream tasks without using labels that are usually difficult to obtain. Considering that existing approaches have limitations in the design of the representation encoder and the learning objective, we have proposed Contrastive Shapelet Learning (CSL), the first URL method that learns the general-purpose shapelet-based representation through unsupervised contrastive learning, and shown its superior performance in several analysis tasks, such as time series classification, clustering, and anomaly detection. In this paper, we develop TimeCSL, an end-to-end system that makes full use of the general and interpretable shapelets learned by CSL to achieve explorable time series analysis in a unified pipeline. We introduce the system components and demonstrate how users interact with TimeCSL to solve different analysis tasks in the unified pipeline, and gain insight into their time series by exploring the learned shapelets and representation.

Via

Access Paper or Ask Questions

Communication Efficient Distributed Training with Distributed Lion

Mar 30, 2024

Bo Liu, Lemeng Wu, Lizhang Chen, Kaizhao Liang, Jiaxu Zhu, Chen Liang, Raghuraman Krishnamoorthi, Qiang Liu

Figure 1 for Communication Efficient Distributed Training with Distributed Lion

Figure 2 for Communication Efficient Distributed Training with Distributed Lion

Figure 3 for Communication Efficient Distributed Training with Distributed Lion

Figure 4 for Communication Efficient Distributed Training with Distributed Lion

Abstract:The Lion optimizer has been a promising competitor with the AdamW for training large AI models, with advantages on memory, computation, and sample efficiency. In this paper, we introduce Distributed Lion, an innovative adaptation of Lion for distributed training environments. Leveraging the sign operator in Lion, our Distributed Lion only requires communicating binary or lower-precision vectors between workers to the center server, significantly reducing the communication cost. Our theoretical analysis confirms Distributed Lion's convergence properties. Empirical results demonstrate its robustness across a range of tasks, worker counts, and batch sizes, on both vision and language problems. Notably, Distributed Lion attains comparable performance to standard Lion or AdamW optimizers applied on aggregated gradients, but with significantly reduced communication bandwidth. This feature is particularly advantageous for training large models. In addition, we also demonstrate that Distributed Lion presents a more favorable performance-bandwidth balance compared to existing efficient distributed methods such as deep gradient compression and ternary gradients.

* 22 pages

Via

Access Paper or Ask Questions

Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing

Feb 05, 2024

Zihan Ma, Yongshang Li, Ronggui Ma, Chen Liang

Figure 1 for Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing

Figure 2 for Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing

Figure 3 for Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing

Figure 4 for Unsupervised semantic segmentation of high-resolution UAV imagery for road scene parsing

Abstract:Two challenges are presented when parsing road scenes in UAV images. First, the high resolution of UAV images makes processing difficult. Second, supervised deep learning methods require a large amount of manual annotations to train robust and accurate models. In this paper, an unsupervised road parsing framework that leverages recent advances in vision language models and fundamental computer vision model is introduced.Initially, a vision language model is employed to efficiently process ultra-large resolution UAV images to quickly detect road regions of interest in the images. Subsequently, the vision foundation model SAM is utilized to generate masks for the road regions without category information. Following that, a self-supervised representation learning network extracts feature representations from all masked regions. Finally, an unsupervised clustering algorithm is applied to cluster these feature representations and assign IDs to each cluster. The masked regions are combined with the corresponding IDs to generate initial pseudo-labels, which initiate an iterative self-training process for regular semantic segmentation. The proposed method achieves an impressive 89.96% mIoU on the development dataset without relying on any manual annotation. Particularly noteworthy is the extraordinary flexibility of the proposed method, which even goes beyond the limitations of human-defined categories and is able to acquire knowledge of new categories from the dataset itself.

Via

Access Paper or Ask Questions

Accelerated Cloud for Artificial Intelligence (ACAI)

Jan 30, 2024

Dachi Chen, Weitian Ding, Chen Liang, Chang Xu, Junwei Zhang, Majd Sakr

Abstract:Training an effective Machine learning (ML) model is an iterative process that requires effort in multiple dimensions. Vertically, a single pipeline typically includes an initial ETL (Extract, Transform, Load) of raw datasets, a model training stage, and an evaluation stage where the practitioners obtain statistics of the model performance. Horizontally, many such pipelines may be required to find the best model within a search space of model configurations. Many practitioners resort to maintaining logs manually and writing simple glue code to automate the workflow. However, carrying out this process on the cloud is not a trivial task in terms of resource provisioning, data management, and bookkeeping of job histories to make sure the results are reproducible. We propose an end-to-end cloud-based machine learning platform, Accelerated Cloud for AI (ACAI), to help improve the productivity of ML practitioners. ACAI achieves this goal by enabling cloud-based storage of indexed, labeled, and searchable data, as well as automatic resource provisioning, job scheduling, and experiment tracking. Specifically, ACAI provides practitioners (1) a data lake for storing versioned datasets and their corresponding metadata, and (2) an execution engine for executing ML jobs on the cloud with automatic resource provisioning (auto-provision), logging and provenance tracking. To evaluate ACAI, we test the efficacy of our auto-provisioner on the MNIST handwritten digit classification task, and we study the usability of our system using experiments and interviews. We show that our auto-provisioner produces a 1.7x speed-up and 39% cost reduction, and our system reduces experiment time for ML scientists by 20% on typical ML use cases.

Via

Access Paper or Ask Questions

Grayscale Image Colorization with GAN and CycleGAN in Different Image Domain

Jan 21, 2024

Chen Liang, Yunchen Sheng, Yichen Mo

Abstract:Automatic colorization of grayscale image has been a challenging task. Previous research have applied supervised methods in conquering this problem [ 1]. In this paper, we reproduces a GAN-based coloring model, and experiments one of its variant. We also proposed a CycleGAN based model and experiments those methods on various datasets. The result shows that the proposed CycleGAN model does well in human-face coloring and comic coloring, but lack the ability to diverse colorization.

Via

Access Paper or Ask Questions

A Novel Dual-Stage Evolutionary Algorithm for Finding Robust Solutions

Jan 02, 2024

Wei Du, Wenxuan Fang, Chen Liang, Yang Tang, Yaochu Jin

Figure 1 for A Novel Dual-Stage Evolutionary Algorithm for Finding Robust Solutions

Figure 2 for A Novel Dual-Stage Evolutionary Algorithm for Finding Robust Solutions

Figure 3 for A Novel Dual-Stage Evolutionary Algorithm for Finding Robust Solutions

Figure 4 for A Novel Dual-Stage Evolutionary Algorithm for Finding Robust Solutions

Abstract:In robust optimization problems, the magnitude of perturbations is relatively small. Consequently, solutions within certain regions are less likely to represent the robust optima when perturbations are introduced. Hence, a more efficient search process would benefit from increased opportunities to explore promising regions where global optima or good local optima are situated. In this paper, we introduce a novel robust evolutionary algorithm named the dual-stage robust evolutionary algorithm (DREA) aimed at discovering robust solutions. DREA operates in two stages: the peak-detection stage and the robust solution-searching stage. The primary objective of the peak-detection stage is to identify peaks in the fitness landscape of the original optimization problem. Conversely, the robust solution-searching stage focuses on swiftly identifying the robust optimal solution using information obtained from the peaks discovered in the initial stage. These two stages collectively enable the proposed DREA to efficiently obtain the robust optimal solution for the optimization problem. This approach achieves a balance between solution optimality and robustness by separating the search processes for optimal and robust optimal solutions. Experimental results demonstrate that DREA significantly outperforms five state-of-the-art algorithms across 18 test problems characterized by diverse complexities. Moreover, when evaluated on higher-dimensional robust optimization problems (100-$D$ and 200-$D$), DREA also demonstrates superior performance compared to all five counterpart algorithms.

Via

Access Paper or Ask Questions

Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Dec 09, 2023

Chen Liang, Donghua Yang, Zhiyu Liang, Hongzhi Wang, Zheng Liang, Xiyang Zhang, Jianfeng Huang

Figure 1 for Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Figure 2 for Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Figure 3 for Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Figure 4 for Unsupervised Multi-modal Feature Alignment for Time Series Representation Learning

Abstract:In recent times, the field of unsupervised representation learning (URL) for time series data has garnered significant interest due to its remarkable adaptability across diverse downstream applications. Unsupervised learning goals differ from downstream tasks, making it tricky to ensure downstream task utility by focusing only on temporal feature characterization. Researchers have proposed multiple transformations to extract discriminative patterns implied in informative time series, trying to fill the gap. Despite the introduction of a variety of feature engineering techniques, e.g. spectral domain, wavelet transformed features, features in image form and symbolic features etc. the utilization of intricate feature fusion methods and dependence on heterogeneous features during inference hampers the scalability of the solutions. To address this, our study introduces an innovative approach that focuses on aligning and binding time series representations encoded from different modalities, inspired by spectral graph theory, thereby guiding the neural encoder to uncover latent pattern associations among these multi-modal features. In contrast to conventional methods that fuse features from multiple modalities, our proposed approach simplifies the neural architecture by retaining a single time series encoder, consequently leading to preserved scalability. We further demonstrate and prove mechanisms for the encoder to maintain better inductive bias. In our experimental evaluation, we validated the proposed method on a diverse set of time series datasets from various domains. Our approach outperforms existing state-of-the-art URL methods across diverse downstream tasks.

Via

Access Paper or Ask Questions

TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4

Nov 29, 2023

Zihao Tan, Qingliang Chen, Yongjian Huang, Chen Liang

Figure 1 for TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4

Figure 2 for TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4

Figure 3 for TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4

Figure 4 for TARGET: Template-Transferable Backdoor Attack Against Prompt-based NLP Models via GPT4

Abstract:Prompt-based learning has been widely applied in many low-resource NLP tasks such as few-shot scenarios. However, this paradigm has been shown to be vulnerable to backdoor attacks. Most of the existing attack methods focus on inserting manually predefined templates as triggers in the pre-training phase to train the victim model and utilize the same triggers in the downstream task to perform inference, which tends to ignore the transferability and stealthiness of the templates. In this work, we propose a novel approach of TARGET (Template-trAnsfeRable backdoor attack aGainst prompt-basEd NLP models via GPT4), which is a data-independent attack method. Specifically, we first utilize GPT4 to reformulate manual templates to generate tone-strong and normal templates, and the former are injected into the model as a backdoor trigger in the pre-training phase. Then, we not only directly employ the above templates in the downstream task, but also use GPT4 to generate templates with similar tone to the above templates to carry out transferable attacks. Finally we have conducted extensive experiments on five NLP datasets and three BERT series models, with experimental results justifying that our TARGET method has better attack performance and stealthiness compared to the two-external baseline methods on direct attacks, and in addition achieves satisfactory attack capability in the unseen tone-similar templates.

Via

Access Paper or Ask Questions

Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Nov 16, 2023

Wei Zhang, Dai Li, Chen Liang, Fang Zhou, Zhongke Zhang, Xuewei Wang, Ru Li, Yi Zhou, Yaning Huang, Dong Liang(+11 more)

Figure 1 for Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Figure 2 for Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Figure 3 for Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Figure 4 for Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Abstract:Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory, often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in extensive systems like Meta's, which encompass hundreds of models with diverse specifications, rendering the tailoring of user representation learning for each model impractical. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system, designed to facilitate efficient and scalable sharing of online user representation across hundreds of ads models. SUM leverages a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, promoting efficient representation sharing. To adapt to the dynamic nature of user features and ensure embedding freshness, we designed SUM Online Asynchronous Platform (SOAP), a latency free online serving system complemented with model freshness and embedding stabilization, which enables frequent user model updates and online inference of user embeddings upon each user request. We share our hands-on deployment experiences for the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched to hundreds of ads ranking models in Meta, processing hundreds of billions of user requests daily, yielding significant online metric gains and infrastructure cost savings.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions

Learning Reduced-Order Soft Robot Controller

Nov 03, 2023

Chen Liang, Xifeng Gao, Kui Wu, Zherong Pan

Figure 1 for Learning Reduced-Order Soft Robot Controller

Figure 2 for Learning Reduced-Order Soft Robot Controller

Figure 3 for Learning Reduced-Order Soft Robot Controller

Figure 4 for Learning Reduced-Order Soft Robot Controller

Abstract:Deformable robots are notoriously difficult to model or control due to its high-dimensional configuration spaces. Direct trajectory optimization suffers from the curse-of-dimensionality and incurs a high computational cost, while learning-based controller optimization methods are sensitive to hyper-parameter tuning. To overcome these limitations, we hypothesize that high fidelity soft robots can be both simulated and controlled by restricting to low-dimensional spaces. Under such assumption, we propose a two-stage algorithm to identify such simulation- and control-spaces. Our method first identifies the so-called simulation-space that captures the salient deformation modes, to which the robot's governing equation is restricted. We then identify the control-space, to which control signals are restricted. We propose a multi-fidelity Riemannian Bayesian bilevel optimization to identify task-specific control spaces. We show that the dimension of control-space can be less than $10$ for a high-DOF soft robot to accomplish walking and swimming tasks, allowing low-dimensional MPC controllers to be applied to soft robots with tractable computational complexity.

Via

Access Paper or Ask Questions