Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiawei Wu

VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism

Mar 05, 2024

Jiawei Wu, Mingyuan Yan, Dianbo Liu

Abstract:The pursuit of optimizing cancer therapies is significantly advanced by the accurate prediction of drug synergy. Traditional methods, such as clinical trials, are reliable yet encumbered by extensive time and financial demands. The emergence of high-throughput screening and computational innovations has heralded a shift towards more efficient methodologies for exploring drug interactions. In this study, we present VQSynergy, a novel framework that employs the Vector Quantization (VQ) mechanism, integrated with gated residuals and a tailored attention mechanism, to enhance the precision and generalizability of drug synergy predictions. Our findings demonstrate that VQSynergy surpasses existing models in terms of robustness, particularly under Gaussian noise conditions, highlighting its superior performance and utility in the complex and often noisy domain of drug synergy research. This study underscores the potential of VQSynergy in revolutionizing the field through its advanced predictive capabilities, thereby contributing to the optimization of cancer treatment strategies.

Via

Access Paper or Ask Questions

GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Feb 03, 2024

Cunxiao Du, Jing Jiang, Xu Yuanchen, Jiawei Wu, Sicheng Yu, Yongqi Li, Shenggui Li, Kai Xu, Liqiang Nie, Zhaopeng Tu(+1 more)

Figure 1 for GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Figure 2 for GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Figure 3 for GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Figure 4 for GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding

Abstract:Speculative decoding is a relatively new decoding framework that leverages small and efficient draft models to reduce the latency of LLMs. In this study, we introduce GliDe and CaPE, two low-hassle modifications to vanilla speculative decoding to further improve the decoding speed of a frozen LLM. Specifically, GliDe is a modified draft model architecture that reuses the cached keys and values from the target LLM, while CaPE is a proposal expansion method that uses the draft model's confidence scores to help select additional candidate tokens for verification. Extensive experiments on different benchmarks demonstrate that our proposed GliDe draft model significantly reduces the expected decoding latency. Additional evaluation using walltime reveals that GliDe can accelerate Vicuna models up to 2.17x and further extend the improvement to 2.61x with CaPE. We will release our code, data, and the trained draft models.

Via

Access Paper or Ask Questions

Class-Specific Distribution Alignment for Semi-Supervised Medical Image Classification

Jul 29, 2023

Zhongzheng Huang, Jiawei Wu, Tao Wang, Zuoyong Li, Anastasia Ioannou

Figure 1 for Class-Specific Distribution Alignment for Semi-Supervised Medical Image Classification

Figure 2 for Class-Specific Distribution Alignment for Semi-Supervised Medical Image Classification

Figure 3 for Class-Specific Distribution Alignment for Semi-Supervised Medical Image Classification

Figure 4 for Class-Specific Distribution Alignment for Semi-Supervised Medical Image Classification

Abstract:Despite the success of deep neural networks in medical image classification, the problem remains challenging as data annotation is time-consuming, and the class distribution is imbalanced due to the relative scarcity of diseases. To address this problem, we propose Class-Specific Distribution Alignment (CSDA), a semi-supervised learning framework based on self-training that is suitable to learn from highly imbalanced datasets. Specifically, we first provide a new perspective to distribution alignment by considering the process as a change of basis in the vector space spanned by marginal predictions, and then derive CSDA to capture class-dependent marginal predictions on both labeled and unlabeled data, in order to avoid the bias towards majority classes. Furthermore, we propose a Variable Condition Queue (VCQ) module to maintain a proportionately balanced number of unlabeled samples for each class. Experiments on three public datasets HAM10000, CheXpert and Kvasir show that our method provides competitive performance on semi-supervised skin disease, thoracic disease, and endoscopic image classification tasks.

* Paper appears in Computers in Biology and Medicine 2023, 164, 107280

Via

Access Paper or Ask Questions

Semi-Supervised Medical Image Segmentation with Co-Distribution Alignment

Jul 24, 2023

Tao Wang, Zhongzheng Huang, Jiawei Wu, Yuanzheng Cai, Zuoyong Li

Abstract:Medical image segmentation has made significant progress when a large amount of labeled data are available. However, annotating medical image segmentation datasets is expensive due to the requirement of professional skills. Additionally, classes are often unevenly distributed in medical images, which severely affects the classification performance on minority classes. To address these problems, this paper proposes Co-Distribution Alignment (Co-DA) for semi-supervised medical image segmentation. Specifically, Co-DA aligns marginal predictions on unlabeled data to marginal predictions on labeled data in a class-wise manner with two differently initialized models before using the pseudo-labels generated by one model to supervise the other. Besides, we design an over-expectation cross-entropy loss for filtering the unlabeled pixels to reduce noise in their pseudo-labels. Quantitative and qualitative experiments on three public datasets demonstrate that the proposed approach outperforms existing state-of-the-art semi-supervised medical image segmentation methods on both the 2D CaDIS dataset and the 3D LGE-MRI and ACDC datasets, achieving an mIoU of 0.8515 with only 24% labeled data on CaDIS, and a Dice score of 0.8824 and 0.8773 with only 20% data on LGE-MRI and ACDC, respectively.

* Paper appears in Bioengineering 2023, 10(7), 869

Via

Access Paper or Ask Questions

dugMatting: Decomposed-Uncertainty-Guided Matting

Jun 02, 2023

Jiawei Wu, Changqing Zhang, Zuoyong Li, Huazhu Fu, Xi Peng, Joey Tianyi Zhou

Abstract:Cutting out an object and estimating its opacity mask, known as image matting, is a key task in image and video editing. Due to the highly ill-posed issue, additional inputs, typically user-defined trimaps or scribbles, are usually needed to reduce the uncertainty. Although effective, it is either time consuming or only suitable for experienced users who know where to place the strokes. In this work, we propose a decomposed-uncertainty-guided matting (dugMatting) algorithm, which explores the explicitly decomposed uncertainties to efficiently and effectively improve the results. Basing on the characteristic of these uncertainties, the epistemic uncertainty is reduced in the process of guiding interaction (which introduces prior knowledge), while the aleatoric uncertainty is reduced in modeling data distribution (which introduces statistics for both data and possible noise). The proposed matting framework relieves the requirement for users to determine the interaction areas by using simple and efficient labeling. Extensively quantitative and qualitative results validate that the proposed method significantly improves the original matting algorithms in terms of both efficiency and efficacy.

Via

Access Paper or Ask Questions

Continual Transfer Learning for Cross-Domain Click-Through Rate Prediction at Taobao

Aug 11, 2022

Lixin Liu, Yanling Wang, Tianming Wang, Dong Guan, Jiawei Wu, Jingxu Chen, Rong Xiao, Wenxiang Zhu, Fei Fang

Figure 1 for Continual Transfer Learning for Cross-Domain Click-Through Rate Prediction at Taobao

Figure 2 for Continual Transfer Learning for Cross-Domain Click-Through Rate Prediction at Taobao

Figure 3 for Continual Transfer Learning for Cross-Domain Click-Through Rate Prediction at Taobao

Figure 4 for Continual Transfer Learning for Cross-Domain Click-Through Rate Prediction at Taobao

Abstract:As one of the largest e-commerce platforms in the world, Taobao's recommendation systems (RSs) serve the demands of shopping for hundreds of millions of customers. Click-Through Rate (CTR) prediction is a core component of the RS. One of the biggest characteristics in CTR prediction at Taobao is that there exist multiple recommendation domains where the scales of different domains vary significantly. Therefore, it is crucial to perform cross-domain CTR prediction to transfer knowledge from large domains to small domains to alleviate the data sparsity issue. However, existing cross-domain CTR prediction methods are proposed for static knowledge transfer, ignoring that all domains in real-world RSs are continually time-evolving. In light of this, we present a necessary but novel task named Continual Transfer Learning (CTL), which transfers knowledge from a time-evolving source domain to a time-evolving target domain. In this work, we propose a simple and effective CTL model called CTNet to solve the problem of continual cross-domain CTR prediction at Taobao, and CTNet can be trained efficiently. Particularly, CTNet considers an important characteristic in the industry that models has been continually well-trained for a very long time. So CTNet aims to fully utilize all the well-trained model parameters in both source domain and target domain to avoid losing historically acquired knowledge, and only needs incremental target domain data for training to guarantee efficiency. Extensive offline experiments and online A/B testing at Taobao demonstrate the efficiency and effectiveness of CTNet. CTNet is now deployed online in the recommender systems of Taobao, serving the main traffic of hundreds of millions of active users.

* 10 pages

Via

Access Paper or Ask Questions

Improving Robustness and Generality of NLP Models Using Disentangled Representations

Sep 21, 2020

Jiawei Wu, Xiaoya Li, Xiang Ao, Yuxian Meng, Fei Wu, Jiwei Li

Figure 1 for Improving Robustness and Generality of NLP Models Using Disentangled Representations

Figure 2 for Improving Robustness and Generality of NLP Models Using Disentangled Representations

Figure 3 for Improving Robustness and Generality of NLP Models Using Disentangled Representations

Figure 4 for Improving Robustness and Generality of NLP Models Using Disentangled Representations

Abstract:Supervised neural networks, which first map an input $x$ to a single representation $z$, and then map $z$ to the output label $y$, have achieved remarkable success in a wide range of natural language processing (NLP) tasks. Despite their success, neural models lack for both robustness and generality: small perturbations to inputs can result in absolutely different outputs; the performance of a model trained on one domain drops drastically when tested on another domain. In this paper, we present methods to improve robustness and generality of NLP models from the standpoint of disentangled representation learning. Instead of mapping $x$ to a single representation $z$, the proposed strategy maps $x$ to a set of representations $\{z_1,z_2,...,z_K\}$ while forcing them to be disentangled. These representations are then mapped to different logits $l$s, the ensemble of which is used to make the final prediction $y$. We propose different methods to incorporate this idea into currently widely-used models, including adding an $L$2 regularizer on $z$s or adding Total Correlation (TC) under the framework of variational information bottleneck (VIB). We show that models trained with the proposed criteria provide better robustness and domain adaptation ability in a wide range of supervised learning tasks.

Via

Access Paper or Ask Questions

Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions

Jun 05, 2020

Xiaoya Li, Mingxin Zhou, Jiawei Wu, Arianna Yuan, Fei Wu, Jiwei Li

Figure 1 for Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions

Figure 2 for Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions

Figure 3 for Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions

Figure 4 for Analyzing COVID-19 on Online Social Media: Trends, Sentiments and Emotions

Abstract:At the time of writing, the ongoing pandemic of coronavirus disease (COVID-19) has caused severe impacts on society, economy and people's daily lives. People constantly express their opinions on various aspects of the pandemic on social media, making user-generated content an important source for understanding public emotions and concerns. In this paper, we perform a comprehensive analysis on the affective trajectories of the American people and the Chinese people based on Twitter and Weibo posts between January 20th, 2020 and May 11th 2020. Specifically, by identifying people's sentiments, emotions (i.e., anger, disgust, fear, happiness, sadness, surprise) and the emotional triggers (e.g., what a user is angry/sad about) we are able to depict the dynamics of public affect in the time of COVID-19. By contrasting two very different countries, China and the Unites States, we reveal sharp differences in people's views on COVID-19 in different cultures. Our study provides a computational approach to unveiling public emotions and concerns on the pandemic in real-time, which would potentially help policy-makers better understand people's need and thus make optimal policy.

Via

Access Paper or Ask Questions

Towards Cognitive Routing based on Deep Reinforcement Learning

Mar 19, 2020

Jiawei Wu, Jianxue Li, Yang Xiao, Jun Liu

Figure 1 for Towards Cognitive Routing based on Deep Reinforcement Learning

Figure 2 for Towards Cognitive Routing based on Deep Reinforcement Learning

Figure 3 for Towards Cognitive Routing based on Deep Reinforcement Learning

Figure 4 for Towards Cognitive Routing based on Deep Reinforcement Learning

Abstract:Routing is one of the key functions for stable operation of network infrastructure. Nowadays, the rapid growth of network traffic volume and changing of service requirements call for more intelligent routing methods than before. Towards this end, we propose a definition of cognitive routing and an implementation approach based on Deep Reinforcement Learning (DRL). To facilitate the research of DRL-based cognitive routing, we introduce a simulator named RL4Net for DRL-based routing algorithm development and simulation. Then, we design and implement a DDPG-based routing algorithm. The simulation results on an example network topology show that the DDPG-based routing algorithm achieves better performance than OSPF and random weight algorithms. It demonstrate the preliminary feasibility and potential advantage of cognitive routing for future network.

* 6 pages, 7 figures

Via

Access Paper or Ask Questions

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Sep 09, 2019

Jiawei Wu, Wenhan Xiong, William Yang Wang

Figure 1 for Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Figure 2 for Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Figure 3 for Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Figure 4 for Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Abstract:Many tasks in natural language processing can be viewed as multi-label classification problems. However, most of the existing models are trained with the standard cross-entropy loss function and use a fixed prediction policy (e.g., a threshold of 0.5) for all the labels, which completely ignores the complexity and dependencies among different labels. In this paper, we propose a meta-learning method to capture these complex label dependencies. More specifically, our method utilizes a meta-learner to jointly learn the training policies and prediction policies for different labels. The training policies are then used to train the classifier with the cross-entropy loss function, and the prediction policies are further implemented for prediction. Experimental results on fine-grained entity typing and text classification demonstrate that our proposed method can obtain more accurate multi-label classification results.

* 11pages, 5 figures, accepted to EMNLP 2019

Via

Access Paper or Ask Questions