Lei Wang

MWPToolkit: An Open-Source Framework for Deep Learning-Based Math Word Problem Solvers

Sep 18, 2021
Yihuai Lan, Lei Wang, Qiyuan Zhang, Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim

Developing automatic Math Word Problem (MWP) solvers has been an interest of NLP researchers since the 1960s. Over the last few years, a growing number of datasets and deep learning-based methods have been proposed for effectively solving MWPs. However, most existing methods are benchmarked solely on one or two datasets and under varying configurations, which leads to a lack of unified, standardized, fair, and comprehensive comparison between methods. This paper presents MWPToolkit, the first open-source framework for solving MWPs. In MWPToolkit, we decompose the procedure of existing MWP solvers into multiple core components and decouple their models into highly reusable modules. We also provide a hyper-parameter search function to boost the performance. In total, we implement and compare 17 MWP solvers on 4 widely-used single equation generation benchmarks and 2 multiple equations generation benchmarks. These features make MWPToolkit suitable for researchers who want to reproduce advanced baseline models and develop new MWP solvers quickly. Code and documents are available at https://github.com/LYH-YF/MWPToolkit.

* 9 pages, 2 figures

Progressive Hard-case Mining across Pyramid Levels in Object Detection

Sep 15, 2021
Binghong Wu, Yehui Yang, Dalu Yang, Junde Wu, Haifeng Huang, Lei Wang, Junwei Liu, Yanwu Xu

In object detection, multi-level prediction (e.g., FPN, YOLO) and resampling techniques (e.g., focal loss, ATSS) have drastically improved one-stage detector performance. However, how to improve the performance by optimizing the feature pyramid level-by-level remains unexplored. We find that, during training, the ratio of positive over negative samples varies across pyramid levels (level imbalance), which is not addressed by current one-stage detectors. To mitigate the influence of level imbalance, we propose a Unified Multi-level Optimization Paradigm (UMOP) consisting of two components: 1) an independent classification loss supervising each pyramid level with individual resampling considerations; 2) a progressive hard-case mining loss defining all losses across the pyramid levels without extra level-wise settings. With UMOP as a plug-and-play scheme, modern one-stage detectors can attain a ~1.5 AP improvement with fewer training iterations and no additional computation overhead. Our best model achieves 55.1 AP on COCO test-dev. Code is available at https://github.com/zimoqingfeng/UMOP.
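
The level imbalance described in this abstract can be made concrete with a small sketch. The code below is not taken from the UMOP repository; it only counts positive and negative samples per pyramid level and reports their ratio. The function name, the `(level, is_positive)` input format, and the toy level names `P3` and `P7` are assumptions made for illustration.

```python
from collections import Counter


def positive_negative_ratio_per_level(samples):
    """Return {level: positives / negatives} for (level, is_positive) pairs.

    Hypothetical helper for illustration; a level with no negatives maps to None.
    """
    pos = Counter()
    neg = Counter()
    for level, is_positive in samples:
        if is_positive:
            pos[level] += 1
        else:
            neg[level] += 1
    levels = sorted(set(pos) | set(neg))
    return {lvl: (pos[lvl] / neg[lvl] if neg[lvl] else None) for lvl in levels}


# Toy data (assumed): a fine level with many background locations and a
# coarse level with few locations overall.
samples = (
    [("P3", True)] + [("P3", False)] * 99
    + [("P7", True)] * 2 + [("P7", False)] * 8
)
ratios = positive_negative_ratio_per_level(samples)
print(ratios)
```

In this toy setting the two levels have very different positive-to-negative ratios, which is the situation the abstract argues should be handled per level rather than with a single global resampling setting.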

LibFewShot: A Comprehensive Library for Few-shot Learning

Sep 10, 2021
Wenbin Li, Chuanqi Dong, Pinzhuo Tian, Tiexin Qin, Xuesong Yang, Ziyi Wang, Jing Huo, Yinghuan Shi, Lei Wang, Yang Gao, Jiebo Luo

Few-shot learning, especially few-shot image classification, has received increasing attention and witnessed significant advances in recent years. Some recent studies implicitly show that many generic techniques or "tricks", such as data augmentation, pre-training, knowledge distillation, and self-supervision, may greatly boost the performance of a few-shot learning method. Moreover, different works may employ different software platforms, different training schedules, different backbone architectures and even different input image sizes, making fair comparisons difficult and leaving practitioners struggling with reproducibility. To address these situations, we propose a comprehensive library for few-shot learning (LibFewShot) by re-implementing seventeen state-of-the-art few-shot learning methods in a unified framework with the same single codebase in PyTorch. Furthermore, based on LibFewShot, we provide comprehensive evaluations on multiple benchmark datasets with multiple backbone architectures to evaluate common pitfalls and effects of different training tricks. In addition, given the recent doubts on the necessity of the meta- or episodic-training mechanism, our evaluation results show that such a mechanism is still necessary, especially when combined with pre-training. We hope our work can not only lower the barriers for beginners to work on few-shot learning but also remove the effects of the nontrivial tricks to facilitate intrinsic research on few-shot learning. The source code is available from https://github.com/RL-VIG/LibFewShot.
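
For readers unfamiliar with the episodic training mentioned in this abstract, the sketch below shows one common way to draw an N-way K-shot episode from a labelled dataset. It is a generic illustration and does not use the LibFewShot API; the function name `sample_episode`, its parameters, and the toy label list are assumptions.

```python
import random


def sample_episode(labels, n_way, k_shot, n_query, rng):
    """Return (support, query) index lists for one N-way K-shot episode.

    Hypothetical helper: `labels[i]` is the class of example i, and `rng`
    is a `random.Random` instance so that sampling is reproducible.
    """
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    # Only classes with enough examples for both support and query sets.
    eligible = [c for c, idxs in by_class.items() if len(idxs) >= k_shot + n_query]
    if len(eligible) < n_way:
        raise ValueError("not enough classes with k_shot + n_query examples")

    support, query = [], []
    for c in rng.sample(eligible, n_way):
        picked = rng.sample(by_class[c], k_shot + n_query)
        support.extend(picked[:k_shot])   # K labelled examples per class
        query.extend(picked[k_shot:])     # held-out examples to classify
    return support, query


# Toy dataset (assumed): 10 classes with 20 examples each.
labels = [i // 20 for i in range(200)]
support, query = sample_episode(
    labels, n_way=5, k_shot=1, n_query=15, rng=random.Random(0)
)
print(len(support), len(query))
```

Each episode mimics the test-time task: the model sees only the support examples of the sampled classes and is scored on the disjoint query examples.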

* 14 pages

Outage Analysis and Beamwidth Optimization for Positioning-Assisted Beamforming

Sep 01, 2021
Bingcheng Zhu, Zaichen Zhang, Julian Cheng, Lei Wang, Jian Dang, Liang Wu

Conventional beamforming is based on channel estimation, which can be computationally intensive and inaccurate when the antenna array is large. In this work, we study the outage probability of positioning-assisted beamforming systems. Closed-form outage probability bounds are derived by considering positioning error, link distance and beamwidth. Based on the analytical result, we show that the beamwidth should be optimized with respect to the link distance and the transmit power, and such optimization significantly suppresses the outage probability.

Improving Ranking Correlation of Supernet with Candidates Enhancement and Progressive Training

Aug 12, 2021
Ziwei Yang, Ruyi Zhang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

One-shot neural architecture search (NAS) applies a weight-sharing supernet to reduce the unaffordable computation overhead of automated architecture design. However, the weight-sharing technique worsens the ranking consistency of performance due to the interference between different candidate networks. To address this issue, we propose a candidates enhancement method and a progressive training pipeline to improve the ranking correlation of the supernet. Specifically, we carefully redesign the sub-networks in the supernet and map the original supernet to a new one of high capacity. In addition, we gradually add narrow branches of the supernet to reduce the degree of weight sharing, which effectively alleviates the mutual interference between sub-networks. Finally, our method ranked 1st in the Supernet Track of the CVPR2021 1st Lightweight NAS Challenge.
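
Ranking correlation between supernet-estimated and stand-alone accuracies is often summarized with a rank statistic such as Kendall's tau. The abstract does not say which statistic the challenge used, so the choice of Kendall's tau (the tau-a variant) here is an assumption, as are the function name and the toy accuracy values.

```python
from itertools import combinations


def kendall_tau(xs, ys):
    """Kendall tau-a: (concordant - discordant) / total number of pairs.

    Tied pairs count as neither concordant nor discordant.
    """
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 items")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    n_pairs = len(xs) * (len(xs) - 1) // 2
    return (concordant - discordant) / n_pairs


# Toy values (assumed): accuracy of four candidate networks when evaluated
# with shared supernet weights versus when trained stand-alone.
supernet_acc = [0.61, 0.64, 0.58, 0.70]
standalone_acc = [0.72, 0.71, 0.69, 0.76]
tau = kendall_tau(supernet_acc, standalone_acc)
print(round(tau, 4))
```

A value of 1 means the supernet orders the candidates exactly as stand-alone training would, and a value of -1 means the order is fully reversed.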

* 5 pages, 2 figures. CVPR2021 NAS challenge

Cascade Bagging for Accuracy Prediction with Few Training Samples

Aug 12, 2021
Ruyi Zhang, Ziwei Yang, Zhi Yang, Xubo Yang, Lei Wang, Zheyang Li

An accuracy predictor is trained to predict the validation accuracy of a network from its architecture encoding. It can effectively assist in designing networks and improving Neural Architecture Search (NAS) efficiency. However, a high-performance predictor depends on adequate training samples, which requires unaffordable computation overhead. To alleviate this problem, we propose a novel framework to train an accuracy predictor with few training samples. The framework consists of data augmentation methods and an ensemble learning algorithm. The data augmentation methods calibrate weak labels and inject noise into the feature space. The ensemble learning algorithm, termed cascade bagging, trains two-level models by sampling data and features. Finally, the advantages of the above methods are demonstrated in the Performance Prediction Track of the CVPR2021 1st Lightweight NAS Challenge. Our code is made public at: https://github.com/dlongry/Solutionto-CVPR2021-NAS-Track2.

Few-shot Unsupervised Domain Adaptation with Image-to-class Sparse Similarity Encoding

Aug 06, 2021
Shengqi Huang, Wanqi Yang, Lei Wang, Luping Zhou, Ming Yang

This paper investigates a valuable setting called few-shot unsupervised domain adaptation (FS-UDA), which has not been sufficiently studied in the literature. In this setting, the source domain data are labelled, but with only a few shots per category, while the target domain data are unlabelled. To address the FS-UDA setting, we develop a general UDA model to solve the following two key issues: the few-shot labelled data per category and the domain adaptation between support and query sets. Our model is general in that, once trained, it can be applied to various FS-UDA tasks from the same source and target domains. Inspired by recent local descriptor based few-shot learning (FSL), our general UDA model is fully built upon local descriptors (LDs) for image classification and domain adaptation. By proposing a novel concept called similarity patterns (SPs), our model not only effectively considers the spatial relationship of LDs that was ignored in previous FSL methods, but also makes the learned image similarity better serve the required domain alignment. Specifically, we propose a novel IMage-to-class sparse Similarity Encoding (IMSE) method. It learns SPs to extract the local discriminative information for classification and meanwhile aligns the covariance matrix of the SPs for domain adaptation. Also, domain adversarial training and multi-scale local feature matching are performed upon LDs. Extensive experiments conducted on the multi-domain benchmark dataset DomainNet demonstrate the state-of-the-art performance of our IMSE for the novel setting of FS-UDA. In addition, for FSL, our IMSE also shows better performance than most recent FSL methods on miniImageNet.

* Proceedings of the 29th ACM International Conference on Multimedia (MM '21)

Hand Image Understanding via Deep Multi-Task Learning

Jul 28, 2021
Xiong Zhang, Hongsheng Huang, Jianchao Tan, Hongmin Xu, Cheng Yang, Guozhu Peng, Lei Wang, Ji Liu

Analyzing and understanding hand information from multimedia materials like images or videos is important for many real-world applications and remains an active topic in the research community. Various works focus on recovering hand information from a single image; however, they usually solve a single task, for example, hand mask segmentation, 2D/3D hand pose estimation, or hand mesh reconstruction, and do not perform well in challenging scenarios. To further improve the performance of these tasks, we propose a novel Hand Image Understanding (HIU) framework to extract comprehensive information of the hand object from a single RGB image, by jointly considering the relationships between these tasks. To achieve this goal, a cascaded multi-task learning (MTL) backbone is designed to estimate the 2D heat maps, to learn the segmentation mask, and to generate the intermediate 3D information encoding, followed by a coarse-to-fine learning paradigm and a self-supervised learning strategy. Qualitative experiments demonstrate that our approach is capable of recovering reasonable mesh representations even in challenging situations. Quantitatively, our method significantly outperforms the state-of-the-art approaches on various widely-used datasets, in terms of diverse evaluation metrics.

* Accepted by ICCV 2021

Trade When Opportunity Comes: Price Movement Forecasting via Locality-Aware Attention and Adaptive Refined Labeling

Jul 26, 2021
Liang Zeng, Lei Wang, Hui Niu, Jian Li, Ruchen Zhang, Zhonghao Dai, Dewei Zhu, Ling Wang

Price movement forecasting aims at predicting the future trends of financial assets based on the current market conditions and other relevant information. Recently, machine learning (ML) methods have become increasingly popular and achieved promising results for price movement forecasting in both academia and industry. Most existing ML solutions formulate the forecasting problem as a classification (to predict the direction) or a regression (to predict the return) problem over the entire set of training data. However, due to the extremely low signal-to-noise ratio and stochastic nature of financial data, good trading opportunities are extremely scarce. As a result, without careful selection of potentially profitable samples, such ML methods are prone to capture the patterns of noise instead of real signals. To address the above issues, we propose a novel framework, LARA (Locality-Aware Attention and Adaptive Refined Labeling), which contains the following three components: 1) Locality-aware attention automatically extracts the potentially profitable samples by attending to their label information in order to construct a more accurate classifier on these selected samples. 2) Adaptive refined labeling further iteratively refines the labels, alleviating the noise of samples. 3) Equipped with metric learning techniques, locality-aware attention enjoys task-specific distance metrics and distributes attention on potentially profitable samples in a more effective way. To validate our method, we conduct comprehensive experiments on three real-world financial markets: ETFs, China's A-share stock market, and the cryptocurrency market. LARA achieves superior performance compared with the time-series analysis methods and a set of machine learning based competitors on the Qlib platform. Extensive ablation studies and experiments demonstrate that LARA indeed captures more reliable trading opportunities.
