Wei Lu

A Deep Learning based No-reference Quality Assessment Model for UGC Videos

Apr 29, 2022
Wei Sun, Xiongkuo Min, Wei Lu, Guangtao Zhai

Quality assessment for User Generated Content (UGC) videos plays an important role in ensuring the viewing experience of end-users. Previous UGC video quality assessment (VQA) studies use either image recognition models or image quality assessment (IQA) models to extract frame-level features of UGC videos for quality regression, which are regarded as sub-optimal solutions because of the domain shifts between these tasks and the UGC VQA task. In this paper, we propose a very simple but effective UGC VQA model, which tries to address this problem by training an end-to-end spatial feature extraction network to directly learn the quality-aware spatial feature representation from raw pixels of the video frames. We also extract the motion features to measure the temporal-related distortions that the spatial features cannot model. The proposed model utilizes very sparse frames to extract spatial features and dense frames (i.e. the video chunk) with a very low spatial resolution to extract motion features, and thereby has low computational complexity. With the better quality-aware features, we only use a simple multilayer perceptron (MLP) network to regress them into the chunk-level quality scores, and then the temporal average pooling strategy is adopted to obtain the video-level quality score. We further introduce a multi-scale quality fusion strategy to solve the problem of VQA across different spatial resolutions, where the multi-scale weights are obtained from the contrast sensitivity function of the human visual system. The experimental results show that the proposed model achieves the best performance on five popular UGC VQA databases, which demonstrates the effectiveness of the proposed model. The code will be publicly available.
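
The last two stages the abstract describes, temporal average pooling of chunk-level scores and fusion of scores across spatial scales, can be sketched in a few lines. This is only an illustration under stated assumptions: the function names are hypothetical, the fusion is written as a normalized weighted average (the abstract does not give the formula), and the weights below are placeholders rather than values derived from a contrast sensitivity function.

```python
from statistics import fmean


def video_score(chunk_scores):
    """Temporal average pooling: the video score is the mean chunk score."""
    if not chunk_scores:
        raise ValueError("need at least one chunk-level score")
    return fmean(chunk_scores)


def fuse_scales(scores_by_scale, weights):
    """Combine per-scale video scores with a normalized weighted average.

    Assumption: a weighted arithmetic mean stands in for the paper's
    fusion rule, which is not specified in the abstract.
    """
    if len(scores_by_scale) != len(weights):
        raise ValueError("one weight per scale is required")
    total = sum(weights)
    return sum(s * w for s, w in zip(scores_by_scale, weights)) / total


# Hypothetical chunk-level scores for the same video at two resolutions.
full_res = video_score([60, 70, 80])   # 70.0
half_res = video_score([40, 50, 60])   # 50.0

# Placeholder weights (NOT derived from a contrast sensitivity function).
print(fuse_scales([full_res, half_res], [3, 1]))  # 65.0
```

Keeping pooling and fusion as separate functions mirrors the order in the abstract: scores are first pooled over time within each scale, and only then combined across scales.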

Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction

Mar 19, 2022
Zhanming Jie, Jierui Li, Wei Lu

Solving math word problems requires deductive reasoning over the quantities in the text. Various recent research efforts mostly relied on sequence-to-sequence or sequence-to-tree models to generate mathematical expressions without explicitly performing relational reasoning between quantities in the given context. While empirically effective, such approaches typically do not provide explanations for the generated expressions. In this work, we view the task as a complex relation extraction problem, proposing a novel approach that presents explainable deductive reasoning steps to iteratively construct target expressions, where each step involves a primitive operation over two quantities defining their relation. Through extensive experiments on four benchmark datasets, we show that the proposed model significantly outperforms existing strong baselines. We further demonstrate that the deductive procedure not only presents more explainable steps but also enables us to make more accurate predictions on questions that require more complex reasoning.

* 12 pages, 7 figures, Accepted by ACL-2022 main conference as a long paper
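
To make the idea of deductive steps concrete, the sketch below executes a hand-written sequence of steps, each applying one primitive operation to two quantities and adding the result to the pool of quantities available to later steps. It is an illustration only: in the paper the steps are predicted by a model, whereas here `run_steps`, the step encoding, and the example problem are all hypothetical.

```python
import operator

# Primitive operations a step may apply to a pair of quantities.
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def run_steps(quantities, steps):
    """Execute deductive steps and return (final value, readable trace).

    Each step is (i, j, op): apply op to quantities i and j. The result
    is appended to the pool, so later steps can refer to it by index.
    """
    values = list(quantities)
    trace = []
    for i, j, op in steps:
        result = OPS[op](values[i], values[j])
        trace.append(f"q{i} {op} q{j} = {result}")
        values.append(result)
    return values[-1], trace


# "There are 3 boxes with 4 apples each. 2 apples are eaten. How many remain?"
quantities = [3, 4, 2]                  # q0, q1, q2 taken from the text
steps = [(0, 1, "*"), (3, 2, "-")]      # q3 = q0 * q1, then q4 = q3 - q2

answer, trace = run_steps(quantities, steps)
print(answer)  # 10
print(trace)   # ['q0 * q1 = 12', 'q3 - q2 = 10']
```

The trace is what makes the procedure explainable: every intermediate quantity is tied to the two quantities and the operation that produced it.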

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition

Mar 01, 2022
Xinyu Wang, Yongliang Shen, Jiong Cai, Tao Wang, Xiaobin Wang, Pengjun Xie, Fei Huang, Weiming Lu, Yueting Zhuang, Kewei Tu, Wei Lu, Yong Jiang

The MultiCoNER shared task aims at detecting semantically ambiguous and complex named entities in short and low-context settings for multiple languages. The lack of contexts makes the recognition of ambiguous named entities challenging. To alleviate this issue, our team DAMO-NLP proposes a knowledge-based system, where we build a multilingual knowledge base based on Wikipedia to provide related context information to the named entity recognition (NER) model. Given an input sentence, our system effectively retrieves related contexts from the knowledge base. The original input sentences are then augmented with such context information, allowing significantly better contextualized token representations to be captured. Our system wins 10 out of 13 tracks in the MultiCoNER shared task.

* Our Knowledge-based NER system wins 10 out of 13 tracks in the SemEval-2022 MultiCoNER shared task
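
The retrieve-then-augment pattern described in the abstract can be illustrated with a toy example. Everything here is an assumption made for illustration: the three-sentence knowledge base, the token-overlap scoring, and the ` [SEP] ` separator are stand-ins, not the Wikipedia-based knowledge base or the retrieval method used by the actual system.

```python
def tokenize(text):
    """Lowercased whitespace tokens as a set (toy tokenizer)."""
    return set(text.lower().split())


def retrieve(sentence, knowledge_base, top_k=2):
    """Return up to top_k entries sharing the most tokens with the sentence."""
    query = tokenize(sentence)
    scored = [(len(query & tokenize(doc)), doc) for doc in knowledge_base]
    scored = [pair for pair in scored if pair[0] > 0]    # drop non-matches
    scored.sort(key=lambda pair: pair[0], reverse=True)  # stable sort
    return [doc for _, doc in scored[:top_k]]


def augment(sentence, contexts, sep=" [SEP] "):
    """Append retrieved contexts to the input sentence."""
    return sep.join([sentence] + contexts)


# Hypothetical knowledge base entries.
kb = [
    "Paris is the capital of France",
    "Paris Hilton is an American media personality",
    "Berlin is the capital of Germany",
]

query = "paris hilton new album"
print(augment(query, retrieve(query, kb, top_k=1)))
# paris hilton new album [SEP] Paris Hilton is an American media personality
```

The augmented string, rather than the short original sentence, is what an NER model would then encode, which is how the extra context reaches the token representations.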

Exploring Task Difficulty for Few-Shot Relation Extraction

Sep 28, 2021
Jiale Han, Bo Cheng, Wei Lu

Few-shot relation extraction (FSRE) focuses on recognizing novel relations by learning with merely a handful of annotated instances. Meta-learning has been widely adopted for such a task, which trains on randomly generated few-shot tasks to learn generic data representations. Despite impressive results achieved, existing models still perform suboptimally when handling hard FSRE tasks, where the relations are fine-grained and similar to each other. We argue this is largely because existing models do not distinguish hard tasks from easy ones in the learning process. In this paper, we introduce a novel approach based on contrastive learning that learns better representations by exploiting relation label information. We further design a method that allows the model to adaptively learn how to focus on hard tasks. Experiments on two standard datasets demonstrate the effectiveness of our method.

Profiling Neural Blocks and Design Spaces for Mobile Neural Architecture Search

Sep 25, 2021
Keith G. Mills, Fred X. Han, Jialin Zhang, Seyed Saeed Changiz Rezaei, Fabian Chudak, Wei Lu, Shuo Lian, Shangling Jui, Di Niu

Neural architecture search automates neural network design and has achieved state-of-the-art results in many deep learning applications. While recent literature has focused on designing networks to maximize accuracy, little work has been conducted to understand the compatibility of architecture design spaces to varying hardware. In this paper, we analyze the neural blocks used to build Once-for-All (MobileNetV3), ProxylessNAS and ResNet families, in order to understand their predictive power and inference latency on various devices, including Huawei Kirin 9000 NPU, RTX 2080 Ti, AMD Threadripper 2990WX, and Samsung Note10. We introduce a methodology to quantify the friendliness of neural blocks to hardware and the impact of their placement in a macro network on overall network performance via only end-to-end measurements. Based on extensive profiling results, we derive design insights and apply them to hardware-specific search space reduction. We show that searching in the reduced search space generates better accuracy-latency Pareto frontiers than searching in the original search spaces, customizing architecture search according to the hardware. Moreover, insights derived from measurements lead to notably higher ImageNet top-1 scores on all search spaces investigated.

* Accepted as an Applied Research Paper at CIKM 2021; 10 pages, 8 Figures, 2 Tables
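
For readers unfamiliar with the term, an accuracy-latency Pareto frontier is the set of architectures that no other candidate beats on both latency and accuracy at once. The sketch below computes it for a handful of made-up measurements; the function name and all numbers are hypothetical and are not results from the paper.

```python
def pareto_front(points):
    """Return the non-dominated (latency_ms, accuracy) pairs.

    Lower latency and higher accuracy are better. After sorting by
    latency (ties broken by higher accuracy first), a point is kept only
    if its accuracy exceeds that of every faster point seen so far.
    """
    front = []
    best_acc = float("-inf")
    for latency, acc in sorted(points, key=lambda p: (p[0], -p[1])):
        if acc > best_acc:
            front.append((latency, acc))
            best_acc = acc
    return front


# Hypothetical (latency in ms, top-1 accuracy) measurements.
candidates = [
    (10, 0.70), (12, 0.68), (15, 0.75),
    (15, 0.74), (20, 0.75), (25, 0.80),
]
print(pareto_front(candidates))
# [(10, 0.7), (15, 0.75), (25, 0.8)]
```

Comparing two search spaces then amounts to computing one frontier per space and checking which frontier offers higher accuracy at a given latency budget.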

L$^{2}$NAS: Learning to Optimize Neural Architectures via Continuous-Action Reinforcement Learning

Sep 25, 2021
Keith G. Mills, Fred X. Han, Mohammad Salameh, Seyed Saeed Changiz Rezaei, Linglong Kong, Wei Lu, Shuo Lian, Shangling Jui, Di Niu

Neural architecture search (NAS) has achieved remarkable results in deep neural network design. Differentiable architecture search converts the search over discrete architectures into a hyperparameter optimization problem which can be solved by gradient descent. However, questions have been raised regarding the effectiveness and generalizability of gradient methods for solving non-convex architecture hyperparameter optimization problems. In this paper, we propose L$^{2}$NAS, which learns to intelligently optimize and update architecture hyperparameters via an actor neural network based on the distribution of high-performing architectures in the search history. We introduce a quantile-driven training procedure which efficiently trains L$^{2}$NAS in an actor-critic framework via continuous-action reinforcement learning. Experiments show that L$^{2}$NAS achieves state-of-the-art results on NAS-Bench-201 benchmark as well as DARTS search space and Once-for-All MobileNetV3 search space. We also show that search policies generated by L$^{2}$NAS are generalizable and transferable across different training datasets with minimal fine-tuning.

* Accepted as a Full Research Paper at CIKM 2021; 10 pages, 3 Figures, 5 Tables

A Role-Selected Sharing Network for Joint Machine-Human Chatting Handoff and Service Satisfaction Analysis

Sep 17, 2021
Jiawei Liu, Kaisong Song, Yangyang Kang, Guoxiu He, Zhuoren Jiang, Changlong Sun, Wei Lu, Xiaozhong Liu

Chatbots are increasingly thriving in different domains; however, because of unexpected discourse complexity and training data sparseness, potential distrust in them raises serious concern. Recently, Machine-Human Chatting Handoff (MHCH), which predicts chatbot failure and enables human-algorithm collaboration to enhance chatbot quality, has attracted increasing attention from industry and academia. In this study, we propose a novel model, Role-Selected Sharing Network (RSSN), which integrates both dialogue satisfaction estimation and handoff prediction in one multi-task learning framework. Unlike prior efforts in dialog mining, by utilizing local user satisfaction as a bridge, the global satisfaction detector and the handoff predictor can effectively exchange critical information. Specifically, we decouple the relation and interaction between the two tasks by the role information after the shared encoder. Extensive experiments on two public datasets demonstrate the effectiveness of our model.

* 11 pages, 4 figures, accepted by the main conference of EMNLP 2021

To be Closer: Learning to Link up Aspects with Opinions

Sep 17, 2021
Yuxiang Zhou, Lejian Liao, Yang Gao, Zhanming Jie, Wei Lu

Dependency parse trees are helpful for discovering the opinion words in aspect-based sentiment analysis (ABSA). However, the trees obtained from off-the-shelf dependency parsers are static, and could be sub-optimal in ABSA. This is because the syntactic trees are not designed for capturing the interactions between opinion words and aspect words. In this work, we aim to shorten the distance between aspects and corresponding opinion words by learning an aspect-centric tree structure. The aspect and opinion words are expected to be closer along such a tree structure compared to the standard dependency parse tree. The learning process allows the tree structure to adaptively correlate the aspect and opinion words, enabling us to better identify the polarity in the ABSA task. We conduct experiments on five aspect-based sentiment datasets, and the proposed model significantly outperforms recent strong baselines. Furthermore, our thorough analysis demonstrates that the average distance between aspect and opinion words is shortened by at least 19% on the standard SemEval Restaurant14 dataset.

* Accepted as a long paper in the main conference of EMNLP 2021
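
The distance between an aspect word and an opinion word along a tree is simply the number of edges on the path connecting them. A minimal sketch of that measurement follows, assuming trees are given as head-index arrays; the function name and both example trees are hand-written for illustration and are neither parser output nor structures learned by the model.

```python
from collections import deque


def tree_distance(heads, src, dst):
    """Number of edges between tokens src and dst in a tree.

    heads[i] is the index of token i's parent, or -1 for the root.
    The tree is treated as undirected and searched breadth-first.
    """
    adj = {i: set() for i in range(len(heads))}
    for child, head in enumerate(heads):
        if head >= 0:
            adj[child].add(head)
            adj[head].add(child)

    queue = deque([(src, 0)])
    seen = {src}
    while queue:
        node, dist = queue.popleft()
        if node == dst:
            return dist
        for nxt in adj[node]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append((nxt, dist + 1))
    raise ValueError("src and dst are not connected")


# Five tokens; token 0 plays the aspect word, token 4 the opinion word.
chain_tree = [1, 2, 3, 4, -1]    # hypothetical tree: 0 -> 1 -> 2 -> 3 -> 4 (root)
aspect_tree = [-1, 0, 0, 0, 0]   # hypothetical tree rooted at the aspect word

print(tree_distance(chain_tree, 0, 4))   # 4
print(tree_distance(aspect_tree, 0, 4))  # 1
```

Averaging this distance over all aspect-opinion pairs in a dataset gives the kind of statistic the abstract reports for the Restaurant14 comparison.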

Uncovering Main Causalities for Long-tailed Information Extraction

Sep 11, 2021
Guoshun Nan, Jiaqi Zeng, Rui Qiao, Zhijiang Guo, Wei Lu

Information Extraction (IE) aims to extract structural information from unstructured texts. In practice, long-tailed distributions caused by the selection bias of a dataset may lead to incorrect correlations, also known as spurious correlations, between entities and labels in conventional likelihood models. This motivates us to propose counterfactual IE (CFIE), a novel framework that aims to uncover the main causalities behind data from the viewpoint of causal inference. Specifically, 1) we first introduce a unified structural causal model (SCM) for various IE tasks, describing the relationships among variables; 2) with our SCM, we then generate counterfactuals based on an explicit language structure to better calculate the direct causal effect during the inference stage; 3) we further propose a novel debiasing approach to yield more robust predictions. Experiments on three IE tasks across five public datasets show the effectiveness of our CFIE model in mitigating the spurious correlation issues.

* Accepted as a long paper in the main conference of EMNLP 2021
