Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Zhuo Wang

Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

Jun 02, 2023

Zhuo Wang, Rongzhen Li, Bowen Dong, Jie Wang, Xiuxing Li, Ning Liu, Chenhui Mao, Wei Zhang, Liling Dong, Jing Gao(+1 more)

Figure 1 for Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

Figure 2 for Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

Figure 3 for Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

Figure 4 for Can LLMs like GPT-4 outperform traditional AI tools in dementia diagnosis? Maybe, but not today

Abstract:Recent investigations show that large language models (LLMs), specifically GPT-4, not only have remarkable capabilities in common Natural Language Processing (NLP) tasks but also exhibit human-level performance on various professional and academic benchmarks. However, whether GPT-4 can be directly used in practical applications and replace traditional artificial intelligence (AI) tools in specialized domains requires further experimental validation. In this paper, we explore the potential of LLMs such as GPT-4 to outperform traditional AI tools in dementia diagnosis. Comprehensive comparisons between GPT-4 and traditional AI tools are conducted to examine their diagnostic accuracy in a clinical setting. Experimental results on two real clinical datasets show that, although LLMs like GPT-4 demonstrate potential for future advancements in dementia diagnosis, they currently do not surpass the performance of traditional AI tools. The interpretability and faithfulness of GPT-4 are also evaluated by comparison with real doctors. We discuss the limitations of GPT-4 in its current state and propose future research directions to enhance GPT-4 in dementia diagnosis.

* 16 pages, 6 figures

Via

Access Paper or Ask Questions

Not Just Plain Text! Fuel Document-Level Relation Extraction with Explicit Syntax Refinement and Subsentence Modeling

Nov 10, 2022

Zhichao Duan, Xiuxing Li, Zhenyu Li, Zhuo Wang, Jianyong Wang

Abstract:Document-level relation extraction (DocRE) aims to identify semantic labels among entities within a single document. One major challenge of DocRE is to dig decisive details regarding a specific entity pair from long text. However, in many cases, only a fraction of text carries required information, even in the manually labeled supporting evidence. To better capture and exploit instructive information, we propose a novel expLicit syntAx Refinement and Subsentence mOdeliNg based framework (LARSON). By introducing extra syntactic information, LARSON can model subsentences of arbitrary granularity and efficiently screen instructive ones. Moreover, we incorporate refined syntax into text representations which further improves the performance of LARSON. Experimental results on three benchmark datasets (DocRED, CDR, and GDA) demonstrate that LARSON significantly outperforms existing methods.

* Findings of EMNLP 2022

Via

Access Paper or Ask Questions

Cloud removal Using Atmosphere Model

Oct 05, 2022

Yi Guo, Feng Li, Zhuo Wang

Figure 1 for Cloud removal Using Atmosphere Model

Figure 2 for Cloud removal Using Atmosphere Model

Figure 3 for Cloud removal Using Atmosphere Model

Figure 4 for Cloud removal Using Atmosphere Model

Abstract:Cloud removal is an essential task in remote sensing data analysis. As the image sensors are distant from the earth ground, it is likely that part of the area of interests is covered by cloud. Moreover, the atmosphere in between creates a constant haze layer upon the acquired images. To recover the ground image, we propose to use scattering model for temporal sequence of images of any scene in the framework of low rank and sparse models. We further develop its variant, which is much faster and yet more accurate. To measure the performance of different methods {\em objectively}, we develop a semi-realistic simulation method to produce cloud cover so that various methods can be quantitatively analysed, which enables detailed study of many aspects of cloud removal algorithms, including verifying the effectiveness of proposed models in comparison with the state-of-the-arts, including deep learning models, and addressing the long standing problem of the determination of regularisation parameters. The latter is companioned with theoretic analysis on the range of the sparsity regularisation parameter and verified numerically.

Via

Access Paper or Ask Questions

Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

Aug 08, 2022

Wenbo Zhang, Keren Fu, Zhuo Wang, Ge-Peng Ji, Qijun Zhao

Figure 1 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

Figure 2 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

Figure 3 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

Figure 4 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D and Video Salient Object Detection

Abstract:Recently CNN-based RGB-D salient object detection (SOD) has obtained significant improvement on detection accuracy. However, existing models often fail to perform well in terms of efficiency and accuracy simultaneously. This hinders their potential applications on mobile devices as well as many real-world problems. To bridge the accuracy gap between lightweight and large models for RGB-D SOD, in this paper, an efficient module that can greatly improve the accuracy but adds little computation is proposed. Inspired by the fact that depth quality is a key factor influencing the accuracy, we propose an efficient depth quality-inspired feature manipulation (DQFM) process, which can dynamically filter depth features according to depth quality. The proposed DQFM resorts to the alignment of low-level RGB and depth features, as well as holistic attention of the depth stream to explicitly control and enhance cross-modal fusion. We embed DQFM to obtain an efficient lightweight RGB-D SOD model called DFM-Net, where we in addition design a tailored depth backbone and a two-stage decoder as basic parts. Extensive experimental results on nine RGB-D datasets demonstrate that our DFM-Net outperforms recent efficient models, running at about 20 FPS on CPU with only 8.5Mb model size, and meanwhile being 2.9/2.4 times faster and 6.7/3.1 times smaller than the latest best models A2dele and MobileSal. It also maintains state-of-the-art accuracy when even compared to non-efficient models. Interestingly, further statistics and analyses verify the ability of DQFM in distinguishing depth maps of various qualities without any quality labels. Last but not least, we further apply DFM-Net to deal with video SOD (VSOD), achieving comparable performance against recent efficient models while being 3/2.3 times faster/smaller than the prior best in this field. Our code is available at https://github.com/zwbx/DFM-Net.

* submitted to IJCV. arXiv admin note: substantial text overlap with arXiv:2107.01779

Via

Access Paper or Ask Questions

Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

Mar 18, 2022

Zhuo Wang, Zezheng Wang, Zitong Yu, Weihong Deng, Jiahong Li, Tingting Gao, Zhongyuan Wang

Figure 1 for Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

Figure 2 for Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

Figure 3 for Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

Figure 4 for Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

Abstract:With diverse presentation attacks emerging continually, generalizable face anti-spoofing (FAS) has drawn growing attention. Most existing methods implement domain generalization (DG) on the complete representations. However, different image statistics may have unique properties for the FAS tasks. In this work, we separate the complete representation into content and style ones. A novel Shuffled Style Assembly Network (SSAN) is proposed to extract and reassemble different content and style features for a stylized feature space. Then, to obtain a generalized representation, a contrastive learning strategy is developed to emphasize liveness-related style information while suppress the domain-specific one. Finally, the representations of the correct assemblies are used to distinguish between living and spoofing during the inferring. On the other hand, despite the decent performance, there still exists a gap between academia and industry, due to the difference in data quantity and distribution. Thus, a new large-scale benchmark for FAS is built up to further evaluate the performance of algorithms in reality. Both qualitative and quantitative results on existing and proposed benchmarks demonstrate the effectiveness of our methods. The codes will be available at https://github.com/wangzhuo2019/SSAN.

* Accepted by CVPR2022

Via

Access Paper or Ask Questions

Scalable Rule-Based Representation Learning for Interpretable Classification

Sep 30, 2021

Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

Figure 1 for Scalable Rule-Based Representation Learning for Interpretable Classification

Figure 2 for Scalable Rule-Based Representation Learning for Interpretable Classification

Figure 3 for Scalable Rule-Based Representation Learning for Interpretable Classification

Figure 4 for Scalable Rule-Based Representation Learning for Interpretable Classification

Abstract:Rule-based models, e.g., decision trees, are widely used in scenarios demanding high model interpretability for their transparent inner structures and good model expressivity. However, rule-based models are hard to optimize, especially on large data sets, due to their discrete parameters and structures. Ensemble methods and fuzzy/soft rules are commonly used to improve performance, but they sacrifice the model interpretability. To obtain both good scalability and interpretability, we propose a new classifier, named Rule-based Representation Learner (RRL), that automatically learns interpretable non-fuzzy rules for data representation and classification. To train the non-differentiable RRL effectively, we project it to a continuous space and propose a novel training method, called Gradient Grafting, that can directly optimize the discrete model using gradient descent. An improved design of logical activation functions is also devised to increase the scalability of RRL and enable it to discretize the continuous features end-to-end. Exhaustive experiments on nine small and four large data sets show that RRL outperforms the competitive interpretable approaches and can be easily adjusted to obtain a trade-off between classification accuracy and model complexity for different scenarios. Our code is available at: https://github.com/12wang3/rrl.

* Accepted by NeurIPS 2021; Interpretable ML; Neuro-Symbolic AI

Via

Access Paper or Ask Questions

Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection

Jul 06, 2021

Wenbo Zhang, Ge-Peng Ji, Zhuo Wang, Keren Fu, Qijun Zhao

Figure 1 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection

Figure 2 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection

Figure 3 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection

Figure 4 for Depth Quality-Inspired Feature Manipulation for Efficient RGB-D Salient Object Detection

Abstract:RGB-D salient object detection (SOD) recently has attracted increasing research interest by benefiting conventional RGB SOD with extra depth information. However, existing RGB-D SOD models often fail to perform well in terms of both efficiency and accuracy, which hinders their potential applications on mobile devices and real-world problems. An underlying challenge is that the model accuracy usually degrades when the model is simplified to have few parameters. To tackle this dilemma and also inspired by the fact that depth quality is a key factor influencing the accuracy, we propose a novel depth quality-inspired feature manipulation (DQFM) process, which is efficient itself and can serve as a gating mechanism for filtering depth features to greatly boost the accuracy. DQFM resorts to the alignment of low-level RGB and depth features, as well as holistic attention of the depth stream to explicitly control and enhance cross-modal fusion. We embed DQFM to obtain an efficient light-weight model called DFM-Net, where we also design a tailored depth backbone and a two-stage decoder for further efficiency consideration. Extensive experimental results demonstrate that our DFM-Net achieves state-of-the-art accuracy when comparing to existing non-efficient models, and meanwhile runs at 140ms on CPU (2.2$\times$ faster than the prior fastest efficient model) with only $\sim$8.5Mb model size (14.9% of the prior lightest). Our code will be available at https://github.com/zwbx/DFM-Net.

* accepted in ACM MM 2021

Via

Access Paper or Ask Questions

Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

Jan 12, 2020

Wenhui Huang, Francesco Braghin, Zhuo Wang

Figure 1 for Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

Figure 2 for Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

Figure 3 for Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

Figure 4 for Learning to drive via Apprenticeship Learning and Deep Reinforcement Learning

Abstract:With the implementation of reinforcement learning (RL) algorithms, current state-of-art autonomous vehicle technology have the potential to get closer to full automation. However, most of the applications have been limited to game domains or discrete action space which are far from the real world driving. Moreover, it is very tough to tune the parameters of reward mechanism since the driving styles vary a lot among the different users. For instance, an aggressive driver may prefer driving with high acceleration whereas some conservative drivers prefer a safer driving style. Therefore, we propose an apprenticeship learning in combination with deep reinforcement learning approach that allows the agent to learn the driving and stopping behaviors with continuous actions. We use gradient inverse reinforcement learning (GIRL) algorithm to recover the unknown reward function and employ REINFORCE as well as Deep Deterministic Policy Gradient algorithm (DDPG) to learn the optimal policy. The performance of our method is evaluated in simulation-based scenario and the results demonstrate that the agent performs human like driving and even better in some aspects after training.

* 7 pages, 11 figures, conference

Via

Access Paper or Ask Questions

Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

Dec 10, 2019

Zhuo Wang, Wei Zhang, Ning Liu, Jianyong Wang

Figure 1 for Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

Figure 2 for Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

Figure 3 for Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

Figure 4 for Transparent Classification with Multilayer Logical Perceptrons and Random Binarization

Abstract:Models with transparent inner structure and high classification performance are required to reduce potential risk and provide trust for users in domains like health care, finance, security, etc. However, existing models are hard to simultaneously satisfy the above two properties. In this paper, we propose a new hierarchical rule-based model for classification tasks, named Concept Rule Sets (CRS), which has both a strong expressive ability and a transparent inner structure. To address the challenge of efficiently learning the non-differentiable CRS model, we propose a novel neural network architecture, Multilayer Logical Perceptron (MLLP), which is a continuous version of CRS. Using MLLP and the Random Binarization (RB) method we proposed, we can search the discrete solution of CRS in continuous space using gradient descent and ensure the discrete CRS acts almost the same as the corresponding continuous MLLP. Experiments on 12 public data sets show that CRS outperforms the state-of-the-art approaches and the complexity of the learned CRS is close to the simple decision tree.

* AAAI-20 oral

Via

Access Paper or Ask Questions

ColluEagle: Collusive review spammer detection using Markov random fields

Nov 05, 2019

Zhuo Wang, Runlong Hu, Qian Chen, Pei Gao, Xiaowei Xu

Figure 1 for ColluEagle: Collusive review spammer detection using Markov random fields

Figure 2 for ColluEagle: Collusive review spammer detection using Markov random fields

Figure 3 for ColluEagle: Collusive review spammer detection using Markov random fields

Figure 4 for ColluEagle: Collusive review spammer detection using Markov random fields

Abstract:Product reviews are extremely valuable for online shoppers in providing purchase decisions. Driven by immense profit incentives, fraudsters deliberately fabricate untruthful reviews to distort the reputation of online products. As online reviews become more and more important, group spamming, i.e., a team of fraudsters working collaboratively to attack a set of target products, becomes a new fashion. Previous works use review network effects, i.e. the relationships among reviewers, reviews, and products, to detect fake reviews or review spammers, but ignore time effects, which are critical in characterizing group spamming. In this paper, we propose a novel Markov random field (MRF)-based method (ColluEagle) to detect collusive review spammers, as well as review spam campaigns, considering both network effects and time effects. First we identify co-review pairs, a review phenomenon that happens between two reviewers who review a common product in a similar way, and then model reviewers and their co-review pairs as a pairwise-MRF, and use loopy belief propagation to evaluate the suspiciousness of reviewers. We further design a high quality yet easy-to-compute node prior for ColluEagle, through which the review spammer groups can also be subsequently identified. Experiments show that ColluEagle can not only detect collusive spammers with high precision, significantly outperforming state-of-the-art baselines --- FraudEagle and SpEagle, but also identify highly suspicious review spammer campaigns.

* 16 pages, 12 figures

Via

Access Paper or Ask Questions