Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Wei Wang

Bi-Level Attention Graph Neural Networks

Apr 23, 2023
Roshni G. Iyer, Wei Wang, Yizhou Sun

Figure 1 for Bi-Level Attention Graph Neural Networks

Figure 2 for Bi-Level Attention Graph Neural Networks

Figure 3 for Bi-Level Attention Graph Neural Networks

Figure 4 for Bi-Level Attention Graph Neural Networks

Recent graph neural networks (GNNs) with the attention mechanism have historically been limited to small-scale homogeneous graphs (HoGs). However, GNNs handling heterogeneous graphs (HeGs), which contain several entity and relation types, all have shortcomings in handling attention. Most GNNs that learn graph attention for HeGs learn either node-level or relation-level attention, but not both, limiting their ability to predict both important entities and relations in the HeG. Even the best existing method that learns both levels of attention has the limitation of assuming graph relations are independent and that its learned attention disregards this dependency association. To effectively model both multi-relational and multi-entity large-scale HeGs, we present Bi-Level Attention Graph Neural Networks (BA-GNN), scalable neural networks (NNs) that use a novel bi-level graph attention mechanism. BA-GNN models both node-node and relation-relation interactions in a personalized way, by hierarchically attending to both types of information from local neighborhood contexts instead of the global graph context. Rigorous experiments on seven real-world HeGs show BA-GNN consistently outperforms all baselines, and demonstrate quality and transferability of its learned relation-level attention to improve performance of other GNNs.

Via

Access Paper or Ask Questions

Dehazing-NeRF: Neural Radiance Fields from Hazy Images

Apr 22, 2023
Tian Li, LU Li, Wei Wang, Zhangchi Feng

Figure 1 for Dehazing-NeRF: Neural Radiance Fields from Hazy Images

Figure 2 for Dehazing-NeRF: Neural Radiance Fields from Hazy Images

Figure 3 for Dehazing-NeRF: Neural Radiance Fields from Hazy Images

Figure 4 for Dehazing-NeRF: Neural Radiance Fields from Hazy Images

Neural Radiance Field (NeRF) has received much attention in recent years due to the impressively high quality in 3D scene reconstruction and novel view synthesis. However, image degradation caused by the scattering of atmospheric light and object light by particles in the atmosphere can significantly decrease the reconstruction quality when shooting scenes in hazy conditions. To address this issue, we propose Dehazing-NeRF, a method that can recover clear NeRF from hazy image inputs. Our method simulates the physical imaging process of hazy images using an atmospheric scattering model, and jointly learns the atmospheric scattering model and a clean NeRF model for both image dehazing and novel view synthesis. Different from previous approaches, Dehazing-NeRF is an unsupervised method with only hazy images as the input, and also does not rely on hand-designed dehazing priors. By jointly combining the depth estimated from the NeRF 3D scene with the atmospheric scattering model, our proposed model breaks through the ill-posed problem of single-image dehazing while maintaining geometric consistency. Besides, to alleviate the degradation of image quality caused by information loss, soft margin consistency regularization, as well as atmospheric consistency and contrast discriminative loss, are addressed during the model training process. Extensive experiments demonstrate that our method outperforms the simple combination of single-image dehazing and NeRF on both image dehazing and novel view image synthesis.

Via

Access Paper or Ask Questions

Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Apr 17, 2023
Weinan Guan, Wei Wang, Jing Dong, Bo Peng, Tieniu Tan

Figure 1 for Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Figure 2 for Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Figure 3 for Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Figure 4 for Collaborative Feature Learning for Fine-grained Facial Forgery Detection and Segmentation

Detecting maliciously falsified facial images and videos has attracted extensive attention from digital-forensics and computer-vision communities. An important topic in manipulation detection is the localization of the fake regions. Previous work related to forgery detection mostly focuses on the entire faces. However, recent forgery methods have developed to edit important facial components while maintaining others unchanged. This drives us to not only focus on the forgery detection but also fine-grained falsified region segmentation. In this paper, we propose a collaborative feature learning approach to simultaneously detect manipulation and segment the falsified components. With the collaborative manner, detection and segmentation can boost each other efficiently. To enable our study of forgery detection and segmentation, we build a facial forgery dataset consisting of both entire and partial face forgeries with their pixel-level manipulation ground-truth. Experiment results have justified the mutual promotion between forgery detection and manipulated region segmentation. The overall performance of the proposed approach is better than the state-of-the-art detection or segmentation approaches. The visualization results have shown that our proposed model always captures the artifacts on facial regions, which is more reasonable.

Via

Access Paper or Ask Questions

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Apr 16, 2023
Junfeng Tian, Hehong Chen, Guohai Xu, Ming Yan, Xing Gao, Jianhai Zhang, Chenliang Li, Jiayi Liu, Wenshen Xu, Haiyang Xu, Qi Qian, Wei Wang, Qinghao Ye, Jiejing Zhang, Ji Zhang, Fei Huang, Jingren Zhou

Figure 1 for ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Figure 2 for ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Figure 3 for ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Figure 4 for ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

In this paper, we present ChatPLUG, a Chinese open-domain dialogue system for digital human applications that instruction finetunes on a wide range of dialogue tasks in a unified internet-augmented format. Different from other open-domain dialogue models that focus on large-scale pre-training and scaling up model size or dialogue corpus, we aim to build a powerful and practical dialogue system for digital human with diverse skills and good multi-task generalization by internet-augmented instruction tuning. To this end, we first conduct large-scale pre-training on both common document corpus and dialogue data with curriculum learning, so as to inject various world knowledge and dialogue abilities into ChatPLUG. Then, we collect a wide range of dialogue tasks spanning diverse features of knowledge, personality, multi-turn memory, and empathy, on which we further instruction tune \modelname via unified natural language instruction templates. External knowledge from an internet search is also used during instruction finetuning for alleviating the problem of knowledge hallucinations. We show that \modelname outperforms state-of-the-art Chinese dialogue systems on both automatic and human evaluation, and demonstrates strong multi-task generalization on a variety of text understanding and generation tasks. In addition, we deploy \modelname to real-world applications such as Smart Speaker and Instant Message applications with fast inference. Our models and code will be made publicly available on ModelScope~\footnote{\small{https://modelscope.cn/models/damo/ChatPLUG-3.7B}} and Github~\footnote{\small{https://github.com/X-PLUG/ChatPLUG}}.

* 36 pages

Via

Access Paper or Ask Questions

GPSMirror: Expanding Accurate GPS Positioning to Shadowed and Indoor Regions with Backscatter

Apr 15, 2023
Huixin Dong, Yirong Xie, Xianan Zhang, Wei Wang, Xinyu Zhang, Jianhua He

Figure 1 for GPSMirror: Expanding Accurate GPS Positioning to Shadowed and Indoor Regions with Backscatter

Figure 2 for GPSMirror: Expanding Accurate GPS Positioning to Shadowed and Indoor Regions with Backscatter

Figure 3 for GPSMirror: Expanding Accurate GPS Positioning to Shadowed and Indoor Regions with Backscatter

Figure 4 for GPSMirror: Expanding Accurate GPS Positioning to Shadowed and Indoor Regions with Backscatter

Despite the prevalence of GPS services, they still suffer from intermittent positioning with poor accuracy in partially shadowed regions like urban canyons, flyover shadows, and factories' indoor areas. Existing wisdom relies on hardware modifications of GPS receivers or power-hungry infrastructures requiring continuous plug-in power supply which is hard to provide in outdoor regions and some factories. This paper fills the gap with GPSMirror, the first GPS-strengthening system that works for unmodified smartphones with the assistance of newly-designed GPS backscatter tags. The key enabling techniques in GPSMirror include: (i) a careful hardware design with microwatt-level power consumption that pushes the limit of backscatter sensitivity to re-radiate extremely weak GPS signals with enough coverage approaching the regulation limit; and (ii) a novel GPS positioning algorithm achieving meter-level accuracy in shadowed regions as well as expanding locatable regions under inadequate satellites where conventional algorithms fail. We build a prototype of the GPSMirror tags and conduct comprehensive experiments to evaluate them. Our results show that a GPSMirror tag can provide coverage up to 27.7 m. GPSMirror achieves median positioning accuracy of 3.7 m indoors and 4.6 m in urban canyon environments, respectively.

* 13 pages, 26 figures, to appear in MobiCom 2023

Via

Access Paper or Ask Questions

InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

Apr 12, 2023
Seungbae Kim, Jyun-Yu Jiang, Jinyoung Han, Wei Wang

Figure 1 for InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

Figure 2 for InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

Figure 3 for InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

Figure 4 for InfluencerRank: Discovering Effective Influencers via Graph Convolutional Attentive Recurrent Neural Networks

As influencers play considerable roles in social media marketing, companies increase the budget for influencer marketing. Hiring effective influencers is crucial in social influencer marketing, but it is challenging to find the right influencers among hundreds of millions of social media users. In this paper, we propose InfluencerRank that ranks influencers by their effectiveness based on their posting behaviors and social relations over time. To represent the posting behaviors and social relations, the graph convolutional neural networks are applied to model influencers with heterogeneous networks during different historical periods. By learning the network structure with the embedded node features, InfluencerRank can derive informative representations for influencers at each period. An attentive recurrent neural network finally distinguishes highly effective influencers from other influencers by capturing the knowledge of the dynamics of influencer representations over time. Extensive experiments have been conducted on an Instagram dataset that consists of 18,397 influencers with their 2,952,075 posts published within 12 months. The experimental results demonstrate that InfluencerRank outperforms existing baseline methods. An in-depth analysis further reveals that all of our proposed features and model components are beneficial to discover effective influencers.

* ICWSM 2023

Via

Access Paper or Ask Questions

RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Apr 11, 2023
Zheng Yuan, Hongyi Yuan, Chuanqi Tan, Wei Wang, Songfang Huang, Fei Huang

Figure 1 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Figure 2 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Figure 3 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Figure 4 for RRHF: Rank Responses to Align Language Models with Human Feedback without tears

Reinforcement Learning from Human Feedback (RLHF) facilitates the alignment of large language models with human preferences, significantly enhancing the quality of interactions between humans and these models. InstructGPT implements RLHF through several stages, including Supervised Fine-Tuning (SFT), reward model training, and Proximal Policy Optimization (PPO). PPO, however, is sensitive to hyperparameters and requires a minimum of four models in its standard implementation, which makes it hard to train. In contrast, we propose a novel learning paradigm called RRHF, which scores responses generated by different sampling policies and learns to align them with human preferences through ranking loss. RRHF can efficiently align language model output probabilities with human preferences as robust as fine-tuning and it only needs 1 to 2 models during tuning. In addition, RRHF can be considered an extension of SFT and reward models while being simpler than PPO in terms of coding, model counts, and hyperparameters. The entire alignment process can be accomplished within a single RRHF training session. We evaluate RRHF using LLaMA and Alpaca on Helpful and Harmless data, demonstrating performance comparable to PPO.

* Codes available at https://github.com/GanjinZero/RRHF

Via

Access Paper or Ask Questions

High-frequency Limits for 3D-Printed Gradient-index (GRIN) Lens Antennas

Apr 10, 2023
Wei Wang, Philip Lambert, Jonathan Chisum

Figure 1 for High-frequency Limits for 3D-Printed Gradient-index (GRIN) Lens Antennas

Figure 2 for High-frequency Limits for 3D-Printed Gradient-index (GRIN) Lens Antennas

Figure 3 for High-frequency Limits for 3D-Printed Gradient-index (GRIN) Lens Antennas

Figure 4 for High-frequency Limits for 3D-Printed Gradient-index (GRIN) Lens Antennas

Artificial dielectrics are widely used for Gradient-Index (GRIN) lens antennas. The unit-cell size of an artificial dielectric determines the maximum operating frequency and also drives cost and yield. To explore the frequency limitations we printed four identical Luneburg lens antennas using gyroid unit-cells of 12.5, 10, 7.5, and 5mm and measured their gain over the K- and Ka-band. We find maximum frequencies of 20, 25, 33, and >40GHz for each unit-cell, respectively. These measurements suggest a print resolution limit of $0.7\lambda_g$, where $\lambda_g$ is the wavelength in the host dielectric.

Via

Access Paper or Ask Questions

Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Apr 07, 2023
Gongning Luo, Kuanquan Wang, Jun Liu, Shuo Li, Xinjie Liang, Xiangyu Li, Shaowei Gan, Wei Wang, Suyu Dong, Wenyi Wang, Pengxin Yu, Enyou Liu, Hongrong Wei, Na Wang, Jia Guo, Huiqi Li, Zhao Zhang, Ziwei Zhao, Na Gao, Nan An, Ashkan Pakzad, Bojidar Rangelov, Jiaqi Dou, Song Tian, Zeyu Liu, Yi Wang, Ampatishan Sivalingam, Kumaradevan Punithakumar, Zhaowen Qiu, Xin Gao

Figure 1 for Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Figure 2 for Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Figure 3 for Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Figure 4 for Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge

Efficient automatic segmentation of multi-level (i.e. main and branch) pulmonary arteries (PA) in CTPA images plays a significant role in clinical applications. However, most existing methods concentrate only on main PA or branch PA segmentation separately and ignore segmentation efficiency. Besides, there is no public large-scale dataset focused on PA segmentation, which makes it highly challenging to compare the different methods. To benchmark multi-level PA segmentation algorithms, we organized the first \textbf{P}ulmonary \textbf{AR}tery \textbf{SE}gmentation (PARSE) challenge. On the one hand, we focus on both the main PA and the branch PA segmentation. On the other hand, for better clinical application, we assign the same score weight to segmentation efficiency (mainly running time and GPU memory consumption during inference) while ensuring PA segmentation accuracy. We present a summary of the top algorithms and offer some suggestions for efficient and accurate multi-level PA automatic segmentation. We provide the PARSE challenge as open-access for the community to benchmark future algorithm developments at \url{https://parse2022.grand-challenge.org/Parse2022/}.

Via

Access Paper or Ask Questions