Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Hao He

Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Oct 10, 2023
Caoyun Fan, Wenqing Chen, Jidong Tian, Yitian Li, Hao He, Yaohui Jin

Figure 1 for Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Figure 2 for Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Figure 3 for Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Figure 4 for Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Counterfactually-Augmented Data (CAD) -- minimal editing of sentences to flip the corresponding labels -- has the potential to improve the Out-Of-Distribution (OOD) generalization capability of language models, as CAD induces language models to exploit domain-independent causal features and exclude spurious correlations. However, the empirical results of CAD's OOD generalization are not as efficient as anticipated. In this study, we attribute the inefficiency to the myopia phenomenon caused by CAD: language models only focus on causal features that are edited in the augmentation operation and exclude other non-edited causal features. Therefore, the potential of CAD is not fully exploited. To address this issue, we analyze the myopia phenomenon in feature space from the perspective of Fisher's Linear Discriminant, then we introduce two additional constraints based on CAD's structural properties (dataset-level and sentence-level) to help language models extract more complete causal features in CAD, thereby mitigating the myopia phenomenon and improving OOD generalization capability. We evaluate our method on two tasks: Sentiment Analysis and Natural Language Inference, and the experimental results demonstrate that our method could unlock the potential of CAD and improve the OOD generalization performance of language models by 1.0% to 5.9%.

* Expert Systems With Applications 2023. arXiv admin note: text overlap with arXiv:2302.09345

Via

Access Paper or Ask Questions

Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Aug 08, 2023
Allegra Laro, Yanqing Chen, Hao He, Babak Aghazadeh

Figure 1 for Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Figure 2 for Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Figure 3 for Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Figure 4 for Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Personalized recommendations form an important part of today's internet ecosystem, helping artists and creators to reach interested users, and helping users to discover new and engaging content. However, many users today are skeptical of platforms that personalize recommendations, in part due to historically careless treatment of personal data and data privacy. Now, businesses that rely on personalized recommendations are entering a new paradigm, where many of their systems must be overhauled to be privacy-first. In this article, we propose an algorithm for personalized recommendations that facilitates both precise and differentially-private measurement. We consider advertising as an example application, and conduct offline experiments to quantify how the proposed privacy-preserving algorithm affects key metrics related to user experience, advertiser value, and platform revenue compared to the extremes of both (private) non-personalized and non-private, personalized implementations.

* Submitted to AAAI

Via

Access Paper or Ask Questions

Taxonomy-Structured Domain Adaptation

Jul 01, 2023
Tianyi Liu, Zihao Xu, Hao He, Guang-Yuan Hao, Guang-He Lee, Hao Wang

Figure 1 for Taxonomy-Structured Domain Adaptation

Figure 2 for Taxonomy-Structured Domain Adaptation

Figure 3 for Taxonomy-Structured Domain Adaptation

Figure 4 for Taxonomy-Structured Domain Adaptation

Domain adaptation aims to mitigate distribution shifts among different domains. However, traditional formulations are mostly limited to categorical domains, greatly simplifying nuanced domain relationships in the real world. In this work, we tackle a generalization with taxonomy-structured domains, which formalizes domains with nested, hierarchical similarity structures such as animal species and product catalogs. We build on the classic adversarial framework and introduce a novel taxonomist, which competes with the adversarial discriminator to preserve the taxonomy information. The equilibrium recovers the classic adversarial domain adaptation's solution if given a non-informative domain taxonomy (e.g., a flat taxonomy where all leaf nodes connect to the root node) while yielding non-trivial results with other taxonomies. Empirically, our method achieves state-of-the-art performance on both synthetic and real-world datasets with successful adaptation. Code is available at https://github.com/Wang-ML-Lab/TSDA.

* Accepted by ICML 2023

Via

Access Paper or Ask Questions

Rethinking Rendering in Generalizable Neural Surface Reconstruction: A Learning-based Solution

May 30, 2023
Yixun Liang, Hao He, Ying-cong Chen

Figure 1 for Rethinking Rendering in Generalizable Neural Surface Reconstruction: A Learning-based Solution

Figure 2 for Rethinking Rendering in Generalizable Neural Surface Reconstruction: A Learning-based Solution

Figure 3 for Rethinking Rendering in Generalizable Neural Surface Reconstruction: A Learning-based Solution

Figure 4 for Rethinking Rendering in Generalizable Neural Surface Reconstruction: A Learning-based Solution

Generalizable neural surface reconstruction techniques have attracted great attention in recent years. However, they encounter limitations of low confidence depth distribution and inaccurate surface reasoning due to the oversimplified volume rendering process employed. In this paper, we present Reconstruction TRansformer (ReTR), a novel framework that leverages the transformer architecture to redesign the rendering process, enabling complex photon-particle interaction modeling. It introduces a learnable meta-ray token and utilizes the cross-attention mechanism to simulate the interaction of photons with sampled points and render the observed color. Meanwhile, by operating within a high-dimensional feature space rather than the color space, ReTR mitigates sensitivity to projected colors in source views. Such improvements result in accurate surface assessment with high confidence. We demonstrate the effectiveness of our approach on various datasets, showcasing how our method outperforms the current state-of-the-art approaches in terms of reconstruction quality and generalization ability.

* 18 pages, 11 Figures, Our code will be released at https://github.com/YixunLiang/ReTR

Via

Access Paper or Ask Questions

Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models

Apr 08, 2023
Yiheng Liu, Tianle Han, Siyuan Ma, Jiayue Zhang, Yuanyuan Yang, Jiaming Tian, Hao He, Antong Li, Mengshen He, Zhengliang Liu, Zihao Wu, Dajiang Zhu, Xiang Li, Ning Qiang, Dingang Shen, Tianming Liu, Bao Ge

Figure 1 for Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models

Figure 2 for Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models

Figure 3 for Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models

This paper presents a comprehensive survey of ChatGPT and GPT-4, state-of-the-art large language models (LLM) from the GPT series, and their prospective applications across diverse domains. Indeed, key innovations such as large-scale pre-training that captures knowledge across the entire world wide web, instruction fine-tuning and Reinforcement Learning from Human Feedback (RLHF) have played significant roles in enhancing LLMs' adaptability and performance. We performed an in-depth analysis of 194 relevant papers on arXiv, encompassing trend analysis, word cloud representation, and distribution analysis across various application domains. The findings reveal a significant and increasing interest in ChatGPT/GPT-4 research, predominantly centered on direct natural language processing applications, while also demonstrating considerable potential in areas ranging from education and history to mathematics, medicine, and physics. This study endeavors to furnish insights into ChatGPT's capabilities, potential implications, ethical concerns, and offer direction for future advancements in this field.

* 35 pages, 3 figures

Via

Access Paper or Ask Questions

Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Mar 19, 2023
Yixun Liang, Hao He, Shishi Xiao, Hao Lu, Yingcong Chen

Figure 1 for Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Figure 2 for Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Figure 3 for Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Figure 4 for Label Name is Mantra: Unifying Point Cloud Segmentation across Heterogeneous Datasets

Point cloud segmentation is a fundamental task in 3D vision that serves a wide range of applications. Although great progresses have been made these years, its practical usability is still limited by the availability of training data. Existing approaches cannot make full use of multiple datasets on hand due to the label mismatch among different datasets. In this paper, we propose a principled approach that supports learning from heterogeneous datasets with different label sets. Our idea is to utilize a pre-trained language model to embed discrete labels to a continuous latent space with the help of their label names. This unifies all labels of different datasets, so that joint training is doable. Meanwhile, classifying points in the continuous 3D space by their vocabulary tokens significantly increase the generalization ability of the model in comparison with existing approaches that have fixed decoder architecture. Besides, we also integrate prompt learning in our framework to alleviate data shifts among different data sources. Extensive experiments demonstrate that our model outperforms the state-of-the-art by a large margin.

* 11 pages, 5 figures

Via

Access Paper or Ask Questions

Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Mar 07, 2023
Jierun Chen, Shiu-hong Kao, Hao He, Weipeng Zhuo, Song Wen, Chul-Ho Lee, S. -H. Gary Chan

Figure 1 for Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Figure 2 for Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Figure 3 for Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

Figure 4 for Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks

To design fast neural networks, many works have been focusing on reducing the number of floating-point operations (FLOPs). We observe that such reduction in FLOPs, however, does not necessarily lead to a similar level of reduction in latency. This mainly stems from inefficiently low floating-point operations per second (FLOPS). To achieve faster networks, we revisit popular operators and demonstrate that such low FLOPS is mainly due to frequent memory access of the operators, especially the depthwise convolution. We hence propose a novel partial convolution (PConv) that extracts spatial features more efficiently, by cutting down redundant computation and memory access simultaneously. Building upon our PConv, we further propose FasterNet, a new family of neural networks, which attains substantially higher running speed than others on a wide range of devices, without compromising on accuracy for various vision tasks. For example, on ImageNet-1k, our tiny FasterNet-T0 is $3.1\times$, $3.1\times$, and $2.5\times$ faster than MobileViT-XXS on GPU, CPU, and ARM processors, respectively, while being $2.9\%$ more accurate. Our large FasterNet-L achieves impressive $83.5\%$ top-1 accuracy, on par with the emerging Swin-B, while having $49\%$ higher inference throughput on GPU, as well as saving $42\%$ compute time on CPU. Code is available at \url{https://github.com/JierunChen/FasterNet}.

* Accepted to CVPR 2023

Via

Access Paper or Ask Questions