Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Bing Xue

Complex Permittivity Characterization of Low-Loss Dielectric Slabs at Sub-THz

May 09, 2024

Bing Xue, Katsuyuki Haneda, Clemens Icheln, Juha Tuomela

Abstract:This manuscript presents a novel method for characterizing the permittivities of low-loss dielectric slabs in sub-terahertz (sub-THz) frequencies, specifically above 100 GHz using a quasi-optical system. The algorithm is introduced with detailed derivations, and the measurement sensitivity is analyzed through simulations. Subsequently, the method's validity is established via simulations, demonstrating high accuracy (error 0.1% for the loss tangent) for a 30 mm thick plate material and relatively lower accuracy (error <5% for the loss tangent) for a 6 mm thick plate material. Notably, this accuracy surpasses that of the approach presented in [1] when the same window width is used to extract signals. Furthermore, a comparison between the permittivities of plexiglass with a 30 mm thickness characterized by the proposed method and the approach in [1] reveals a maximum difference in the dielectric constant of 0.011 and in loss tangent of 0.00071 from 140 to 220 GHz. Finally, the relative complex permittivities of plexiglass at 142.86 GHz obtained by both methods are compared with the reference values provided in [2], exhibiting differences of 0.06 in the dielectric constant.

* 5 pages, for IEEE letter form paper submission

Via

Access Paper or Ask Questions

Genetic Programming for Explainable Manifold Learning

Mar 21, 2024

Ben Cravens, Andrew Lensen, Paula Maddigan, Bing Xue

Figure 1 for Genetic Programming for Explainable Manifold Learning

Figure 2 for Genetic Programming for Explainable Manifold Learning

Figure 3 for Genetic Programming for Explainable Manifold Learning

Figure 4 for Genetic Programming for Explainable Manifold Learning

Abstract:Manifold learning techniques play a pivotal role in machine learning by revealing lower-dimensional embeddings within high-dimensional data, thus enhancing both the efficiency and interpretability of data analysis by transforming the data into a lower-dimensional representation. However, a notable challenge with current manifold learning methods is their lack of explicit functional mappings, crucial for explainability in many real-world applications. Genetic programming, known for its interpretable functional tree-based models, has emerged as a promising approach to address this challenge. Previous research leveraged multi-objective GP to balance manifold quality against embedding dimensionality, producing functional mappings across a range of embedding sizes. Yet, these mapping trees often became complex, hindering explainability. In response, in this paper, we introduce Genetic Programming for Explainable Manifold Learning (GP-EMaL), a novel approach that directly penalises tree complexity. Our new method is able to maintain high manifold quality while significantly enhancing explainability and also allows customisation of complexity measures, such as symmetry balancing, scaling, and node complexity, catering to diverse application needs. Our experimental analysis demonstrates that GP-EMaL is able to match the performance of the existing approach in most cases, while using simpler, smaller, and more interpretable tree structures. This advancement marks a significant step towards achieving interpretable manifold learning.

Via

Access Paper or Ask Questions

Explaining Genetic Programming Trees using Large Language Models

Mar 06, 2024

Paula Maddigan, Andrew Lensen, Bing Xue

Figure 1 for Explaining Genetic Programming Trees using Large Language Models

Figure 2 for Explaining Genetic Programming Trees using Large Language Models

Figure 3 for Explaining Genetic Programming Trees using Large Language Models

Figure 4 for Explaining Genetic Programming Trees using Large Language Models

Abstract:Genetic programming (GP) has the potential to generate explainable results, especially when used for dimensionality reduction. In this research, we investigate the potential of leveraging eXplainable AI (XAI) and large language models (LLMs) like ChatGPT to improve the interpretability of GP-based non-linear dimensionality reduction. Our study introduces a novel XAI dashboard named GP4NLDR, the first approach to combine state-of-the-art GP with an LLM-powered chatbot to provide comprehensive, user-centred explanations. We showcase the system's ability to provide intuitive and insightful narratives on high-dimensional data reduction processes through case studies. Our study highlights the importance of prompt engineering in eliciting accurate and pertinent responses from LLMs. We also address important considerations around data privacy, hallucinatory outputs, and the rapid advancements in generative AI. Our findings demonstrate its potential in advancing the explainability of GP algorithms. This opens the door for future research into explaining GP models with LLMs.

Via

Access Paper or Ask Questions

Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

Mar 01, 2024

Christian Raymond, Qi Chen, Bing Xue, Mengjie Zhang

Figure 1 for Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

Figure 2 for Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

Figure 3 for Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

Figure 4 for Fast and Efficient Local Search for Genetic Programming Based Loss Function Learning

Abstract:In this paper, we develop upon the topic of loss function learning, an emergent meta-learning paradigm that aims to learn loss functions that significantly improve the performance of the models trained under them. Specifically, we propose a new meta-learning framework for task and model-agnostic loss function learning via a hybrid search approach. The framework first uses genetic programming to find a set of symbolic loss functions. Second, the set of learned loss functions is subsequently parameterized and optimized via unrolled differentiation. The versatility and performance of the proposed framework are empirically validated on a diverse set of supervised learning tasks. Results show that the learned loss functions bring improved convergence, sample efficiency, and inference performance on tabulated, computer vision, and natural language processing problems, using a variety of task-specific neural network architectures.

* arXiv admin note: substantial text overlap with arXiv:2209.08907

Via

Access Paper or Ask Questions

Prescribing Large Language Models for Perioperative Care: What's The Right Dose for Pre-trained Models?

Feb 28, 2024

Bing Xue, Charles Alba, Joanna Abraham, Thomas Kannampallil, Chenyang Lu

Figure 1 for Prescribing Large Language Models for Perioperative Care: What's The Right Dose for Pre-trained Models?

Figure 2 for Prescribing Large Language Models for Perioperative Care: What's The Right Dose for Pre-trained Models?

Figure 3 for Prescribing Large Language Models for Perioperative Care: What's The Right Dose for Pre-trained Models?

Figure 4 for Prescribing Large Language Models for Perioperative Care: What's The Right Dose for Pre-trained Models?

Abstract:Postoperative risk predictions can inform effective perioperative care management and planning. We aimed to assess whether clinical large language models (LLMs) can predict postoperative risks using clinical texts with various training strategies. The main cohort involved 84,875 records from Barnes Jewish Hospital (BJH) system between 2018 and 2021. Methods were replicated on Beth Israel Deaconess's MIMIC dataset. Both studies had mean duration of follow-up based on the length of postoperative ICU stay less than 7 days. For the BJH dataset, outcomes included 30-day mortality, pulmonary embolism (PE) and pneumonia. Three domain adaptation and finetuning strategies were implemented for BioGPT, ClinicalBERT and BioClinicalBERT: self-supervised objectives; incorporating labels with semi-supervised fine-tuning; and foundational modelling through multi-task learning. Model performance was compared using the area under the receiver operating characteristic curve (AUROC) and the area under the precision recall curve (AUPRC) for classification tasks, and mean squared error (MSE) and R2 for regression tasks. Pre-trained LLMs outperformed traditional word embeddings, with absolute maximal gains of 38.3% for AUROC and 14% for AUPRC. Adapting models further improved performance: (1) self-supervised finetuning by 3.2% for AUROC and 1.5% for AUPRC; (2) semi-supervised finetuning by 1.8% for AUROC and 2% for AUPRC, compared to self-supervised finetuning; (3) foundational modelling by 3.6% for AUROC and 2.6% for AUPRC, compared to self-supervised finetuning. Pre-trained clinical LLMs offer opportunities for postoperative risk predictions in unforeseen data, with peaks in foundational models indicating the potential of task-agnostic learning towards the generalizability of LLMs in perioperative care.

* Supplemental file available at: http://tinyurl.com/mszmjna9 models publicly available at: https://huggingface.co/cja5553/BJH-perioperative-notes-bioGPT AND https://huggingface.co/cja5553/BJH-perioperative-notes-bioGPT

Via

Access Paper or Ask Questions

A Consistent Lebesgue Measure for Multi-label Learning

Feb 01, 2024

Kaan Demir, Bach Nguyen, Bing Xue, Mengjie Zhang

Abstract:Multi-label loss functions are usually non-differentiable, requiring surrogate loss functions for gradient-based optimisation. The consistency of surrogate loss functions is not proven and is exacerbated by the conflicting nature of multi-label loss functions. To directly learn from multiple related, yet potentially conflicting multi-label loss functions, we propose a Consistent Lebesgue Measure-based Multi-label Learner (CLML) and prove that CLML can achieve theoretical consistency under a Bayes risk framework. Empirical evidence supports our theory by demonstrating that: (1) CLML can consistently achieve state-of-the-art results; (2) the primary performance factor is the Lebesgue measure design, as CLML optimises a simpler feedforward model without additional label graph, perturbation-based conditioning, or semantic embeddings; and (3) an analysis of the results not only distinguishes CLML's effectiveness but also highlights inconsistencies between the surrogate and the desired loss functions.

Via

Access Paper or Ask Questions

Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation

Aug 18, 2023

Carl McMillan, Junhong Zhao, Bing Xue, Ross Vennell, Mengjie Zhang

Figure 1 for Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation

Figure 2 for Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation

Figure 3 for Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation

Figure 4 for Improving Buoy Detection with Deep Transfer Learning for Mussel Farm Automation

Abstract:The aquaculture sector in New Zealand is experiencing rapid expansion, with a particular emphasis on mussel exports. As the demands of mussel farming operations continue to evolve, the integration of artificial intelligence and computer vision techniques, such as intelligent object detection, is emerging as an effective approach to enhance operational efficiency. This study delves into advancing buoy detection by leveraging deep learning methodologies for intelligent mussel farm monitoring and management. The primary objective centers on improving accuracy and robustness in detecting buoys across a spectrum of real-world scenarios. A diverse dataset sourced from mussel farms is captured and labeled for training, encompassing imagery taken from cameras mounted on both floating platforms and traversing vessels, capturing various lighting and weather conditions. To establish an effective deep learning model for buoy detection with a limited number of labeled data, we employ transfer learning techniques. This involves adapting a pre-trained object detection model to create a specialized deep learning buoy detection model. We explore different pre-trained models, including YOLO and its variants, alongside data diversity to investigate their effects on model performance. Our investigation demonstrates a significant enhancement in buoy detection performance through deep learning, accompanied by improved generalization across diverse weather conditions, highlighting the practical effectiveness of our approach.

* 7 pages, 5 figures, submitted to ICVNZ 2023 conference https://ivcnz2023.massey.ac.nz/

Via

Access Paper or Ask Questions

Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation

Jul 06, 2023

Bing Xue, Ahmed Sameh Said, Ziqi Xu, Hanyang Liu, Neel Shah, Hanqing Yang, Philip Payne, Chenyang Lu

Figure 1 for Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation

Figure 2 for Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation

Figure 3 for Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation

Figure 4 for Assisting Clinical Decisions for Scarcely Available Treatment via Disentangled Latent Representation

Abstract:Extracorporeal membrane oxygenation (ECMO) is an essential life-supporting modality for COVID-19 patients who are refractory to conventional therapies. However, the proper treatment decision has been the subject of significant debate and it remains controversial about who benefits from this scarcely available and technically complex treatment option. To support clinical decisions, it is a critical need to predict the treatment need and the potential treatment and no-treatment responses. Targeting this clinical challenge, we propose Treatment Variational AutoEncoder (TVAE), a novel approach for individualized treatment analysis. TVAE is specifically designed to address the modeling challenges like ECMO with strong treatment selection bias and scarce treatment cases. TVAE conceptualizes the treatment decision as a multi-scale problem. We model a patient's potential treatment assignment and the factual and counterfactual outcomes as part of their intrinsic characteristics that can be represented by a deep latent variable model. The factual and counterfactual prediction errors are alleviated via a reconstruction regularization scheme together with semi-supervision, and the selection bias and the scarcity of treatment cases are mitigated by the disentangled and distribution-matched latent space and the label-balancing generative strategy. We evaluate TVAE on two real-world COVID-19 datasets: an international dataset collected from 1651 hospitals across 63 countries, and a institutional dataset collected from 15 hospitals. The results show that TVAE outperforms state-of-the-art treatment effect models in predicting both the propensity scores and factual outcomes on heterogeneous COVID-19 datasets. Additional experiments also show TVAE outperforms the best existing models in individual treatment effect estimation on the synthesized IHDP benchmark dataset.

Via

Access Paper or Ask Questions

Online Loss Function Learning

Jan 30, 2023

Christian Raymond, Qi Chen, Bing Xue, Mengjie Zhang

Figure 1 for Online Loss Function Learning

Figure 2 for Online Loss Function Learning

Figure 3 for Online Loss Function Learning

Figure 4 for Online Loss Function Learning

Abstract:Loss function learning is a new meta-learning paradigm that aims to automate the essential task of designing a loss function for a machine learning model. Existing techniques for loss function learning have shown promising results, often improving a model's training dynamics and final inference performance. However, a significant limitation of these techniques is that the loss functions are meta-learned in an offline fashion, where the meta-objective only considers the very first few steps of training, which is a significantly shorter time horizon than the one typically used for training deep neural networks. This causes significant bias towards loss functions that perform well at the very start of training but perform poorly at the end of training. To address this issue we propose a new loss function learning technique for adaptively updating the loss function online after each update to the base model parameters. The experimental results show that our proposed method consistently outperforms the cross-entropy loss and offline loss function learning techniques on a diverse range of neural network architectures and datasets.

Via

Access Paper or Ask Questions

An Efficient Evolutionary Deep Learning Framework Based on Multi-source Transfer Learning to Evolve Deep Convolutional Neural Networks

Dec 07, 2022

Bin Wang, Bing Xue, Mengjie Zhang

Figure 1 for An Efficient Evolutionary Deep Learning Framework Based on Multi-source Transfer Learning to Evolve Deep Convolutional Neural Networks

Figure 2 for An Efficient Evolutionary Deep Learning Framework Based on Multi-source Transfer Learning to Evolve Deep Convolutional Neural Networks

Figure 3 for An Efficient Evolutionary Deep Learning Framework Based on Multi-source Transfer Learning to Evolve Deep Convolutional Neural Networks

Figure 4 for An Efficient Evolutionary Deep Learning Framework Based on Multi-source Transfer Learning to Evolve Deep Convolutional Neural Networks

Abstract:Convolutional neural networks (CNNs) have constantly achieved better performance over years by introducing more complex topology, and enlarging the capacity towards deeper and wider CNNs. This makes the manual design of CNNs extremely difficult, so the automated design of CNNs has come into the research spotlight, which has obtained CNNs that outperform manually-designed CNNs. However, the computational cost is still the bottleneck of automatically designing CNNs. In this paper, inspired by transfer learning, a new evolutionary computation based framework is proposed to efficiently evolve CNNs without compromising the classification accuracy. The proposed framework leverages multi-source domains, which are smaller datasets than the target domain datasets, to evolve a generalised CNN block only once. And then, a new stacking method is proposed to both widen and deepen the evolved block, and a grid search method is proposed to find optimal stacking solutions. The experimental results show the proposed method acquires good CNNs faster than 15 peer competitors within less than 40 GPU-hours. Regarding the classification accuracy, the proposed method gains its strong competitiveness against the peer competitors, which achieves the best error rates of 3.46%, 18.36% and 1.76% for the CIFAR-10, CIFAR-100 and SVHN datasets, respectively.

Via

Access Paper or Ask Questions