Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Information": models, code, and papers

A Cryogenic Memristive Neural Decoder for Fault-tolerant Quantum Error Correction

Jul 18, 2023
Frédéric Marcotte, Pierre-Antoine Mouny, Victor Yon, Gebremedhin A. Dagnew, Bohdan Kulchytskyy, Sophie Rochette, Yann Beilliard, Dominique Drouin, Pooya Ronagh

Figure 1 for A Cryogenic Memristive Neural Decoder for Fault-tolerant Quantum Error Correction

Figure 2 for A Cryogenic Memristive Neural Decoder for Fault-tolerant Quantum Error Correction

Figure 3 for A Cryogenic Memristive Neural Decoder for Fault-tolerant Quantum Error Correction

Figure 4 for A Cryogenic Memristive Neural Decoder for Fault-tolerant Quantum Error Correction

Neural decoders for quantum error correction (QEC) rely on neural networks to classify syndromes extracted from error correction codes and find appropriate recovery operators to protect logical information against errors. Despite the good performance of neural decoders, important practical requirements remain to be achieved, such as minimizing the decoding time to meet typical rates of syndrome generation in repeated error correction schemes, and ensuring the scalability of the decoding approach as the code distance increases. Designing a dedicated integrated circuit to perform the decoding task in co-integration with a quantum processor appears necessary to reach these decoding time and scalability requirements, as routing signals in and out of a cryogenic environment to be processed externally leads to unnecessary delays and an eventual wiring bottleneck. In this work, we report the design and performance analysis of a neural decoder inference accelerator based on an in-memory computing (IMC) architecture, where crossbar arrays of resistive memory devices are employed to both store the synaptic weights of the decoder neural network and perform analog matrix-vector multiplications during inference. In proof-of-concept numerical experiments supported by experimental measurements, we investigate the impact of TiO$_\textrm{x}$-based memristive devices' non-idealities on decoding accuracy. Hardware-aware training methods are developed to mitigate the loss in accuracy, allowing the memristive neural decoders to achieve a pseudo-threshold of $9.23\times 10^{-4}$ for the distance-three surface code, whereas the equivalent digital neural decoder achieves a pseudo-threshold of $1.01\times 10^{-3}$. This work provides a pathway to scalable, fast, and low-power cryogenic IMC hardware for integrated QEC.

Via

Access Paper or Ask Questions

Scale-Aware Modulation Meet Transformer

Jul 17, 2023
Weifeng Lin, Ziheng Wu, Jiayu Chen, Jun Huang, Lianwen Jin

Figure 1 for Scale-Aware Modulation Meet Transformer

Figure 2 for Scale-Aware Modulation Meet Transformer

Figure 3 for Scale-Aware Modulation Meet Transformer

Figure 4 for Scale-Aware Modulation Meet Transformer

This paper presents a new vision Transformer, Scale-Aware Modulation Transformer (SMT), that can handle various downstream tasks efficiently by combining the convolutional network and vision Transformer. The proposed Scale-Aware Modulation (SAM) in the SMT includes two primary novel designs. Firstly, we introduce the Multi-Head Mixed Convolution (MHMC) module, which can capture multi-scale features and expand the receptive field. Secondly, we propose the Scale-Aware Aggregation (SAA) module, which is lightweight but effective, enabling information fusion across different heads. By leveraging these two modules, convolutional modulation is further enhanced. Furthermore, in contrast to prior works that utilized modulations throughout all stages to build an attention-free network, we propose an Evolutionary Hybrid Network (EHN), which can effectively simulate the shift from capturing local to global dependencies as the network becomes deeper, resulting in superior performance. Extensive experiments demonstrate that SMT significantly outperforms existing state-of-the-art models across a wide range of visual tasks. Specifically, SMT with 11.5M / 2.4GFLOPs and 32M / 7.7GFLOPs can achieve 82.2% and 84.3% top-1 accuracy on ImageNet-1K, respectively. After pretrained on ImageNet-22K in 224^2 resolution, it attains 87.1% and 88.1% top-1 accuracy when finetuned with resolution 224^2 and 384^2, respectively. For object detection with Mask R-CNN, the SMT base trained with 1x and 3x schedule outperforms the Swin Transformer counterpart by 4.2 and 1.3 mAP on COCO, respectively. For semantic segmentation with UPerNet, the SMT base test at single- and multi-scale surpasses Swin by 2.0 and 1.1 mIoU respectively on the ADE20K.

* Accepted to ICCV 2023

Via

Access Paper or Ask Questions

Privacy-preserving patient clustering for personalized federated learning

Jul 17, 2023
Ahmed Elhussein, Gamze Gursoy

Figure 1 for Privacy-preserving patient clustering for personalized federated learning

Figure 2 for Privacy-preserving patient clustering for personalized federated learning

Figure 3 for Privacy-preserving patient clustering for personalized federated learning

Figure 4 for Privacy-preserving patient clustering for personalized federated learning

Federated Learning (FL) is a machine learning framework that enables multiple organizations to train a model without sharing their data with a central server. However, it experiences significant performance degradation if the data is non-identically independently distributed (non-IID). This is a problem in medical settings, where variations in the patient population contribute significantly to distribution differences across hospitals. Personalized FL addresses this issue by accounting for site-specific distribution differences. Clustered FL, a Personalized FL variant, was used to address this problem by clustering patients into groups across hospitals and training separate models on each group. However, privacy concerns remained as a challenge as the clustering process requires exchange of patient-level information. This was previously solved by forming clusters using aggregated data, which led to inaccurate groups and performance degradation. In this study, we propose Privacy-preserving Community-Based Federated machine Learning (PCBFL), a novel Clustered FL framework that can cluster patients using patient-level data while protecting privacy. PCBFL uses Secure Multiparty Computation, a cryptographic technique, to securely calculate patient-level similarity scores across hospitals. We then evaluate PCBFL by training a federated mortality prediction model using 20 sites from the eICU dataset. We compare the performance gain from PCBFL against traditional and existing Clustered FL frameworks. Our results show that PCBFL successfully forms clinically meaningful cohorts of low, medium, and high-risk patients. PCBFL outperforms traditional and existing Clustered FL frameworks with an average AUC improvement of 4.3% and AUPRC improvement of 7.8%.

Via

Access Paper or Ask Questions

Learning for Counterfactual Fairness from Observational Data

Jul 17, 2023
Jing Ma, Ruocheng Guo, Aidong Zhang, Jundong Li

Figure 1 for Learning for Counterfactual Fairness from Observational Data

Figure 2 for Learning for Counterfactual Fairness from Observational Data

Figure 3 for Learning for Counterfactual Fairness from Observational Data

Figure 4 for Learning for Counterfactual Fairness from Observational Data

Fairness-aware machine learning has attracted a surge of attention in many domains, such as online advertising, personalized recommendation, and social media analysis in web applications. Fairness-aware machine learning aims to eliminate biases of learning models against certain subgroups described by certain protected (sensitive) attributes such as race, gender, and age. Among many existing fairness notions, counterfactual fairness is a popular notion defined from a causal perspective. It measures the fairness of a predictor by comparing the prediction of each individual in the original world and that in the counterfactual worlds in which the value of the sensitive attribute is modified. A prerequisite for existing methods to achieve counterfactual fairness is the prior human knowledge of the causal model for the data. However, in real-world scenarios, the underlying causal model is often unknown, and acquiring such human knowledge could be very difficult. In these scenarios, it is risky to directly trust the causal models obtained from information sources with unknown reliability and even causal discovery methods, as incorrect causal models can consequently bring biases to the predictor and lead to unfair predictions. In this work, we address the problem of counterfactually fair prediction from observational data without given causal models by proposing a novel framework CLAIRE. Specifically, under certain general assumptions, CLAIRE effectively mitigates the biases from the sensitive attribute with a representation learning framework based on counterfactual data augmentation and an invariant penalty. Experiments conducted on both synthetic and real-world datasets validate the superiority of CLAIRE in both counterfactual fairness and prediction performance.

Via

Access Paper or Ask Questions

Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

Jul 17, 2023
Matthias Körschens, Solveig Franziska Bucher, Christine Römermann, Joachim Denzler

Figure 1 for Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

Figure 2 for Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

Figure 3 for Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

Figure 4 for Improving Data Efficiency for Plant Cover Prediction with Label Interpolation and Monte-Carlo Cropping

The plant community composition is an essential indicator of environmental changes and is, for this reason, usually analyzed in ecological field studies in terms of the so-called plant cover. The manual acquisition of this kind of data is time-consuming, laborious, and prone to human error. Automated camera systems can collect high-resolution images of the surveyed vegetation plots at a high frequency. In combination with subsequent algorithmic analysis, it is possible to objectively extract information on plant community composition quickly and with little human effort. An automated camera system can easily collect the large amounts of image data necessary to train a Deep Learning system for automatic analysis. However, due to the amount of work required to annotate vegetation images with plant cover data, only few labeled samples are available. As automated camera systems can collect many pictures without labels, we introduce an approach to interpolate the sparse labels in the collected vegetation plot time series down to the intermediate dense and unlabeled images to artificially increase our training dataset to seven times its original size. Moreover, we introduce a new method we call Monte-Carlo Cropping. This approach trains on a collection of cropped parts of the training images to deal with high-resolution images efficiently, implicitly augment the training images, and speed up training. We evaluate both approaches on a plant cover dataset containing images of herbaceous plant communities and find that our methods lead to improvements in the species, community, and segmentation metrics investigated.

* Accepted for publication at DAGM-GCPR 2023

Via

Access Paper or Ask Questions

Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

Jul 17, 2023
Felipe M. Dias, Marcelo A. F. Toledo, Diego A. C. Cardenas, Douglas A. Almeida, Filipe A. C. Oliveira, Estela Ribeiro, Jose E. Krieger, Marco A. Gutierrez

Figure 1 for Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

Figure 2 for Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

Figure 3 for Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

Figure 4 for Quality Assessment of Photoplethysmography Signals For Cardiovascular Biomarkers Monitoring Using Wearable Devices

Photoplethysmography (PPG) is a non-invasive technology that measures changes in blood volume in the microvascular bed of tissue. It is commonly used in medical devices such as pulse oximeters and wrist worn heart rate monitors to monitor cardiovascular hemodynamics. PPG allows for the assessment of parameters (e.g., heart rate, pulse waveform, and peripheral perfusion) that can indicate conditions such as vasoconstriction or vasodilation, and provides information about microvascular blood flow, making it a valuable tool for monitoring cardiovascular health. However, PPG is subject to a number of sources of variations that can impact its accuracy and reliability, especially when using a wearable device for continuous monitoring, such as motion artifacts, skin pigmentation, and vasomotion. In this study, we extracted 27 statistical features from the PPG signal for training machine-learning models based on gradient boosting (XGBoost and CatBoost) and Random Forest (RF) algorithms to assess quality of PPG signals that were labeled as good or poor quality. We used the PPG time series from a publicly available dataset and evaluated the algorithm s performance using Sensitivity (Se), Positive Predicted Value (PPV), and F1-score (F1) metrics. Our model achieved Se, PPV, and F1-score of 94.4, 95.6, and 95.0 for XGBoost, 94.7, 95.9, and 95.3 for CatBoost, and 93.7, 91.3 and 92.5 for RF, respectively. Our findings are comparable to state-of-the-art reported in the literature but using a much simpler model, indicating that ML models are promising for developing remote, non-invasive, and continuous measurement devices.

* 9 pages

Via

Access Paper or Ask Questions

Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Jul 17, 2023
Subba Reddy Oota, Manish Gupta, Raju S. Bapi, Gael Jobard, Frederic Alexandre, Xavier Hinaut

Figure 1 for Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Figure 2 for Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Figure 3 for Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

Figure 4 for Deep Neural Networks and Brain Alignment: Brain Encoding and Decoding (Survey)

How does the brain represent different modes of information? Can we design a system that automatically understands what the user is thinking? Such questions can be answered by studying brain recordings like functional magnetic resonance imaging (fMRI). As a first step, the neuroscience community has contributed several large cognitive neuroscience datasets related to passive reading/listening/viewing of concept words, narratives, pictures and movies. Encoding and decoding models using these datasets have also been proposed in the past two decades. These models serve as additional tools for basic research in cognitive science and neuroscience. Encoding models aim at generating fMRI brain representations given a stimulus automatically. They have several practical applications in evaluating and diagnosing neurological conditions and thus also help design therapies for brain damage. Decoding models solve the inverse problem of reconstructing the stimuli given the fMRI. They are useful for designing brain-machine or brain-computer interfaces. Inspired by the effectiveness of deep learning models for natural language processing, computer vision, and speech, recently several neural encoding and decoding models have been proposed. In this survey, we will first discuss popular representations of language, vision and speech stimuli, and present a summary of neuroscience datasets. Further, we will review popular deep learning based encoding and decoding architectures and note their benefits and limitations. Finally, we will conclude with a brief summary and discussion about future trends. Given the large amount of recently published work in the `computational cognitive neuroscience' community, we believe that this survey nicely organizes the plethora of work and presents it as a coherent story.

* 16 pages, 10 figures

Via

Access Paper or Ask Questions

Experimentally realized physical-model-based wave control in metasurface-programmable complex media

Jul 17, 2023
Jérôme Sol, Hugo Prod'homme, Luc Le Magoarou, Philipp del Hougne

Figure 1 for Experimentally realized physical-model-based wave control in metasurface-programmable complex media

Figure 2 for Experimentally realized physical-model-based wave control in metasurface-programmable complex media

Figure 3 for Experimentally realized physical-model-based wave control in metasurface-programmable complex media

Figure 4 for Experimentally realized physical-model-based wave control in metasurface-programmable complex media

The reconfigurability of radio environments with programmable metasurfaces is considered a key feature of next-generation wireless networks. Identifying suitable metasurface configurations for desired wireless functionalities requires a precise setting-specific understanding of the intricate impact of the metasurface configuration on the wireless channels. Yet, to date, the relevant short and long-range correlations between the meta-atoms due to proximity and reverberation are largely ignored rather than precisely captured. Here, we experimentally demonstrate that a compact model derived from first physical principles can precisely predict how wireless channels in complex scattering environments depend on the programmable-metasurface configuration. The model is calibrated using a very small random subset of all possible metasurface configurations and without knowing the setup's geometry. Our approach achieves two orders of magnitude higher precision than a deep learning-based digital-twin benchmark while involving hundred times fewer parameters. Strikingly, when only phaseless calibration data is available, our model can nonetheless retrieve the precise phase relations of the scattering matrix as well as their dependencies on the metasurface configuration. Thereby, we achieve coherent wave control (focusing or enhancing absorption) and phase-shift-keying backscatter communications without ever having measured phase information. Finally, our model is also capable of retrieving the essential properties of scattering coefficients for which no calibration data was ever provided. These unique generalization capabilities of our pure-physics model significantly alleviate the measurement complexity. Our approach is also directly relevant to dynamic metasurface antennas, microwave-based signal processors as well as emerging in situ reconfigurable nanophotonic, optical and room-acoustical systems.

* 23 pages, 4 figures, 4 supplementary figures

Via

Access Paper or Ask Questions

MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

Jul 08, 2023
Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

Figure 1 for MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

Figure 2 for MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

Figure 3 for MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

Figure 4 for MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

We propose MAP-NBV, a prediction-guided active algorithm for 3D reconstruction with multi-agent systems. Prediction-based approaches have shown great improvement in active perception tasks by learning the cues about structures in the environment from data. But these methods primarily focus on single-agent systems. We design a next-best-view approach that utilizes geometric measures over the predictions and jointly optimizes the information gain and control effort for efficient collaborative 3D reconstruction of the object. Our method achieves 22.75% improvement over the prediction-based single-agent approach and 15.63% improvement over the non-predictive multi-agent approach. We make our code publicly available through our project website: http://raaslab.org/projects/MAPNBV/

* 7 pages, 7 figures, 2 tables. Submitted to MRS 2023

Via

Access Paper or Ask Questions

General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

Jul 07, 2023
Nhi Kieu, Kien Nguyen, Sridha Sridharan, Clinton Fookes

Figure 1 for General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

Figure 2 for General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

Figure 3 for General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

Figure 4 for General-Purpose Multimodal Transformer meets Remote Sensing Semantic Segmentation

The advent of high-resolution multispectral/hyperspectral sensors, LiDAR DSM (Digital Surface Model) information and many others has provided us with an unprecedented wealth of data for Earth Observation. Multimodal AI seeks to exploit those complementary data sources, particularly for complex tasks like semantic segmentation. While specialized architectures have been developed, they are highly complicated via significant effort in model design, and require considerable re-engineering whenever a new modality emerges. Recent trends in general-purpose multimodal networks have shown great potential to achieve state-of-the-art performance across multiple multimodal tasks with one unified architecture. In this work, we investigate the performance of PerceiverIO, one in the general-purpose multimodal family, in the remote sensing semantic segmentation domain. Our experiments reveal that this ostensibly universal network struggles with object scale variation in remote sensing images and fails to detect the presence of cars from a top-down view. To address these issues, even with extreme class imbalance issues, we propose a spatial and volumetric learning component. Specifically, we design a UNet-inspired module that employs 3D convolution to encode vital local information and learn cross-modal features simultaneously, while reducing network computational burden via the cross-attention mechanism of PerceiverIO. The effectiveness of the proposed component is validated through extensive experiments comparing it with other methods such as 2D convolution, and dual local module (\ie the combination of Conv2D 1x1 and Conv2D 3x3 inspired by UNetFormer). The proposed method achieves competitive results with specialized architectures like UNetFormer and SwinUNet, showing its potential to minimize network architecture engineering with a minimal compromise on the performance.

* Accepted to CVPR Workshop on Multimodal Learning for Earth and Environment 2023

Via

Access Paper or Ask Questions