"Information": models, code, and papers

Estimation of Soiling Losses in Unlabeled PV Data

Sep 20, 2022
Bennet Meyers

We provide a methodology for estimating the losses due to soiling for photovoltaic (PV) systems. We focus this work on estimating the losses from historical power production data that are unlabeled, i.e. power measurements with time stamps, but no other information such as site configuration or meteorological data. We present a validation of this approach on a small fleet of typical rooftop PV systems. The proposed method differs from prior work in that the construction of a performance index is not required to analyze soiling loss. This approach is appropriate for analyzing the soiling losses in field production data from fleets of distributed rooftop systems and is highly automatic, allowing for scaling to large fleets of heterogeneous PV systems.

* 7 pages, 10 figures, 3 tables

Estimation of Shade Losses in Unlabeled PV Data

Sep 20, 2022
Bennet Meyers, David Jose Florez Rodriguez

We provide a methodology for estimating the losses due to shade in power generation data sets produced by real-world photovoltaic (PV) systems. We focus this work on estimating shade loss from data that are unlabeled, i.e. power measurements with time stamps but no other information such as site configuration or meteorological data. This approach enables, for the first time, the analysis of data generated by small scale, distributed PV systems, which do not have the data quality or richness of large, utility-scale PV systems or research-grade installations. This work is an application of the newly published signal decomposition (SD) framework, which provides an extensible approach for estimating hidden components in time-series data.

* 8 pages in double-column format, 12 figures, and 2 tables

Toward an understanding of the properties of neural network approaches for supernovae light curve approximation

Sep 15, 2022
Mariia Demianenko, Konstantin Malanchev, Ekaterina Samorodova, Mikhail Sysak, Aleksandr Shiriaev, Denis Derkach, Mikhail Hushchyn

Modern time-domain photometric surveys collect many observations of various astronomical objects, and the coming era of large-scale surveys will provide even more information. Most of the objects have never received a spectroscopic follow-up, which is especially crucial for transients, e.g. supernovae. In such cases, observed light curves could present an affordable alternative. Time series are actively used for photometric classification and characterization, such as peak and luminosity decline estimation. However, the collected time series are multidimensional, irregularly sampled, contain outliers, and do not have well-defined systematic uncertainties. Machine learning methods help extract useful information from the available data efficiently. We consider several light curve approximation methods based on neural networks: Multilayer Perceptrons, Bayesian Neural Networks, and Normalizing Flows, to approximate observations of a single light curve. Tests using both the simulated PLAsTiCC and real Zwicky Transient Facility data samples demonstrate that even a few observations are enough to fit the networks and achieve better approximation quality than other state-of-the-art methods. We show that the methods described in this work have better computational complexity and work faster than Gaussian Processes. We analyze the performance of the approximation techniques aiming to fill the gaps in the observations of the light curves, and show that the use of an appropriate technique increases the accuracy of peak finding and supernova classification. In addition, the study results are organized in the Fulu Python library, available on GitHub, which can be easily used by the community.

* Submitted to MNRAS. 14 pages, 6 figures, 9 tables

DFA: Dynamic Feature Aggregation for Efficient Video Object Detection

Oct 02, 2022
Yiming Cui

Video object detection is a fundamental yet challenging task in computer vision. One practical solution is to take advantage of temporal information from the video and apply feature aggregation to enhance the object features in each frame. Though effective, existing methods suffer from low inference speed because they use a fixed number of frames for feature aggregation regardless of the input frame. Therefore, this paper aims to improve the inference speed of current feature aggregation-based video object detectors while maintaining their performance. To achieve this goal, we propose a vanilla dynamic aggregation module that adaptively selects the frames for feature enhancement. Then, we extend the vanilla dynamic aggregation module to a more effective and reconfigurable deformable version. Finally, we introduce an inplace distillation loss to improve the representations of objects aggregated with fewer frames. Extensive experimental results validate the effectiveness and efficiency of our proposed methods: on the ImageNet VID benchmark, integrated with our proposed methods, FGFA and SELSA improve their inference speed by 31% and 76% respectively while achieving comparable accuracy.
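The idea of aggregating over an input-dependent number of frames can be illustrated with a small sketch. This is not the paper's module: the abstract describes a learned dynamic aggregation module, whereas the sketch below substitutes a fixed cosine-similarity threshold as a stand-in selection rule. The names `dynamic_aggregate`, `threshold`, and `max_frames` are hypothetical and chosen only for illustration.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def dynamic_aggregate(current, supports, threshold=0.5, max_frames=4):
    """Aggregate `current` with a variable number of support-frame features.

    Assumption (not from the paper): a support frame is used only if its
    cosine similarity to the current frame is at least `threshold`.
    Returns the aggregated feature and the number of support frames used.
    """
    # Score every support frame, most similar first.
    scored = sorted(
        ((cosine(current, s), s) for s in supports),
        key=lambda pair: pair[0],
        reverse=True,
    )
    # Keep only sufficiently similar frames, capped at max_frames.
    selected = [(w, s) for w, s in scored if w >= threshold][:max_frames]

    # Similarity-weighted average; the current frame gets weight 1.
    weights = [1.0] + [w for w, _ in selected]
    feats = [current] + [s for _, s in selected]
    total = sum(weights)
    aggregated = [
        sum(w * f[i] for w, f in zip(weights, feats)) / total
        for i in range(len(current))
    ]
    return aggregated, len(selected)


if __name__ == "__main__":
    feature, used = dynamic_aggregate(
        [1.0, 0.0], [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    )
    print(used, feature)
```

In this toy rule, the cost of aggregation scales with the number of frames that pass the threshold, so frames with few useful neighbours are processed with less work than under a fixed frame budget.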

Loc-VAE: Learning Structurally Localized Representation from 3D Brain MR Images for Content-Based Image Retrieval

Oct 02, 2022
Kei Nishimaki, Kumpei Ikuta, Yuto Onga, Hitoshi Iyatomi, Kenichi Oishi

Content-based image retrieval (CBIR) systems are an emerging technology that supports reading and interpreting medical images. Since 3D brain MR images are high dimensional, dimensionality reduction is necessary for CBIR using machine learning techniques. In addition, for a reliable CBIR system, each dimension in the resulting low-dimensional representation must be associated with a neurologically interpretable region. We propose a localized variational autoencoder (Loc-VAE) that provides neuroanatomically interpretable low-dimensional representation from 3D brain MR images for clinical CBIR. Loc-VAE is based on $\beta$-VAE with the additional constraint that each dimension of the low-dimensional representation corresponds to a local region of the brain. The proposed Loc-VAE is capable of acquiring representation that preserves disease features and is highly localized, even under high-dimensional compression ratios (4096:1). The low-dimensional representation obtained by Loc-VAE improved the locality measure of each dimension by 4.61 points compared to naive $\beta$-VAE, while maintaining comparable brain reconstruction capability and information about the diagnosis of Alzheimer's disease.

* 6 pages, 6 figures. Accepted at the International Conference on Systems, Man, and Cybernetics (IEEE SMC '22)

Evaluating the Susceptibility of Pre-Trained Language Models via Handcrafted Adversarial Examples

Sep 05, 2022
Hezekiah J. Branch, Jonathan Rodriguez Cefalu, Jeremy McHugh, Leyla Hujer, Aditya Bahl, Daniel del Castillo Iglesias, Ron Heichman, Ramesh Darwishi

Recent advances in the development of large language models have resulted in public access to state-of-the-art pre-trained language models (PLMs), including Generative Pre-trained Transformer 3 (GPT-3) and Bidirectional Encoder Representations from Transformers (BERT). However, evaluations of PLMs, in practice, have shown their susceptibility to adversarial attacks during the training and fine-tuning stages of development. Such attacks can result in erroneous outputs, model-generated hate speech, and the exposure of users' sensitive information. While existing research has focused on adversarial attacks during either the training or the fine-tuning of PLMs, there is a deficit of information on attacks made between these two development phases. In this work, we highlight a major security vulnerability in the public release of GPT-3 and further investigate this vulnerability in other state-of-the-art PLMs. We restrict our work to pre-trained models that have not undergone fine-tuning. Further, we underscore token distance-minimized perturbations as an effective adversarial approach, bypassing both supervised and unsupervised quality measures. Following this approach, we observe a significant decrease in text classification quality when evaluating for semantic similarity.

* 10 pages, 1 figure, 3 tables

RGB-D Saliency Detection via Cascaded Mutual Information Minimization

Sep 15, 2021
Jing Zhang, Deng-Ping Fan, Yuchao Dai, Xin Yu, Yiran Zhong, Nick Barnes, Ling Shao

Existing RGB-D saliency detection models do not explicitly encourage RGB and depth to achieve effective multi-modal learning. In this paper, we introduce a novel multi-stage cascaded learning framework via mutual information minimization to "explicitly" model the multi-modal information between RGB image and depth data. Specifically, we first map the features of each modality to a lower-dimensional feature vector, and adopt mutual information minimization as a regularizer to reduce the redundancy between appearance features from RGB and geometric features from depth. We then perform multi-stage cascaded learning to impose the mutual information minimization constraint at every stage of the network. Extensive experiments on benchmark RGB-D saliency datasets illustrate the effectiveness of our framework. Further, to advance the development of this field, we contribute the largest (7x larger than NJU2K) dataset, which contains 15,625 image pairs with high-quality polygon-/scribble-/object-/instance-/rank-level annotations. Based on these rich labels, we additionally construct four new benchmarks with strong baselines and observe some interesting phenomena, which can motivate future model design. Source code and dataset are available at "https://github.com/JingZhang617/cascaded_rgbd_sod".

* Accepted as ICCV2021 paper

Gated Information Bottleneck for Generalization in Sequential Environments

Oct 12, 2021
Francesco Alesiani, Shujian Yu, Xi Yu

Deep neural networks suffer from poor generalization to unseen environments when the underlying data distribution is different from that in the training set. By learning minimum sufficient representations from training data, the information bottleneck (IB) approach has demonstrated its effectiveness in improving generalization in different AI applications. In this work, we propose a new neural network-based IB approach, termed gated information bottleneck (GIB), that dynamically drops spurious correlations and progressively selects the most task-relevant features across different environments by a trainable soft mask (on raw features). GIB enjoys a simple and tractable objective, without any variational approximation or distributional assumption. We empirically demonstrate the superiority of GIB over other popular neural network-based IB approaches in adversarial robustness and out-of-distribution (OOD) detection. Meanwhile, we also establish the connection between IB theory and invariant causal representation learning, and observe that GIB demonstrates appealing performance when different environments arrive sequentially, a more practical scenario where invariant risk minimization (IRM) fails. Code for GIB is available at https://github.com/falesiani/GIB

* manuscript accepted by IEEE ICDM-21 (regular papers), code is available at https://github.com/falesiani/GIB
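The "trainable soft mask (on raw features)" mentioned in the GIB abstract can be pictured as an element-wise gate with values in (0, 1). The sketch below shows only a forward pass under that reading. The sigmoid parameterization and the names `apply_soft_mask` and `gate_logits` are assumptions for illustration, and the sketch does not reproduce GIB's training objective or the authors' implementation.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def apply_soft_mask(features, gate_logits):
    """Scale each raw feature by a gate in (0, 1).

    Assumption: one gate logit per feature, passed through a sigmoid.
    In a trained model the logits would be learned parameters; here
    they are simply supplied by the caller.
    """
    if len(features) != len(gate_logits):
        raise ValueError("features and gate_logits must have the same length")
    mask = [sigmoid(z) for z in gate_logits]
    gated = [m * x for m, x in zip(mask, features)]
    return gated, mask


if __name__ == "__main__":
    # A strongly negative logit nearly removes the second feature.
    gated, mask = apply_soft_mask([2.0, 3.0], [0.0, -50.0])
    print(mask, gated)
```

A gate logit driven strongly negative effectively drops its feature, which is one way to read "dynamically drops spurious correlations", while a logit near zero passes half of the feature's magnitude through.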

Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation

Aug 05, 2022
Feng Zhu, Zongxin Yang, Xin Yu, Yi Yang, Yunchao Wei

Modeling temporal information for both detection and tracking in a unified framework has proven to be a promising solution to video instance segmentation (VIS). However, how to effectively incorporate the temporal information into an online model remains an open problem. In this work, we propose a new online VIS paradigm named Instance As Identity (IAI), which models temporal information for both detection and tracking in an efficient way. In detail, IAI employs a novel identification module to explicitly predict an identification number for each tracked instance. To pass temporal information across frames, IAI utilizes an association module which combines current features and past embeddings. Notably, IAI can be integrated with different image models. We conduct extensive experiments on three VIS benchmarks. IAI outperforms all the online competitors on YouTube-VIS-2019 (ResNet-101 41.9 mAP) and YouTube-VIS-2021 (ResNet-50 37.7 mAP). Surprisingly, on the more challenging OVIS, IAI achieves SOTA performance (20.3 mAP). Code is available at https://github.com/zfonemore/IAI

FiD-Light: Efficient and Effective Retrieval-Augmented Text Generation

Sep 28, 2022
Sebastian Hofstätter, Jiecao Chen, Karthik Raman, Hamed Zamani

Retrieval-augmented generation models offer many benefits over standalone language models: besides a textual answer to a given query, they provide provenance items retrieved from an updateable knowledge base. However, they are also more complex systems and need to handle long inputs. In this work, we introduce FiD-Light to strongly increase the efficiency of the state-of-the-art retrieval-augmented FiD model, while maintaining the same level of effectiveness. Our FiD-Light model constrains the information flow from the encoder (which encodes passages separately) to the decoder (using concatenated encoded representations). Furthermore, we adapt FiD-Light with re-ranking capabilities through textual source pointers, to improve the top-ranked provenance precision. Our experiments on a diverse set of seven knowledge-intensive tasks (KILT) show that FiD-Light consistently improves the Pareto frontier between query latency and effectiveness. FiD-Light with source pointing sets substantial new state-of-the-art results on six KILT tasks for combined text generation and provenance retrieval evaluation, while maintaining reasonable efficiency.
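The encoder-to-decoder constraint described above can be sketched on plain lists. The abstract does not say how the information flow is constrained, so the sketch assumes one simple rule: keep only the first `k` encoded vectors of each passage before concatenation. The function name `constrain_and_concatenate` and the toy inputs are hypothetical, and no real encoder or decoder is involved.

```python
def constrain_and_concatenate(encoded_passages, k):
    """Build a shortened decoder input from separately encoded passages.

    encoded_passages: list of passages, each a list of token vectors
        (each vector is a list of floats), as if produced by encoding
        every passage independently.
    k: number of leading vectors kept per passage (assumed rule).
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    decoder_input = []
    for vectors in encoded_passages:
        # Truncate each passage's representation, then concatenate.
        decoder_input.extend(vectors[:k])
    return decoder_input


if __name__ == "__main__":
    # Two toy "encoded passages" with 3 and 2 one-dimensional token vectors.
    encoded = [[[1.0], [2.0], [3.0]], [[4.0], [5.0]]]
    full_length = sum(len(p) for p in encoded)
    short = constrain_and_concatenate(encoded, k=1)
    print(full_length, len(short), short)
```

Under this assumed rule the decoder input grows with `k` times the number of passages rather than with total passage length, which is where a latency saving would come from in a setting like the one the abstract describes.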
