Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Time": models, code, and papers

Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

Dec 19, 2022
Chao Qin, Qiuyu Yu, Hugh H. T. Liu

Figure 1 for Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

Figure 2 for Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

Figure 3 for Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

Figure 4 for Model Predictive Spherical Image-Based Visual Servoing On $SO(3)$ for Aggressive Aerial Tracking

This paper presents an image-based visual servo control (IBVS) method for a first-person-view (FPV) quadrotor to conduct aggressive aerial tracking. There are three major challenges to maneuvering an underactuated vehicle using IBVS: (i) finding a visual feature representation that is robust to large rotations and is suited to be an optimization variable; (ii) keeping the target visible without sacrificing the robot's agility; and (iii) compensating for the rotational effects in the detected features. We propose a complete design framework to address these problems. First, we employ a rotation on $SO(3)$ to represent a spherical image feature on $S^{2}$ to gain singularity-free and second-order differentiable properties. To ensure target visibility, we formulate the IBVS as a nonlinear model predictive control (NMPC) problem with three constraints taken into account: the robot's physical limits, target visibility, and time-to-collision (TTC). Furthermore, we propose a novel attitude-compensation scheme to enable formulating the visibility constraint in the actual image plane instead of a virtual fix-orientation image plane. It guarantees that the visibility constraint is valid under large rotations. Extensive experimental results show that our method can track a fast-moving target stably and aggressively without the aid of a localization system.

Via

Access Paper or Ask Questions

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Dec 19, 2022
Ioannis Tsiamas, José A. R. Fonollosa, Marta R. Costa-jussà

Figure 1 for SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Figure 2 for SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Figure 3 for SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Figure 4 for SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Data scarcity is one of the main issues with the end-to-end approach for Speech Translation, as compared to the cascaded one. Although most data resources for Speech Translation are originally document-level, they offer a sentence-level view, which can be directly used during training. But this sentence-level view is single and static, potentially limiting the utility of the data. Our proposed data augmentation method SegAugment challenges this idea and aims to increase data availability by providing multiple alternative sentence-level views of a dataset. Our method heavily relies on an Audio Segmentation system to re-segment the speech of each document, after which we obtain the target text with alignment methods. The Audio Segmentation system can be parameterized with different length constraints, thus giving us access to multiple and diverse sentence-level views for each document. Experiments in MuST-C show consistent gains across 8 language pairs, with an average increase of 2.2 BLEU points, and up to 4.7 BLEU for lower-resource scenarios in mTEDx. Additionally, we find that SegAugment is also applicable to purely sentence-level data, as in CoVoST, and that it enables Speech Translation models to completely close the gap between the gold and automatic segmentation at inference time.

* Work in progress, 10 pages + appendix

Via

Access Paper or Ask Questions

Privacy Adhering Machine Un-learning in NLP

Dec 19, 2022
Vinayshekhar Bannihatti Kumar, Rashmi Gangadharaiah, Dan Roth

Figure 1 for Privacy Adhering Machine Un-learning in NLP

Figure 2 for Privacy Adhering Machine Un-learning in NLP

Figure 3 for Privacy Adhering Machine Un-learning in NLP

Figure 4 for Privacy Adhering Machine Un-learning in NLP

Regulations introduced by General Data Protection Regulation (GDPR) in the EU or California Consumer Privacy Act (CCPA) in the US have included provisions on the \textit{right to be forgotten} that mandates industry applications to remove data related to an individual from their systems. In several real world industry applications that use Machine Learning to build models on user data, such mandates require significant effort both in terms of data cleansing as well as model retraining while ensuring the models do not deteriorate in prediction quality due to removal of data. As a result, continuous removal of data and model retraining steps do not scale if these applications receive such requests at a very high frequency. Recently, a few researchers proposed the idea of \textit{Machine Unlearning} to tackle this challenge. Despite the significant importance of this task, the area of Machine Unlearning is under-explored in Natural Language Processing (NLP) tasks. In this paper, we explore the Unlearning framework on various GLUE tasks \cite{Wang:18}, such as, QQP, SST and MNLI. We propose computationally efficient approaches (SISA-FC and SISA-A) to perform \textit{guaranteed} Unlearning that provides significant reduction in terms of both memory (90-95\%), time (100x) and space consumption (99\%) in comparison to the baselines while keeping model performance constant.

Via

Access Paper or Ask Questions

Discovering Language Model Behaviors with Model-Written Evaluations

Dec 19, 2022
Ethan Perez, Sam Ringer, Kamilė Lukošiūtė, Karina Nguyen, Edwin Chen, Scott Heiner, Craig Pettit, Catherine Olsson, Sandipan Kundu, Saurav Kadavath, Andy Jones, Anna Chen, Ben Mann, Brian Israel, Bryan Seethor, Cameron McKinnon, Christopher Olah, Da Yan, Daniela Amodei, Dario Amodei, Dawn Drain, Dustin Li, Eli Tran-Johnson, Guro Khundadze, Jackson Kernion, James Landis, Jamie Kerr, Jared Mueller, Jeeyoon Hyun, Joshua Landau, Kamal Ndousse, Landon Goldberg, Liane Lovitt, Martin Lucas, Michael Sellitto, Miranda Zhang, Neerav Kingsland, Nelson Elhage, Nicholas Joseph, Noemí Mercado, Nova DasSarma, Oliver Rausch, Robin Larson, Sam McCandlish, Scott Johnston, Shauna Kravec, Sheer El Showk, Tamera Lanham, Timothy Telleen-Lawton, Tom Brown, Tom Henighan, Tristan Hume, Yuntao Bai, Zac Hatfield-Dodds, Jack Clark, Samuel R. Bowman, Amanda Askell, Roger Grosse, Danny Hernandez, Deep Ganguli, Evan Hubinger, Nicholas Schiefer, Jared Kaplan

Figure 1 for Discovering Language Model Behaviors with Model-Written Evaluations

Figure 2 for Discovering Language Model Behaviors with Model-Written Evaluations

Figure 3 for Discovering Language Model Behaviors with Model-Written Evaluations

Figure 4 for Discovering Language Model Behaviors with Model-Written Evaluations

As language models (LMs) scale, they develop many novel behaviors, good and bad, exacerbating the need to evaluate how they behave. Prior work creates evaluations with crowdwork (which is time-consuming and expensive) or existing data sources (which are not always available). Here, we automatically generate evaluations with LMs. We explore approaches with varying amounts of human effort, from instructing LMs to write yes/no questions to making complex Winogender schemas with multiple stages of LM-based generation and filtering. Crowdworkers rate the examples as highly relevant and agree with 90-100% of labels, sometimes more so than corresponding human-written datasets. We generate 154 datasets and discover new cases of inverse scaling where LMs get worse with size. Larger LMs repeat back a dialog user's preferred answer ("sycophancy") and express greater desire to pursue concerning goals like resource acquisition and goal preservation. We also find some of the first examples of inverse scaling in RL from Human Feedback (RLHF), where more RLHF makes LMs worse. For example, RLHF makes LMs express stronger political views (on gun rights and immigration) and a greater desire to avoid shut down. Overall, LM-written evaluations are high-quality and let us quickly discover many novel LM behaviors.

* for associated data visualizations, see https://www.evals.anthropic.com/model-written/ for full datasets, see https://github.com/anthropics/evals

Via

Access Paper or Ask Questions

Identification of optimal prediction error Thévenin models of Li-ion cells using the MOLI approach

Dec 19, 2022
Paulo Lopes dos Santos, T-P Azevedo Perdicoúlis, Paulo A. Salgado

Figure 1 for Identification of optimal prediction error Thévenin models of Li-ion cells using the MOLI approach

Figure 2 for Identification of optimal prediction error Thévenin models of Li-ion cells using the MOLI approach

Figure 3 for Identification of optimal prediction error Thévenin models of Li-ion cells using the MOLI approach

Figure 4 for Identification of optimal prediction error Thévenin models of Li-ion cells using the MOLI approach

This report presents System Identification algorithms to estimate the dynamical model of Li-Oin cells. First the dependence of open circuit voltage (OCV) on the state of charge (SOC) is studied. thN battery equivalent model when a resistor is added to the circuit is stated. The discharge data is divided into segments where the internal resistance is assumed constant, and therefore SOC is constant, thence is described an LTI identification algorithm to be used to estimate the cell model in each segment. A Randles circuit is introduced to the model to describe the diffusion process. This model includes the so called Warburg impedance which is as fractional system. This impedance is discussed and it is approximatted by a finite order linear time invariant state-space model. Also, after presenting the simplified Randles circuit, is stated an identification algorithm that estimates the parameters of this model. The Th\'evenin model is presented as an alternative to the Randles circuit. An algorithm to identify a Th\'evenin model of 1st and 2nd order is enunciated. The performances of the simplified randles model and of the two Th\'evenin models models described and its respective identification algorithms, are discussed and compared using an experimental set of data.

Via

Access Paper or Ask Questions

Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

Dec 19, 2022
Dustin Wright, Isabelle Augenstein

Figure 1 for Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

Figure 2 for Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

Figure 3 for Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

Figure 4 for Multi-View Knowledge Distillation from Crowd Annotations for Out-of-Domain Generalization

Selecting an effective training signal for tasks in natural language processing is difficult: collecting expert annotations is expensive, and crowd-sourced annotations may not be reliable. At the same time, recent work in machine learning has demonstrated that learning from soft-labels acquired from crowd annotations can be effective, especially when there is distribution shift in the test set. However, the best method for acquiring these soft labels is inconsistent across tasks. This paper proposes new methods for acquiring soft-labels from crowd-annotations by aggregating the distributions produced by existing methods. In particular, we propose to find a distribution over classes by learning from multiple-views of crowd annotations via temperature scaling and finding the Jensen-Shannon centroid of their distributions. We demonstrate that using these aggregation methods leads to best or near-best performance across four NLP tasks on out-of-domain test sets, mitigating fluctuations in performance when using the constituent methods on their own. Additionally, these methods result in best or near-best uncertainty estimation across tasks. We argue that aggregating different views of crowd-annotations as soft-labels is an effective way to ensure performance which is as good or better than the best individual view, which is useful given the inconsistency in performance of the individual methods.

* 14 pages, 4 figures, 1 table

Via

Access Paper or Ask Questions

Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

May 07, 2022
Zijian Li, Ruichu Cai, Jiawei Chen, Yuguan Yan, Wei Chen, Keli Zhang, Junjian Ye

Figure 1 for Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

Figure 2 for Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

Figure 3 for Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

Figure 4 for Time-Series Domain Adaptation via Sparse Associative Structure Alignment: Learning Invariance and Variance

Domain adaptation on time-series data is often encountered in the industry but received limited attention in academia. Most of the existing domain adaptation methods for time-series data borrow the ideas from the existing methods for non-time series data to extract the domain-invariant representation. However, two peculiar difficulties to time-series data have not been solved. 1) It is not a trivial task to model the domain-invariant and complex dependence among different timestamps. 2) The domain-variant information is important but how to leverage them is almost underexploited. Fortunately, the stableness of causal structures among different domains inspires us to explore the structures behind the time-series data. Based on this inspiration, we investigate the domain-invariant unweighted sparse associative structures and the domain-variant strengths of the structures. To achieve this, we propose Sparse Associative structure alignment by learning Invariance and Variance (SASA-IV in short), a model that simultaneously aligns the invariant unweighted spare associative structures and considers the variant information for time-series unsupervised domain adaptation. Technologically, we extract the domain-invariant unweighted sparse associative structures with a unidirectional alignment restriction and embed the domain-variant strengths via a well-designed autoregressive module. Experimental results not only testify that our model yields state-of-the-art performance on three real-world datasets but also provide some insightful discoveries on knowledge transfer.

* arXiv admin note: text overlap with arXiv:2012.11797

Via

Access Paper or Ask Questions

Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

Dec 02, 2022
Saee Paliwal, Angus Brayne, Benedek Fabian, Maciej Wiatrak, Aaron Sim

Figure 1 for Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

Figure 2 for Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

Figure 3 for Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

Figure 4 for Pseudo-Riemannian Embedding Models for Multi-Relational Graph Representations

In this paper we generalize single-relation pseudo-Riemannian graph embedding models to multi-relational networks, and show that the typical approach of encoding relations as manifold transformations translates from the Riemannian to the pseudo-Riemannian case. In addition we construct a view of relations as separate spacetime submanifolds of multi-time manifolds, and consider an interpolation between a pseudo-Riemannian embedding model and its Wick-rotated Riemannian counterpart. We validate these extensions in the task of link prediction, focusing on flat Lorentzian manifolds, and demonstrate their use in both knowledge graph completion and knowledge discovery in a biological domain.

* 4th Conference on Automated Knowledge Base Construction 2022
* 11 pages, 3 figures, AKBC 2022 conference

Via

Access Paper or Ask Questions

Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

Dec 11, 2022
Jing Ren, Feng Xia, Azadeh Noori Hoshyar, Charu C. Aggarwal

Figure 1 for Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

Figure 2 for Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

Figure 3 for Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

Figure 4 for Graph Learning for Anomaly Analytics: Algorithms, Applications, and Challenges

Anomaly analytics is a popular and vital task in various research contexts, which has been studied for several decades. At the same time, deep learning has shown its capacity in solving many graph-based tasks like, node classification, link prediction, and graph classification. Recently, many studies are extending graph learning models for solving anomaly analytics problems, resulting in beneficial advances in graph-based anomaly analytics techniques. In this survey, we provide a comprehensive overview of graph learning methods for anomaly analytics tasks. We classify them into four categories based on their model architectures, namely graph convolutional network (GCN), graph attention network (GAT), graph autoencoder (GAE), and other graph learning models. The differences between these methods are also compared in a systematic manner. Furthermore, we outline several graph-based anomaly analytics applications across various domains in the real world. Finally, we discuss five potential future research directions in this rapidly growing field.

Via

Access Paper or Ask Questions

Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

Oct 27, 2022
Zhanglu Yan, Jun Zhou, Weng-Fai Wong

Figure 1 for Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

Figure 2 for Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

Figure 3 for Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

Figure 4 for Low Latency Conversion of Artificial Neural Network Models to Rate-encoded Spiking Neural Networks

Spiking neural networks (SNNs) are well suited for resource-constrained applications as they do not need expensive multipliers. In a typical rate-encoded SNN, a series of binary spikes within a globally fixed time window is used to fire the neurons. The maximum number of spikes in this time window is also the latency of the network in performing a single inference, as well as determines the overall energy efficiency of the model. The aim of this paper is to reduce this while maintaining accuracy when converting ANNs to their equivalent SNNs. The state-of-the-art conversion schemes yield SNNs with accuracies comparable with ANNs only for large window sizes. In this paper, we start with understanding the information loss when converting from pre-existing ANN models to standard rate-encoded SNN models. From these insights, we propose a suite of novel techniques that together mitigate the information lost in the conversion, and achieve state-of-art SNN accuracies along with very low latency. Our method achieved a Top-1 SNN accuracy of 98.73% (1 time step) on the MNIST dataset, 76.38% (8 time steps) on the CIFAR-100 dataset, and 93.71% (8 time steps) on the CIFAR-10 dataset. On ImageNet, an SNN accuracy of 75.35%/79.16% was achieved with 100/200 time steps.

Via

Access Paper or Ask Questions