Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jin Tian

Iowa State University

AD-DINO: Attention-Dynamic DINO for Distance-Aware Embodied Reference Understanding

Nov 13, 2024

Hao Guo, Wei Fan, Baichun Wei, Jianfei Zhu, Jin Tian, Chunzhi Yi, Feng Jiang

Abstract:Embodied reference understanding is crucial for intelligent agents to predict referents based on human intention through gesture signals and language descriptions. This paper introduces the Attention-Dynamic DINO, a novel framework designed to mitigate misinterpretations of pointing gestures across various interaction contexts. Our approach integrates visual and textual features to simultaneously predict the target object's bounding box and the attention source in pointing gestures. Leveraging the distance-aware nature of nonverbal communication in visual perspective taking, we extend the virtual touch line mechanism and propose an attention-dynamic touch line to represent referring gesture based on interactive distances. The combination of this distance-aware approach and independent prediction of the attention source, enhances the alignment between objects and the gesture represented line. Extensive experiments on the YouRefIt dataset demonstrate the efficacy of our gesture information understanding method in significantly improving task performance. Our model achieves 76.4% accuracy at the 0.25 IoU threshold and, notably, surpasses human performance at the 0.75 IoU threshold, marking a first in this domain. Comparative experiments with distance-unaware understanding methods from previous research further validate the superiority of the Attention-Dynamic Touch Line across diverse contexts.

Via

Access Paper or Ask Questions

Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Sep 22, 2024

Hyunchai Jeong, Adiba Ejaz, Jin Tian, Elias Bareinboim

Figure 1 for Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Figure 2 for Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Figure 3 for Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Figure 4 for Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Abstract:Testing a hypothesized causal model against observational data is a key prerequisite for many causal inference tasks. A natural approach is to test whether the conditional independence relations (CIs) assumed in the model hold in the data. While a model can assume exponentially many CIs (with respect to the number of variables), testing all of them is both impractical and unnecessary. Causal graphs, which encode these CIs in polynomial space, give rise to local Markov properties that enable model testing with a significantly smaller subset of CIs. Model testing based on local properties requires an algorithm to list the relevant CIs. However, existing algorithms for realistic settings with hidden variables and non-parametric distributions can take exponential time to produce even a single CI constraint. In this paper, we introduce the c-component local Markov property (C-LMP) for causal graphs with hidden variables. Since C-LMP can still invoke an exponential number of CIs, we develop a polynomial delay algorithm to list these CIs in poly-time intervals. To our knowledge, this is the first algorithm that enables poly-delay testing of CIs in causal graphs with hidden variables against arbitrary data distributions. Experiments on real-world and synthetic data demonstrate the practicality of our algorithm.

* 34 total pages, 14 figures

Via

Access Paper or Ask Questions

Estimating Causal Effects from Learned Causal Networks

Aug 27, 2024

Anna Raichev, Alexander Ihler, Jin Tian, Rina Dechter

Figure 1 for Estimating Causal Effects from Learned Causal Networks

Figure 2 for Estimating Causal Effects from Learned Causal Networks

Figure 3 for Estimating Causal Effects from Learned Causal Networks

Figure 4 for Estimating Causal Effects from Learned Causal Networks

Abstract:The standard approach to answering an identifiable causal-effect query (e.g., $P(Y|do(X)$) when given a causal diagram and observational data is to first generate an estimand, or probabilistic expression over the observable variables, which is then evaluated using the observational data. In this paper, we propose an alternative paradigm for answering causal-effect queries over discrete observable variables. We propose to instead learn the causal Bayesian network and its confounding latent variables directly from the observational data. Then, efficient probabilistic graphical model (PGM) algorithms can be applied to the learned model to answer queries. Perhaps surprisingly, we show that this \emph{model completion} learning approach can be more effective than estimand approaches, particularly for larger models in which the estimand expressions become computationally difficult. We illustrate our method's potential using a benchmark collection of Bayesian networks and synthetically generated causal models.

Via

Access Paper or Ask Questions

AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Jun 24, 2024

Yaojie Hu, Ilias Fountalis, Jin Tian, Nikolaos Vasiloglou

Figure 1 for AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Figure 2 for AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Figure 3 for AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Figure 4 for AnnotatedTables: A Large Tabular Dataset with Language Model Annotations

Abstract:Tabular data is ubiquitous in real-world applications and abundant on the web, yet its annotation has traditionally required human labor, posing a significant scalability bottleneck for tabular machine learning. Our methodology can successfully annotate a large amount of tabular data and can be flexibly steered to generate various types of annotations based on specific research objectives, as we demonstrate with SQL annotation and input-target column annotation as examples. As a result, we release AnnotatedTables, a collection of 32,119 databases with LLM-generated annotations. The dataset includes 405,616 valid SQL programs, making it the largest SQL dataset with associated tabular data that supports query execution. To further demonstrate the value of our methodology and dataset, we perform two follow-up research studies. 1) We investigate whether LLMs can translate SQL programs to Rel programs, a database language previously unknown to LLMs, while obtaining the same execution results. Using our Incremental Prompt Engineering methods based on execution feedback, we show that LLMs can produce adequate translations with few-shot learning. 2) We evaluate the performance of TabPFN, a recent neural tabular classifier trained on Bayesian priors, on 2,720 tables with input-target columns identified and annotated by LLMs. On average, TabPFN performs on par with the baseline AutoML method, though the relative performance can vary significantly from one data table to another, making both models viable for practical applications depending on the situation. Our findings underscore the potential of LLMs in automating the annotation of large volumes of diverse tabular data.

Via

Access Paper or Ask Questions

Probabilities of Causation for Continuous and Vector Variables

May 30, 2024

Yuta Kawakami, Manabu Kuroki, Jin Tian

Abstract:Probabilities of causation (PoC) are valuable concepts for explainable artificial intelligence and practical decision-making. PoC are originally defined for scalar binary variables. In this paper, we extend the concept of PoC to continuous treatment and outcome variables, and further generalize PoC to capture causal effects between multiple treatments and multiple outcomes. In addition, we consider PoC for a sub-population and PoC with multi-hypothetical terms to capture more sophisticated counterfactual information useful for decision-making. We provide a nonparametric identification theorem for each type of PoC we introduce. Finally, we illustrate the application of our results on a real-world dataset about education.

Via

Access Paper or Ask Questions

Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Mar 06, 2024

Olukorede Fakorede, Modeste Atsague, Jin Tian

Figure 1 for Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Figure 2 for Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Figure 3 for Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Figure 4 for Improving Adversarial Training using Vulnerability-Aware Perturbation Budget

Abstract:Adversarial Training (AT) effectively improves the robustness of Deep Neural Networks (DNNs) to adversarial attacks. Generally, AT involves training DNN models with adversarial examples obtained within a pre-defined, fixed perturbation bound. Notably, individual natural examples from which these adversarial examples are crafted exhibit varying degrees of intrinsic vulnerabilities, and as such, crafting adversarial examples with fixed perturbation radius for all instances may not sufficiently unleash the potency of AT. Motivated by this observation, we propose two simple, computationally cheap vulnerability-aware reweighting functions for assigning perturbation bounds to adversarial examples used for AT, named Margin-Weighted Perturbation Budget (MWPB) and Standard-Deviation-Weighted Perturbation Budget (SDWPB). The proposed methods assign perturbation radii to individual adversarial samples based on the vulnerability of their corresponding natural examples. Experimental results show that the proposed methods yield genuine improvements in the robustness of AT algorithms against various adversarial attacks.

* 19 pages, 2 figures

Via

Access Paper or Ask Questions

Identification and Estimation of Conditional Average Partial Causal Effects via Instrumental Variable

Jan 20, 2024

Yuta Kawakami, Manabu Kuroki, Jin Tian

Abstract:There has been considerable recent interest in estimating heterogeneous causal effects. In this paper, we introduce conditional average partial causal effects (CAPCE) to reveal the heterogeneity of causal effects with continuous treatment. We provide conditions for identifying CAPCE in an instrumental variable setting. We develop three families of CAPCE estimators: sieve, parametric, and reproducing kernel Hilbert space (RKHS)-based, and analyze their statistical properties. We illustrate the proposed CAPCE estimators on synthetic and real-world data.

Via

Access Paper or Ask Questions

Vulnerability-Aware Instance Reweighting For Adversarial Training

Jul 14, 2023

Olukorede Fakorede, Ashutosh Kumar Nirala, Modeste Atsague, Jin Tian

Figure 1 for Vulnerability-Aware Instance Reweighting For Adversarial Training

Figure 2 for Vulnerability-Aware Instance Reweighting For Adversarial Training

Figure 3 for Vulnerability-Aware Instance Reweighting For Adversarial Training

Figure 4 for Vulnerability-Aware Instance Reweighting For Adversarial Training

Abstract:Adversarial Training (AT) has been found to substantially improve the robustness of deep learning classifiers against adversarial attacks. AT involves obtaining robustness by including adversarial examples in training a classifier. Most variants of AT algorithms treat every training example equally. However, recent works have shown that better performance is achievable by treating them unequally. In addition, it has been observed that AT exerts an uneven influence on different classes in a training set and unfairly hurts examples corresponding to classes that are inherently harder to classify. Consequently, various reweighting schemes have been proposed that assign unequal weights to robust losses of individual examples in a training set. In this work, we propose a novel instance-wise reweighting scheme. It considers the vulnerability of each natural example and the resulting information loss on its adversarial counterpart occasioned by adversarial attacks. Through extensive experiments, we show that our proposed method significantly improves over existing reweighting schemes, especially against strong white and black-box attacks.

Via

Access Paper or Ask Questions

Neural Interpretation of Generic Source Code

Mar 23, 2023

Yaojie Hu, Jin Tian

Abstract:Can a generic (Python) program be executed statement-by-statement by neural networks composed according to the source code? We formulate the Abstract Neural Execution Problem and introduce Neural Interpretation, the first neural model that abstractly executes generic source code, where every variable has a vector encoding, and every function executes a neural network. Neural Interpretation is a model of computers with a compiler architecture, which can assemble neural layers ''programmed'' by partial source code. Neural Interpretation can be trained with flexible learning objectives. We demonstrate white-box execution without concrete inputs for variable misuse localization and repair.

Via

Access Paper or Ask Questions

Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations

Mar 15, 2023

Olukorede Fakorede, Ashutosh Nirala, Modeste Atsague, Jin Tian

Figure 1 for Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations

Figure 2 for Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations

Figure 3 for Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations

Figure 4 for Improving Adversarial Robustness with Hypersphere Embedding and Angular-based Regularizations

Abstract:Adversarial training (AT) methods have been found to be effective against adversarial attacks on deep neural networks. Many variants of AT have been proposed to improve its performance. Pang et al. [1] have recently shown that incorporating hypersphere embedding (HE) into the existing AT procedures enhances robustness. We observe that the existing AT procedures are not designed for the HE framework, and thus fail to adequately learn the angular discriminative information available in the HE framework. In this paper, we propose integrating HE into AT with regularization terms that exploit the rich angular information available in the HE framework. Specifically, our method, termed angular-AT, adds regularization terms to AT that explicitly enforce weight-feature compactness and inter-class separation; all expressed in terms of angular features. Experimental results show that angular-AT further improves adversarial robustness.

Via

Access Paper or Ask Questions