Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

"Time": models, code, and papers

Neural-Sim: Learning to Generate Training Data with NeRF

Jul 22, 2022
Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet

Figure 1 for Neural-Sim: Learning to Generate Training Data with NeRF

Figure 2 for Neural-Sim: Learning to Generate Training Data with NeRF

Figure 3 for Neural-Sim: Learning to Generate Training Data with NeRF

Figure 4 for Neural-Sim: Learning to Generate Training Data with NeRF

Training computer vision models usually requires collecting and labeling vast amounts of imagery under a diverse set of scene configurations and properties. This process is incredibly time-consuming, and it is challenging to ensure that the captured data distribution maps well to the target domain of an application scenario. Recently, synthetic data has emerged as a way to address both of these issues. However, existing approaches either require human experts to manually tune each scene property or use automatic methods that provide little to no control; this requires rendering large amounts of random data variations, which is slow and is often suboptimal for the target domain. We present the first fully differentiable synthetic data pipeline that uses Neural Radiance Fields (NeRFs) in a closed-loop with a target application's loss function. Our approach generates data on-demand, with no human labor, to maximize accuracy for a target task. We illustrate the effectiveness of our method on synthetic and real-world object detection tasks. We also introduce a new "YCB-in-the-Wild" dataset and benchmark that provides a test scenario for object detection with varied poses in real-world environments.

* ECCV 2022

Via

Access Paper or Ask Questions

CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

Jul 28, 2022
Chongyi Li, Chunle Guo, Ruicheng Feng, Shangchen Zhou, Chen Change Loy

Figure 1 for CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

Figure 2 for CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

Figure 3 for CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

Figure 4 for CuDi: Curve Distillation for Efficient and Controllable Exposure Adjustment

We present Curve Distillation, CuDi, for efficient and controllable exposure adjustment without the requirement of paired or unpaired data during training. Our method inherits the zero-reference learning and curve-based framework from an effective low-light image enhancement method, Zero-DCE, with further speed up in its inference speed, reduction in its model size, and extension to controllable exposure adjustment. The improved inference speed and lightweight model are achieved through novel curve distillation that approximates the time-consuming iterative operation in the conventional curve-based framework by high-order curve's tangent line. The controllable exposure adjustment is made possible with a new self-supervised spatial exposure control loss that constrains the exposure levels of different spatial regions of the output to be close to the brightness distribution of an exposure map serving as an input condition. Different from most existing methods that can only correct either underexposed or overexposed photos, our approach corrects both underexposed and overexposed photos with a single model. Notably, our approach can additionally adjust the exposure levels of a photo globally or locally with the guidance of an input condition exposure map, which can be pre-defined or manually set in the inference stage. Through extensive experiments, we show that our method is appealing for its fast, robust, and flexible performance, outperforming state-of-the-art methods in real scenes. Project page: https://li-chongyi.github.io/CuDi_files/.

* https://li-chongyi.github.io/CuDi_files/

Via

Access Paper or Ask Questions

Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Jul 19, 2022
Jurica Levatić, Michelangelo Ceci, Dragi Kocev, Sašo Džeroski

Figure 1 for Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Figure 2 for Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Figure 3 for Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Figure 4 for Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Semi-supervised learning (SSL) is a common approach to learning predictive models using not only labeled examples, but also unlabeled examples. While SSL for the simple tasks of classification and regression has received a lot of attention from the research community, this is not properly investigated for complex prediction tasks with structurally dependent variables. This is the case of multi-label classification and hierarchical multi-label classification tasks, which may require additional information, possibly coming from the underlying distribution in the descriptive space provided by unlabeled examples, to better face the challenging task of predicting simultaneously multiple class labels. In this paper, we investigate this aspect and propose a (hierarchical) multi-label classification method based on semi-supervised learning of predictive clustering trees. We also extend the method towards ensemble learning and propose a method based on the random forest approach. Extensive experimental evaluation conducted on 23 datasets shows significant advantages of the proposed method and its extension with respect to their supervised counterparts. Moreover, the method preserves interpretability and reduces the time complexity of classical tree-based models.

Via

Access Paper or Ask Questions

The mbsts package: Multivariate Bayesian Structural Time Series Models in R

Jun 26, 2021
Ning Ning, Jinwen Qiu

Figure 1 for The mbsts package: Multivariate Bayesian Structural Time Series Models in R

Figure 2 for The mbsts package: Multivariate Bayesian Structural Time Series Models in R

Figure 3 for The mbsts package: Multivariate Bayesian Structural Time Series Models in R

The multivariate Bayesian structural time series (MBSTS) model \citep{qiu2018multivariate,Jammalamadaka2019Predicting} as a generalized version of many structural time series models, deals with inference and prediction for multiple correlated time series, where one also has the choice of using a different candidate pool of contemporaneous predictors for each target series. The MBSTS model has wide applications and is ideal for feature selection, time series forecasting, nowcasting, inferring causal impact, and others. This paper demonstrates how to use the R package \pkg{mbsts} for MBSTS modeling, establishing a bridge between user-friendly and developer-friendly functions in package and the corresponding methodology. A simulated dataset and object-oriented functions in the \pkg{mbsts} package are explained in the way that enables users to flexibly add or deduct some components, as well as to simplify or complicate some settings.

Via

Access Paper or Ask Questions

Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation

Jul 25, 2022
Zhouping Wang, Sarah Ostadabbas

Figure 1 for Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation

Figure 2 for Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation

Figure 3 for Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation

Figure 4 for Live Stream Temporally Embedded 3D Human Body Pose and Shape Estimation

3D Human body pose and shape estimation within a temporal sequence can be quite critical for understanding human behavior. Despite the significant progress in human pose estimation in the recent years, which are often based on single images or videos, human motion estimation on live stream videos is still a rarely-touched area considering its special requirements for real-time output and temporal consistency. To address this problem, we present a temporally embedded 3D human body pose and shape estimation (TePose) method to improve the accuracy and temporal consistency of pose estimation in live stream videos. TePose uses previous predictions as a bridge to feedback the error for better estimation in the current frame and to learn the correspondence between data frames and predictions in the history. A multi-scale spatio-temporal graph convolutional network is presented as the motion discriminator for adversarial training using datasets without any 3D labeling. We propose a sequential data loading strategy to meet the special start-to-end data processing requirement of live stream. We demonstrate the importance of each proposed module with extensive experiments. The results show the effectiveness of TePose on widely-used human pose benchmarks with state-of-the-art performance.

Via

Access Paper or Ask Questions

ABCinML: Anticipatory Bias Correction in Machine Learning Applications

Jun 14, 2022
Abdulaziz A. Almuzaini, Chidansh A. Bhatt, David M. Pennock, Vivek K. Singh

Figure 1 for ABCinML: Anticipatory Bias Correction in Machine Learning Applications

Figure 2 for ABCinML: Anticipatory Bias Correction in Machine Learning Applications

Figure 3 for ABCinML: Anticipatory Bias Correction in Machine Learning Applications

Figure 4 for ABCinML: Anticipatory Bias Correction in Machine Learning Applications

The idealization of a static machine-learned model, trained once and deployed forever, is not practical. As input distributions change over time, the model will not only lose accuracy, any constraints to reduce bias against a protected class may fail to work as intended. Thus, researchers have begun to explore ways to maintain algorithmic fairness over time. One line of work focuses on dynamic learning: retraining after each batch, and the other on robust learning which tries to make algorithms robust against all possible future changes. Dynamic learning seeks to reduce biases soon after they have occurred and robust learning often yields (overly) conservative models. We propose an anticipatory dynamic learning approach for correcting the algorithm to mitigate bias before it occurs. Specifically, we make use of anticipations regarding the relative distributions of population subgroups (e.g., relative ratios of male and female applicants) in the next cycle to identify the right parameters for an importance weighing fairness approach. Results from experiments over multiple real-world datasets suggest that this approach has promise for anticipatory bias correction.

Via

Access Paper or Ask Questions

A Novel Meta-predictor based Algorithm for Testing VLSI Circuits

Jul 22, 2022
Shruti Pandey, Jayadeva, Smruti R. Sarangi

Figure 1 for A Novel Meta-predictor based Algorithm for Testing VLSI Circuits

Figure 2 for A Novel Meta-predictor based Algorithm for Testing VLSI Circuits

Figure 3 for A Novel Meta-predictor based Algorithm for Testing VLSI Circuits

Figure 4 for A Novel Meta-predictor based Algorithm for Testing VLSI Circuits

Testing of integrated circuits (IC) is a highly expensive process but also the most important one in determining the defect level of an IC. Manufacturing defects in the IC are modeled using stuck-at-fault models. Stuck-at-fault models cover most of the physical faults that occur during the manufacturing process. With decreasing feature sizes due to the advancement of semiconductor technology, the defects are also getting smaller in size. Tests for these hard-to-detect defects are generated using deterministic test generation (DTG) algorithms. Our work aims at reducing the cost of Path Oriented Decision Making: PODEM (a DTG algorithm) without compromising the test quality. We trained a meta predictor to choose the best model given the circuit and the target net. This ensemble chooses the best probability prediction model with a 95% accuracy. This leads to a reduced number of backtracking decisions and much better performance of PODEM in terms of its CPU time. We show that our ML- guided PODEM algorithm with a meta predictor outperforms the baseline PODEM by 34% and other state-of-the-art ML-guided algorithms by at least 15% for ISCAS85 benchmark circuits.

* 7 pages, 8 figures and 4 tables

Via

Access Paper or Ask Questions

Partial Disentanglement via Mechanism Sparsity

Jul 15, 2022
Sébastien Lachapelle, Simon Lacoste-Julien

Figure 1 for Partial Disentanglement via Mechanism Sparsity

Figure 2 for Partial Disentanglement via Mechanism Sparsity

Disentanglement via mechanism sparsity was introduced recently as a principled approach to extract latent factors without supervision when the causal graph relating them in time is sparse, and/or when actions are observed and affect them sparsely. However, this theory applies only to ground-truth graphs satisfying a specific criterion. In this work, we introduce a generalization of this theory which applies to any ground-truth graph and specifies qualitatively how disentangled the learned representation is expected to be, via a new equivalence relation over models we call consistency. This equivalence captures which factors are expected to remain entangled and which are not based on the specific form of the ground-truth graph. We call this weaker form of identifiability partial disentanglement. The graphical criterion that allows complete disentanglement, proposed in an earlier work, can be derived as a special case of our theory. Finally, we enforce graph sparsity with constrained optimization and illustrate our theory and algorithm in simulations.

* Appears in: The First Workshop on Causal Representation Learning (CRL 2022) at UAI. 26 pages

Via

Access Paper or Ask Questions

Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services

Jul 31, 2022
Hongyang Du, Jiacheng Wang, Dusit Niyato, Jiawen Kang, Zehui Xiong, Xuemin, Shen, Dong In Kim

Figure 1 for Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services

Figure 2 for Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services

Figure 3 for Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services

Figure 4 for Exploring Attention-Aware Network Resource Allocation for Customized Metaverse Services

Emerging with the support of computing and communications technologies, Metaverse is expected to bring users unprecedented service experiences. However, the increase in the number of Metaverse users places a heavy demand on network resources, especially for Metaverse services that are based on graphical extended reality and require rendering a plethora of virtual objects. To make efficient use of network resources and improve the Quality-of-Experience (QoE), we design an attention-aware network resource allocation scheme to achieve customized Metaverse services. The aim is to allocate more network resources to virtual objects in which users are more interested. We first discuss several key techniques related to Metaverse services, including QoE analysis, eye-tracking, and remote rendering. We then review existing datasets and propose the user-object-attention level (UOAL) dataset that contains the ground truth attention of 30 users to 96 objects in 1,000 images. A tutorial on how to use UOAL is presented. With the help of UOAL, we propose an attention-aware network resource allocation algorithm that has two steps, i.e., attention prediction and QoE maximization. Specially, we provide an overview of the designs of two types of attention prediction methods, i.e., interest-aware and time-aware prediction. By using the predicted user-object-attention values, network resources such as the rendering capacity of edge devices can be allocated optimally to maximize the QoE. Finally, we propose promising research directions related to Metaverse services.

Via

Access Paper or Ask Questions

ICME 2022 Few-shot LOGO detection top 9 solution

Jun 23, 2022
Ka Ho Tong, Ka Wai Cheung, Xiaochuan Yu

ICME-2022 few-shot logo detection competition is held in May, 2022. Participants are required to develop a single model to detect logos by handling tiny logo instances, similar brands, and adversarial images at the same time, with limited annotations. Our team achieved rank 16 and 11 in the first and second round of the competition respectively, with a final rank of 9th. This technical report summarized our major techniques used in this competitions, and potential improvement.

Via

Access Paper or Ask Questions