Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Pablo Arbelaez

BAOD: Budget-Aware Object Detection

Apr 10, 2019
Alejandro Pardo, Mengmeng Xu, Ali Thabet, Pablo Arbelaez, Bernard Ghanem

Figure 1 for BAOD: Budget-Aware Object Detection

Figure 2 for BAOD: Budget-Aware Object Detection

Figure 3 for BAOD: Budget-Aware Object Detection

Figure 4 for BAOD: Budget-Aware Object Detection

We study the problem of object detection from a novel perspective in which annotation budget constraints are taken into consideration, appropriately coined Budget Aware Object Detection (BAOD). When provided with a fixed budget, we propose a strategy for building a diverse and informative dataset that can be used to optimally train a robust detector. We investigate both optimization and learning-based methods to sample which images to annotate and what type of annotation (strongly or weakly supervised) to annotate them with. We adopt a hybrid supervised learning framework to train the object detector from both these types of annotation. We conduct a comprehensive empirical study showing that a handcrafted optimization method outperforms other selection techniques including random sampling, uncertainty sampling and active learning. By combining an optimal image/annotation selection scheme with hybrid supervised learning to solve the BAOD problem, we show that one can achieve the performance of a strongly supervised detector on PASCAL-VOC 2007 while saving 12.8% of its original annotation budget. Furthermore, when $100\%$ of the budget is used, it surpasses this performance by 2.0 mAP percentage points.

Via

Access Paper or Ask Questions

Multi-View Dynamic Facial Action Unit Detection

Aug 20, 2018
Andres Romero, Juan Leon, Pablo Arbelaez

Figure 1 for Multi-View Dynamic Facial Action Unit Detection

Figure 2 for Multi-View Dynamic Facial Action Unit Detection

Figure 3 for Multi-View Dynamic Facial Action Unit Detection

Figure 4 for Multi-View Dynamic Facial Action Unit Detection

We propose a novel convolutional neural network approach to address the fine-grained recognition problem of multi-view dynamic facial action unit detection. We leverage recent gains in large-scale object recognition by formulating the task of predicting the presence or absence of a specific action unit in a still image of a human face as holistic classification. We then explore the design space of our approach by considering both shared and independent representations for separate action units, and also different CNN architectures for combining color and motion information. We then move to the novel setup of the FERA 2017 Challenge, in which we propose a multi-view extension of our approach that operates by first predicting the viewpoint from which the video was taken, and then evaluating an ensemble of action unit detectors that were trained for that specific viewpoint. Our approach is holistic, efficient, and modular, since new action units can be easily included in the overall system. Our approach significantly outperforms the baseline of the FERA 2017 Challenge, with an absolute improvement of 14% on the F1-metric. Additionally, it compares favorably against the winner of the FERA 2017 challenge. Code source is available at https://github.com/BCV-Uniandes/AUNets.

Via

Access Paper or Ask Questions

Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation

Mar 01, 2016
Jordi Pont-Tuset, Pablo Arbelaez, Jonathan T. Barron, Ferran Marques, Jitendra Malik

Figure 1 for Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation

Figure 2 for Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation

Figure 3 for Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation

Figure 4 for Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation

We propose a unified approach for bottom-up hierarchical image segmentation and object proposal generation for recognition, called Multiscale Combinatorial Grouping (MCG). For this purpose, we first develop a fast normalized cuts algorithm. We then propose a high-performance hierarchical segmenter that makes effective use of multiscale information. Finally, we propose a grouping strategy that combines our multiscale regions into highly-accurate object proposals by exploring efficiently their combinatorial space. We also present Single-scale Combinatorial Grouping (SCG), a faster version of MCG that produces competitive proposals in under five second per image. We conduct an extensive and comprehensive empirical validation on the BSDS500, SegVOC12, SBD, and COCO datasets, showing that MCG produces state-of-the-art contours, hierarchical regions, and object proposals.

Via

Access Paper or Ask Questions

Learning to Segment Moving Objects in Videos

May 08, 2015
Katerina Fragkiadaki, Pablo Arbelaez, Panna Felsen, Jitendra Malik

Figure 1 for Learning to Segment Moving Objects in Videos

Figure 2 for Learning to Segment Moving Objects in Videos

Figure 3 for Learning to Segment Moving Objects in Videos

Figure 4 for Learning to Segment Moving Objects in Videos

We segment moving objects in videos by ranking spatio-temporal segment proposals according to "moving objectness": how likely they are to contain a moving object. In each video frame, we compute segment proposals using multiple figure-ground segmentations on per frame motion boundaries. We rank them with a Moving Objectness Detector trained on image and motion fields to detect moving objects and discard over/under segmentations or background parts of the scene. We extend the top ranked segments into spatio-temporal tubes using random walkers on motion affinities of dense point trajectories. Our final tube ranking consistently outperforms previous segmentation methods in the two largest video segmentation benchmarks currently available, for any number of proposals. Further, our per frame moving object proposals increase the detection rate up to 7\% over previous state-of-the-art static proposal methods.

Via

Access Paper or Ask Questions