Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sinan Kalkan

KOVAN Research Lab, Dept. of Computer Engineering, Middle East Technical University, Ankara, Turkey

What is in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Aug 20, 2018

İlker Bozcan, Yağmur Oymak, İdil Zeynep Alemdar, Sinan Kalkan

Figure 1 for What is in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Figure 2 for What is in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Figure 3 for What is in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Figure 4 for What is in the scene? A Hybrid Deep Boltzmann Machine For Contextualized Scene Modeling

Abstract:Scene models allow robots to reason about what is in the scene, what else should be in it, and what should not be in it. In this paper, we propose a hybrid Boltzmann Machine (BM) for scene modeling where relations between objects are integrated. To be able to do that, we extend BM to include tri-way edges between visible (object) nodes and make the network to share the relations across different objects. We evaluate our method against several baseline models (Deep Boltzmann Machines, and Restricted Boltzmann Machines) on a scene classification dataset, and show that it performs better in several scene reasoning tasks.

* 6 pages, 7 figures, submitted to ICRA 2018

Via

Access Paper or Ask Questions

CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Jul 29, 2018

Fethiye Irmak Doğan, İlker Bozcan, Mehmet Çelik, Sinan Kalkan

Figure 1 for CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Figure 2 for CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Figure 3 for CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Figure 4 for CINet: A Learning Based Approach to Incremental Context Modeling in Robots

Abstract:There have been several attempts at modeling context in robots. However, either these attempts assume a fixed number of contexts or use a rule-based approach to determine when to increment the number of contexts. In this paper, we pose the task of when to increment as a learning problem, which we solve using a Recurrent Neural Network. We show that the network successfully (with 98\% testing accuracy) learns to predict when to increment, and demonstrate, in a scene modeling problem (where the correct number of contexts is not known), that the robot increments the number of contexts in an expected manner (i.e., the entropy of the system is reduced). We also present how the incremental model can be used for various scene reasoning tasks.

* The first two authors have contributed equally, 6 pages, 8 figures, International Conference on Intelligent Robots (IROS 2018)

Via

Access Paper or Ask Questions

Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Jul 05, 2018

Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

Figure 1 for Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Figure 2 for Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Figure 3 for Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Figure 4 for Localization Recall Precision (LRP): A New Performance Metric for Object Detection

Abstract:Average precision (AP), the area under the recall-precision (RP) curve, is the standard performance measure for object detection. Despite its wide acceptance, it has a number of shortcomings, the most important of which are (i) the inability to distinguish very different RP curves, and (ii) the lack of directly measuring bounding box localization accuracy. In this paper, we propose 'Localization Recall Precision (LRP) Error', a new metric which we specifically designed for object detection. LRP Error is composed of three components related to localization, false negative (FN) rate and false positive (FP) rate. Based on LRP, we introduce the 'Optimal LRP', the minimum achievable LRP error representing the best achievable configuration of the detector in terms of recall-precision and the tightness of the boxes. In contrast to AP, which considers precisions over the entire recall domain, Optimal LRP determines the 'best' confidence score threshold for a class, which balances the trade-off between localization and recall-precision. In our experiments, we show that, for state-of-the-art object (SOTA) detectors, Optimal LRP provides richer and more discriminative information than AP. We also demonstrate that the best confidence score thresholds vary significantly among classes and detectors. Moreover, we present LRP results of a simple online video object detector which uses a SOTA still image object detector and show that the class-specific optimized thresholds increase the accuracy against the common approach of using a general threshold for all classes. At https://github.com/cancam/LRP we provide the source code that can compute LRP for the PASCAL VOC and MSCOCO datasets. Our source code can easily be adapted to other datasets as well.

* to appear in ECCV 2018

Via

Access Paper or Ask Questions

A Deep Incremental Boltzmann Machine for Modeling Context in Robots

Mar 02, 2018

Fethiye Irmak Doğan, Hande Çelikkanat, Sinan Kalkan

Figure 1 for A Deep Incremental Boltzmann Machine for Modeling Context in Robots

Figure 2 for A Deep Incremental Boltzmann Machine for Modeling Context in Robots

Figure 3 for A Deep Incremental Boltzmann Machine for Modeling Context in Robots

Figure 4 for A Deep Incremental Boltzmann Machine for Modeling Context in Robots

Abstract:Context is an essential capability for robots that are to be as adaptive as possible in challenging environments. Although there are many context modeling efforts, they assume a fixed structure and number of contexts. In this paper, we propose an incremental deep model that extends Restricted Boltzmann Machines. Our model gets one scene at a time, and gradually extends the contextual model when necessary, either by adding a new context or a new context layer to form a hierarchy. We show on a scene classification benchmark that our method converges to a good estimate of the contexts of the scenes, and performs better or on-par on several tasks compared to other incremental models or non-incremental models.

* 6 pages, 5 figures, International Conference on Robotics and Automation (ICRA 2018)

Via

Access Paper or Ask Questions

A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Oct 14, 2017

Osman Tursun, Cemal Aker, Sinan Kalkan

Figure 1 for A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Figure 2 for A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Figure 3 for A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Figure 4 for A Large-scale Dataset and Benchmark for Similar Trademark Retrieval

Abstract:Trademark retrieval (TR) has become an important yet challenging problem due to an ever increasing trend in trademark applications and infringement incidents. There have been many promising attempts for the TR problem, which, however, fell impracticable since they were evaluated with limited and mostly trivial datasets. In this paper, we provide a large-scale dataset with benchmark queries with which different TR approaches can be evaluated systematically. Moreover, we provide a baseline on this benchmark using the widely-used methods applied to TR in the literature. Furthermore, we identify and correct two important issues in TR approaches that were not addressed before: reversal of contrast, and presence of irrelevant text in trademarks severely affect the TR methods. Lastly, we applied deep learning, namely, several popular Convolutional Neural Network models, to the TR problem. To the best of the authors, this is the first attempt to do so.

Via

Access Paper or Ask Questions

Using Deep Networks for Drone Detection

Jun 18, 2017

Cemal Aker, Sinan Kalkan

Figure 1 for Using Deep Networks for Drone Detection

Figure 2 for Using Deep Networks for Drone Detection

Figure 3 for Using Deep Networks for Drone Detection

Figure 4 for Using Deep Networks for Drone Detection

Abstract:Drone detection is the problem of finding the smallest rectangle that encloses the drone(s) in a video sequence. In this study, we propose a solution using an end-to-end object detection model based on convolutional neural networks. To solve the scarce data problem for training the network, we propose an algorithm for creating an extensive artificial dataset by combining background-subtracted real images. With this approach, we can achieve precision and recall values both of which are high at the same time.

* To appear in International Workshop on Small-Drone Surveillance, Detection and Counteraction Techniques organised within AVSS 2017

Via

Access Paper or Ask Questions