University of Surrey, UK
Abstract:With the advantages of low storage cost and high retrieval efficiency, hashing techniques have recently become an emerging topic in cross-modal similarity search. As data from multiple modalities reflect similar semantic content, many approaches aim to learn unified binary codes. However, the hashing features learned by these methods are not sufficiently discriminative, which results in lower accuracy and robustness. We propose a novel hashing learning framework, termed Discriminative Supervised Hashing (DSH), which jointly performs classifier learning, subspace learning and matrix factorization to preserve class-specific semantic content and to learn discriminative unified binary codes for multi-modal data. In addition, to reduce the loss of information and preserve the non-linear structure of the data, DSH non-linearly projects different modalities into a common space in which the similarity among heterogeneous data points can be measured. Extensive experiments conducted on three publicly available datasets demonstrate that the framework proposed in this paper outperforms several state-of-the-art methods.
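As a toy illustration of the general idea of unified binary codes described above (not the DSH algorithm itself; the projection matrix and code length are placeholders), one could quantise projected features by their sign and retrieve by Hamming distance:

```python
import numpy as np

def binary_codes(features, projection):
    # Illustrative only: project real-valued features of one modality into a
    # common space with a (learned) matrix and quantise by the sign function.
    return (features @ projection > 0).astype(np.uint8)

def hamming_retrieval(query_code, database_codes):
    # Rank database items by Hamming distance to the query code.
    dists = (query_code[None, :] != database_codes).sum(axis=1)
    return np.argsort(dists)
```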
Abstract:3D face reconstruction is an important task in the field of computer vision. Although 3D face reconstruction has been developing rapidly in recent years, reconstructing faces under large poses remains a challenge, because much of the information about a face in a large pose is not observable. To address this issue, this paper proposes a novel 3D face reconstruction algorithm (PIFR) based on the 3D Morphable Model (3DMM). Given a single input face image, it first generates a frontal image by normalizing the input. Then we compute a weighted sum of the 3D parameters of the two images. Our method overcomes the limitations of traditional single-image face reconstruction methods under large poses, works on arbitrary poses and expressions, and greatly improves the accuracy of reconstruction. Experiments on the challenging AFW, LFPW and AFLW databases show that our algorithm significantly improves the accuracy of 3D face reconstruction even under extreme poses.
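The parameter fusion step mentioned above can be pictured with a minimal sketch (the weighting scheme and parameterisation used in PIFR itself may differ; `fuse_3dmm_params` and its arguments are hypothetical names):

```python
import numpy as np

def fuse_3dmm_params(params_original, params_frontal, weight=0.5):
    # Combine the 3DMM parameter vectors estimated from the original
    # (possibly large-pose) image and from its frontalised version by a
    # weighted sum, as outlined in the abstract.
    params_original = np.asarray(params_original, dtype=float)
    params_frontal = np.asarray(params_frontal, dtype=float)
    return weight * params_frontal + (1.0 - weight) * params_original
```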
Abstract:We present a new loss function, namely Wing loss, for robust facial landmark localisation with Convolutional Neural Networks (CNNs). We first compare and analyse different loss functions including L2, L1 and smooth L1. The analysis of these loss functions suggests that, for the training of a CNN-based localisation model, more attention should be paid to small and medium range errors. To this end, we design a piece-wise loss function. The new loss amplifies the impact of errors from the interval (-w, w) by switching from L1 loss to a modified logarithm function. To address the problem of under-representation of samples with large out-of-plane head rotations in the training set, we propose a simple but effective boosting strategy, referred to as pose-based data balancing. In particular, we deal with the data imbalance problem by duplicating the minority training samples and perturbing them by injecting random image rotation, bounding box translation and other data augmentation approaches. Last, the proposed approach is extended to create a two-stage framework for robust facial landmark localisation. The experimental results obtained on AFLW and 300W demonstrate the merits of the Wing loss function, and prove the superiority of the proposed method over the state-of-the-art approaches.
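A minimal NumPy sketch of such a piece-wise loss, assuming the standard form in which an offset constant keeps the two pieces continuous at |x| = w (the parameter values shown are placeholders, not necessarily those used in the paper):

```python
import numpy as np

def wing_loss(errors, w=10.0, epsilon=2.0):
    # Behaves like a scaled logarithm for small/medium errors in (-w, w)
    # and like L1 (shifted by a constant C for continuity) outside it.
    abs_err = np.abs(errors)
    C = w - w * np.log(1.0 + w / epsilon)   # offset making the pieces meet at |x| = w
    log_part = w * np.log(1.0 + abs_err / epsilon)
    l1_part = abs_err - C
    return np.where(abs_err < w, log_part, l1_part)

# Small errors are amplified relative to plain L1; large errors grow linearly.
print(wing_loss(np.array([0.5, 5.0, 20.0])))
```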
Abstract:Face detection and recognition benchmarks have shifted toward more difficult environments. The challenge presented in this paper addresses the next step in the direction of automatic detection and identification of people from outdoor surveillance cameras. While face detection has shown remarkable success in images collected from the web, surveillance cameras include more diverse occlusions, poses, weather conditions and image blur. Although face verification or closed-set face identification have surpassed human capabilities on some datasets, open-set identification is much more complex as it needs to reject both unknown identities and false accepts from the face detector. We show that unconstrained face detection can approach high detection rates albeit with moderate false accept rates. By contrast, open-set face recognition is currently weak and requires much more attention.
Abstract:The paper introduces a new efficient nonlinear one-class classifier formulated as the Rayleigh quotient criterion optimisation. The method, operating in a reproducing kernel Hilbert space, minimises the scatter of the target distribution along an optimal projection direction while at the same time keeping projections of positive observations distant from the mean of the negative class. We provide a graph embedding view of the problem which can then be solved efficiently using the spectral regression approach. In this sense, unlike previous similar methods which often require costly eigen-computations of dense matrices, the proposed approach casts the problem under consideration into a regression framework which is computationally more efficient. In particular, it is shown that the dominant complexity of the proposed method is the complexity of computing the kernel matrix. Additional appealing characteristics of the proposed one-class classifier are: 1) the ability to be trained in an incremental fashion (allowing for application in streaming data scenarios while also reducing the computational complexity in a non-streaming operation mode); 2) being unsupervised, while providing the option of refining the solution using negative training examples, when available; and, last but not least, 3) the use of the kernel trick, which facilitates a nonlinear mapping of the data into a high-dimensional feature space to seek better solutions.
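A very rough sketch of the spectral-regression idea referred to above: instead of eigen-decomposing dense matrices, the projection is obtained by solving a regularised least-squares problem in the kernel space. The constant embedding target used here is a simplifying assumption, not the paper's exact graph-embedding target:

```python
import numpy as np

def rbf_kernel(X, Y, gamma=1.0):
    d2 = ((X[:, None, :] - Y[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def fit_one_class_sr(X_target, delta=1e-2, gamma=1.0):
    # Regularised regression in place of a dense eigen-decomposition;
    # the dominant cost is computing the kernel matrix K.
    K = rbf_kernel(X_target, X_target, gamma)
    y = np.ones(K.shape[0])                      # placeholder embedding target
    alpha = np.linalg.solve(K + delta * np.eye(K.shape[0]), y)
    return alpha

def project(x, X_target, alpha, gamma=1.0):
    # Projection of a test point onto the learned direction.
    return float(rbf_kernel(x[None, :], X_target, gamma) @ alpha)
```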
Abstract:The human face is a 3D object with shape and surface texture. The 3D Morphable Model (3DMM) is a powerful tool for reconstructing the 3D face from a single 2D face image. In the shape fitting process, 3DMM estimates the correspondence between 2D and 3D landmarks. Most traditional 3DMM fitting methods fail to reconstruct an accurate model because face shape fitting is a difficult non-linear optimization problem. In this paper we show that landmark weighting is instrumental in improving the accuracy of shape reconstruction, and we propose a novel 3D Morphable Model fitting method. Different from previous works that treat all landmarks equally, we take into consideration the estimated error for each pair of corresponding 2D and 3D landmarks, and weight the landmark points in the optimization cost function based on these errors. The landmarks carry different semantics because they are located on different facial components, and since the fitting solution is only approximate, there are deviations in landmark matching; landmarks with different semantics therefore have different effects on the reconstructed 3D face. Thus, it is necessary to consider each landmark individually. To our knowledge, we are the first to analyze each feature point for 3D face reconstruction by 3DMM. The weights adapt to the estimation residuals of the landmarks. Experimental results show that the proposed method significantly reduces the reconstruction error and improves the authenticity of the 3D model expression.
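The per-landmark weighting can be illustrated with a generic iteratively re-weighted least-squares sketch (this is not the paper's exact algorithm; the linearised system `A`, `b` and the inverse-residual weighting rule are assumptions):

```python
import numpy as np

def weighted_landmark_fit(A, b, n_iters=5, eps=1e-6):
    # A: (2 * n_points, n_params) linearised fitting system, rows ordered (x1, y1, x2, y2, ...)
    # b: (2 * n_points,) observed 2D landmark coordinates
    n_points = b.shape[0] // 2
    weights = np.ones(n_points)
    p = np.zeros(A.shape[1])
    for _ in range(n_iters):
        W = np.repeat(weights, 2)                   # one weight per (x, y) pair
        p, *_ = np.linalg.lstsq(A * W[:, None], b * W, rcond=None)  # row-weighted least squares
        errs = np.linalg.norm((A @ p - b).reshape(n_points, 2), axis=1)
        weights = 1.0 / (errs + eps)                # larger residual -> smaller weight
        weights /= weights.sum()
    return p, weights
```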
Abstract:Hashing techniques have been applied broadly in large-scale retrieval tasks due to their low storage requirements and high processing speed. Many hashing methods have shown promising performance, but as they fail to exploit all the structural information in learning the hashing function, they leave scope for improvement. This paper proposes a novel discrete hashing learning framework which jointly performs classifier learning and subspace learning for cross-modal retrieval. Concretely, the proposed framework includes two stages, namely a kernelization process and a quantization process. The aim of kernelization is to learn a common subspace where heterogeneous data can be fused. The quantization process is designed to learn discriminative unified hashing codes. Extensive experiments on three publicly available datasets clearly indicate the superiority of our method compared with the state-of-the-art methods.
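A sketch of the two stages named above (the kernel choice, anchor selection and the way W is learned are all assumptions here, not the paper's specification):

```python
import numpy as np

def kernelise(X, anchors, gamma=1.0):
    # Kernelization stage: map raw features of a modality to RBF similarities
    # with a set of anchor points, giving a non-linear common representation.
    d2 = ((X[:, None, :] - anchors[None, :, :]) ** 2).sum(-1)
    return np.exp(-gamma * d2)

def quantise(Z, W):
    # Quantization stage: project kernelised features with a learned matrix W
    # and take signs to obtain discrete hashing codes.
    return np.sign(Z @ W).astype(np.int8)
```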
Abstract:With efficient appearance learning models, the Discriminative Correlation Filter (DCF) has proven very successful in recent video object tracking benchmarks and competitions. However, the existing DCF paradigm suffers from two major problems, i.e., spatial boundary effects and temporal filter degeneration. To mitigate these challenges, we propose a new DCF-based tracking method. The key innovations of the proposed method include adaptive spatial feature selection and temporal consistency constraints, with which the new tracker enables joint spatio-temporal filter learning in a lower-dimensional discriminative manifold. More specifically, we apply structured sparsity constraints to multi-channel filters. Consequently, the process of learning spatial filters can be approximated by lasso regularisation. To encourage temporal consistency, the filter model is restricted to lie around its historical value and is updated locally to preserve the global structure in the manifold. Last, a unified optimisation framework is proposed to jointly select temporal consistency preserving spatial features and learn discriminative filters with the augmented Lagrangian method. Qualitative and quantitative evaluations have been conducted on a number of well-known benchmarking datasets such as OTB2013, OTB50, OTB100, Temple-Colour and UAV123. The experimental results demonstrate the superiority of the proposed method over the state-of-the-art approaches.
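Two ingredients mentioned above, structured (group) sparsity over spatial positions and a temporal consistency term, can be pictured with generic operators (these are illustrative building blocks, not the tracker's full augmented-Lagrangian solver):

```python
import numpy as np

def group_soft_threshold(filters, lam):
    # Group-lasso proximal step: treat all channel values at one spatial
    # location as a group and shrink the whole group, so uninformative
    # spatial positions are suppressed.  filters: (height, width, channels)
    norms = np.linalg.norm(filters, axis=2, keepdims=True)
    scale = np.maximum(1.0 - lam / (norms + 1e-12), 0.0)
    return filters * scale

def temporal_update(new_filter, prev_filter, mu=0.1):
    # Keep the filter close to its historical value (temporal consistency).
    return (new_filter + mu * prev_filter) / (1.0 + mu)
```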
Abstract:The one-class anomaly detection approach has previously been found to be effective in face presentation attack detection, especially in an \textit{unseen} attack scenario, where the system is exposed to novel types of attacks. This work follows the same anomaly-based formulation of the problem and analyses the merits of deploying \textit{client-specific} information for face spoofing detection. We propose training one-class client-specific classifiers (both generative and discriminative) using representations obtained from pre-trained deep convolutional neural networks. Next, based on subject-specific score distributions, a distinct threshold is set for each client, which is then used for decision making regarding a test query. Through extensive experiments using different one-class systems, it is shown that the use of client-specific information in a one-class anomaly detection formulation (both in model construction as well as decision threshold tuning) improves the performance significantly. In addition, it is demonstrated that the same set of deep convolutional features used for recognition purposes is effective for face presentation attack detection in the client-specific one-class anomaly detection paradigm.
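A minimal sketch of the client-specific formulation, assuming a generative one-class model per client and a percentile rule for the per-client threshold (the model type, number of components and percentile are illustrative choices, not the paper's exact configuration):

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def train_client_models(features_per_client, n_components=1, percentile=5):
    # Fit one one-class model per enrolled client on that client's bona-fide
    # deep features and set a client-specific threshold from the training scores.
    models = {}
    for client_id, feats in features_per_client.items():
        gmm = GaussianMixture(n_components=n_components).fit(feats)
        threshold = np.percentile(gmm.score_samples(feats), percentile)
        models[client_id] = (gmm, threshold)
    return models

def is_bona_fide(feature, client_id, models):
    gmm, threshold = models[client_id]
    return gmm.score_samples(feature[None, :])[0] >= threshold
```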
Abstract:In image set classification, considerable progress has been made by representing original image sets on Grassmann manifolds. In order to extend the advantages of Euclidean dimensionality reduction methods to the Grassmann manifold, several methods have recently been suggested which jointly perform dimensionality reduction and metric learning on the Grassmann manifold to improve performance. Nevertheless, when applied to complex datasets, the learned features do not exhibit enough discriminatory power. To overcome this problem, we propose a new method named Grassmannian Discriminant Maps (GDM) for manifold dimensionality reduction problems. The core of the method is a new discriminant function for metric learning and dimensionality reduction. For comparison and better understanding, we also study simple variations of GDM, the key difference between them being the discriminant function. We evaluate the proposed method and its simple extensions on datasets corresponding to three tasks: face recognition, object categorization, and hand gesture recognition. The results achieved show the effectiveness of the proposed algorithm compared with the state of the art.
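A generic discriminant-map sketch of the kind of projection referred to above (not the exact GDM formulation; the scatter matrices are assumed to have been computed from the manifold representation of the image sets):

```python
import numpy as np
from scipy.linalg import eigh

def discriminant_projection(S_between, S_within, n_dims):
    # Take the leading generalised eigenvectors of the between-class versus
    # within-class scatter as the dimensionality-reducing projection.
    reg = 1e-6 * np.eye(S_within.shape[0])      # small regulariser for stability
    eigvals, eigvecs = eigh(S_between, S_within + reg)
    order = np.argsort(eigvals)[::-1]
    return eigvecs[:, order[:n_dims]]
```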