Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Timo Gerasimow

VisionISP: Repurposing the Image Signal Processor for Computer Vision Applications

Nov 14, 2019

Chyuan-Tyng Wu, Leo F. Isikdogan, Sushma Rao, Bhavin Nayak, Timo Gerasimow, Aleksandar Sutic, Liron Ain-kedem, Gilad Michael

Figure 1 for VisionISP: Repurposing the Image Signal Processor for Computer Vision Applications

Figure 2 for VisionISP: Repurposing the Image Signal Processor for Computer Vision Applications

Figure 3 for VisionISP: Repurposing the Image Signal Processor for Computer Vision Applications

Figure 4 for VisionISP: Repurposing the Image Signal Processor for Computer Vision Applications

Abstract:Traditional image signal processors (ISPs) are primarily designed and optimized to improve the image quality perceived by humans. However, optimal perceptual image quality does not always translate into optimal performance for computer vision applications. We propose a set of methods, which we collectively call VisionISP, to repurpose the ISP for machine consumption. VisionISP significantly reduces data transmission needs by reducing the bit-depth and resolution while preserving the relevant information. The blocks in VisionISP are simple, content-aware, and trainable. Experimental results show that VisionISP boosts the performance of a subsequent computer vision system trained to detect objects in an autonomous driving setting. The results demonstrate the potential and the practicality of VisionISP for computer vision applications.

* IEEE International Conference on Image Processing (ICIP), 2019, pp. 4624-4628

Via

Access Paper or Ask Questions

Eye Contact Correction using Deep Neural Networks

Jun 12, 2019

Leo F. Isikdogan, Timo Gerasimow, Gilad Michael

Figure 1 for Eye Contact Correction using Deep Neural Networks

Figure 2 for Eye Contact Correction using Deep Neural Networks

Figure 3 for Eye Contact Correction using Deep Neural Networks

Figure 4 for Eye Contact Correction using Deep Neural Networks

Abstract:In a typical video conferencing setup, it is hard to maintain eye contact during a call since it requires looking into the camera rather than the display. We propose an eye contact correction model that restores the eye contact regardless of the relative position of the camera and display. Unlike previous solutions, our model redirects the gaze from an arbitrary direction to the center without requiring a redirection angle or camera/display/user geometry as inputs. We use a deep convolutional neural network that inputs a monocular image and produces a vector field and a brightness map to correct the gaze. We train this model in a bi-directional way on a large set of synthetically generated photorealistic images with perfect labels. The learned model is a robust eye contact corrector which also predicts the input gaze implicitly at no additional cost. Our system is primarily designed to improve the quality of video conferencing experience. Therefore, we use a set of control mechanisms to prevent creepy results and to ensure a smooth and natural video conferencing experience. The entire eye contact correction system runs end-to-end in real-time on a commodity CPU and does not require any dedicated hardware, making our solution feasible for a variety of devices.

Via

Access Paper or Ask Questions

Automatic ISP image quality tuning using non-linear optimization

Feb 24, 2019

Jun Nishimura, Timo Gerasimow, Sushma Rao, Aleksandar Sutic, Chyuan-Tyng Wu, Gilad Michael

Figure 1 for Automatic ISP image quality tuning using non-linear optimization

Figure 2 for Automatic ISP image quality tuning using non-linear optimization

Figure 3 for Automatic ISP image quality tuning using non-linear optimization

Figure 4 for Automatic ISP image quality tuning using non-linear optimization

Abstract:Image Signal Processor (ISP) comprises of various blocks to reconstruct image sensor raw data to final image consumed by human visual system or computer vision applications. Each block typically has many tuning parameters due to the complexity of the operation. These need to be hand tuned by Image Quality (IQ) experts, which takes considerable amount of time. In this paper, we present an automatic IQ tuning using nonlinear optimization and automatic reference generation algorithms. The proposed method can produce high quality IQ in minutes as compared with weeks of hand-tuned results by IQ experts. In addition, the proposed method can work with any algorithms without being aware of their specific implementation. It was found successful on multiple different processing blocks such as noise reduction, demosaic, and sharpening.

* 2018 25th IEEE International Conference on Image Processing (ICIP), 2471-2475
* 5 pages, 2018 25th IEEE International Conference on Image Processing (ICIP), 2471-2475

Via

Access Paper or Ask Questions