Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Srivatsa Prativadibhayankaram

TreeNet: A Light Weight Model for Low Bitrate Image Compression

Dec 18, 2025

Mahadev Prasad Panda, Purnachandra Rao Makkena, Srivatsa Prativadibhayankaram, Siegfried Fößel, André Kaup

Figure 1 for TreeNet: A Light Weight Model for Low Bitrate Image Compression

Figure 2 for TreeNet: A Light Weight Model for Low Bitrate Image Compression

Figure 3 for TreeNet: A Light Weight Model for Low Bitrate Image Compression

Figure 4 for TreeNet: A Light Weight Model for Low Bitrate Image Compression

Abstract:Reducing computational complexity remains a critical challenge for the widespread adoption of learning-based image compression techniques. In this work, we propose TreeNet, a novel low-complexity image compression model that leverages a binary tree-structured encoder-decoder architecture to achieve efficient representation and reconstruction. We employ attentional feature fusion mechanism to effectively integrate features from multiple branches. We evaluate TreeNet on three widely used benchmark datasets and compare its performance against competing methods including JPEG AI, a recent standard in learning-based image compression. At low bitrates, TreeNet achieves an average improvement of 4.83% in BD-rate over JPEG AI, while reducing model complexity by 87.82%. Furthermore, we conduct extensive ablation studies to investigate the influence of various latent representations within TreeNet, offering deeper insights into the factors contributing to reconstruction.

Via

Access Paper or Ask Questions

A Study on the Effect of Color Spaces in Learned Image Compression

Jun 19, 2024

Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

Figure 1 for A Study on the Effect of Color Spaces in Learned Image Compression

Figure 2 for A Study on the Effect of Color Spaces in Learned Image Compression

Figure 3 for A Study on the Effect of Color Spaces in Learned Image Compression

Figure 4 for A Study on the Effect of Color Spaces in Learned Image Compression

Abstract:In this work, we present a comparison between color spaces namely YUV, LAB, RGB and their effect on learned image compression. For this we use the structure and color based learned image codec (SLIC) from our prior work, which consists of two branches - one for the luminance component (Y or L) and another for chrominance components (UV or AB). However, for the RGB variant we input all 3 channels in a single branch, similar to most learned image codecs operating in RGB. The models are trained for multiple bitrate configurations in each color space. We report the findings from our experiments by evaluating them on various datasets and compare the results to state-of-the-art image codecs. The YUV model performs better than the LAB variant in terms of MS-SSIM with a Bj{\o}ntegaard delta bitrate (BD-BR) gain of 7.5\% using VTM intra-coding mode as the baseline. Whereas the LAB variant has a better performance than YUV model in terms of CIEDE2000 having a BD-BR gain of 8\%. Overall, the RGB variant of SLIC achieves the best performance with a BD-BR gain of 13.14\% in terms of MS-SSIM and a gain of 17.96\% in CIEDE2000 at the cost of a higher model complexity.

* Accepter pre-print version for ICIP 2024

Via

Access Paper or Ask Questions

Efficient Learned Wavelet Image and Video Coding

May 21, 2024

Anna Meyer, Srivatsa Prativadibhayankaram, André Kaup

Figure 1 for Efficient Learned Wavelet Image and Video Coding

Figure 2 for Efficient Learned Wavelet Image and Video Coding

Figure 3 for Efficient Learned Wavelet Image and Video Coding

Figure 4 for Efficient Learned Wavelet Image and Video Coding

Abstract:Learned wavelet image and video coding approaches provide an explainable framework with a latent space corresponding to a wavelet decomposition. The wavelet image coder iWave++ achieves state-of-the-art performance and has been employed for various compression tasks, including lossy as well as lossless image, video, and medical data compression. However, the approaches suffer from slow decoding speed due to the autoregressive context model used in iWave++. In this paper, we show how a parallelized context model can be integrated into the iWave++ framework. Our experimental results demonstrate a speedup factor of over 350 and 240 for image and video compression, respectively. At the same time, the rate-distortion performance in terms of Bj{\o}ntegaard delta bitrate is slightly worse by 1.5\% for image coding and 1\% for video coding. In addition, we analyze the learned wavelet decomposition by visualizing its subband impulse responses.

* 7 pages, 11 figures, submitted to ICIP2024

Via

Access Paper or Ask Questions

SLIC: A Learned Image Codec Using Structure and Color

Jan 30, 2024

Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

Figure 1 for SLIC: A Learned Image Codec Using Structure and Color

Figure 2 for SLIC: A Learned Image Codec Using Structure and Color

Figure 3 for SLIC: A Learned Image Codec Using Structure and Color

Figure 4 for SLIC: A Learned Image Codec Using Structure and Color

Abstract:We propose the structure and color based learned image codec (SLIC) in which the task of compression is split into that of luminance and chrominance. The deep learning model is built with a novel multi-scale architecture for Y and UV channels in the encoder, where the features from various stages are combined to obtain the latent representation. An autoregressive context model is employed for backward adaptation and a hyperprior block for forward adaptation. Various experiments are carried out to study and analyze the performance of the proposed model, and to compare it with other image codecs. We also illustrate the advantages of our method through the visualization of channel impulse responses, latent channels and various ablation studies. The model achieves Bj{\o}ntegaard delta bitrate gains of 7.5% and 4.66% in terms of MS-SSIM and CIEDE2000 metrics with respect to other state-of-the-art reference codecs.

* Accepter paper for Data Compression Conference 2024

Via

Access Paper or Ask Questions

Color Learning for Image Compression

Jun 30, 2023

Srivatsa Prativadibhayankaram, Thomas Richter, Heiko Sparenberg, Siegfried Fößel

Figure 1 for Color Learning for Image Compression

Figure 2 for Color Learning for Image Compression

Figure 3 for Color Learning for Image Compression

Figure 4 for Color Learning for Image Compression

Abstract:Deep learning based image compression has gained a lot of momentum in recent times. To enable a method that is suitable for image compression and subsequently extended to video compression, we propose a novel deep learning model architecture, where the task of image compression is divided into two sub-tasks, learning structural information from luminance channel and color from chrominance channels. The model has two separate branches to process the luminance and chrominance components. The color difference metric CIEDE2000 is employed in the loss function to optimize the model for color fidelity. We demonstrate the benefits of our approach and compare the performance to other codecs. Additionally, the visualization and analysis of latent channel impulse response is performed.

Via

Access Paper or Ask Questions

Compressive Online Robust Principal Component Analysis with Optical Flow for Video Foreground-Background Separation

Oct 25, 2017

Srivatsa Prativadibhayankaram, Huynh Van Luong, Thanh-Ha Le, Andre Kaup

Figure 1 for Compressive Online Robust Principal Component Analysis with Optical Flow for Video Foreground-Background Separation

Figure 2 for Compressive Online Robust Principal Component Analysis with Optical Flow for Video Foreground-Background Separation

Figure 3 for Compressive Online Robust Principal Component Analysis with Optical Flow for Video Foreground-Background Separation

Figure 4 for Compressive Online Robust Principal Component Analysis with Optical Flow for Video Foreground-Background Separation

Abstract:In the context of online Robust Principle Component Analysis (RPCA) for the video foreground-background separation, we propose a compressive online RPCA with optical flow that separates recursively a sequence of frames into sparse (foreground) and low-rank (background) components. Our method considers a small set of measurements taken per data vector (frame), which is different from conventional batch RPCA, processing all the data directly. The proposed method also incorporates multiple prior information, namely previous foreground and background frames, to improve the separation and then updates the prior information for the next frame. Moreover, the foreground prior frames are improved by estimating motions between the previous foreground frames using optical flow and compensating the motions to achieve higher quality foreground prior. The proposed method is applied to online video foreground and background separation from compressive measurements. The visual and quantitative results show that our method outperforms the existing methods.

* preprint accepted

Via

Access Paper or Ask Questions