Variable Rate Deep Image Compression With Modulated Autoencoder

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Yang, Fei
Herranz, Luis
Van de Weijer, Joost
López, Antonio M.
Mozerov, Mikhail G.

Advisors

Yang, Fei

Other responsabilities

Journal Title

Bibliographic citation

F. Yang, L. Herranz, J. V. D. Weijer, J. A. I. Guitian, A. M. Lopez, y M. G. Mozerov, «Variable Rate Deep Image Compression With Modulated Autoencoder», IEEE Signal Process. Lett., vol. 27, pp. 331-335, 2020, doi: 10.1109/LSP.2020.2970539

Type of academic work

Academic degree

Abstract

[Abstract]: Variable rate is a requirement for flexible and adaptable image and video compression. However, deep image compression methods (DIC) are optimized for a single fixed rate-distortion (R-D) tradeoff. While this can be addressed by training multiple models for different tradeoffs, the memory requirements increase proportionally to the number of models. Scaling the bottleneck representation of a shared autoencoder can provide variable rate compression with a single shared autoencoder. However, the R-D performance using this simple mechanism degrades in low bitrates, and also shrinks the effective range of bitrates. To address these limitations, we formulate the problem of variable R-D optimization for DIC, and propose modulated autoencoders (MAEs), where the representations of a shared autoencoder are adapted to the specific R-D tradeoff via a modulation network. Jointly training this modulated autoencoder and the modulation network provides an effective way to navigate the R-D operational curve. Our experiments show that the proposed method can achieve almost the same R-D performance of independent models with significantly fewer parameters

Description

This version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Rights

© 2020, IEEE