Use this link to cite:
http://hdl.handle.net/2183/41940 Variable Rate Deep Image Compression With Modulated Autoencoder
Loading...
Identifiers
Publication date
Authors
Yang, Fei
Herranz, Luis
Van de Weijer, Joost
López, Antonio M.
Mozerov, Mikhail G.
Advisors
Yang, Fei
Other responsabilities
Journal Title
Bibliographic citation
F. Yang, L. Herranz, J. V. D. Weijer, J. A. I. Guitian, A. M. Lopez, y M. G. Mozerov, «Variable Rate Deep Image Compression With Modulated Autoencoder», IEEE Signal Process. Lett., vol. 27, pp. 331-335, 2020, doi: 10.1109/LSP.2020.2970539
Type of academic work
Academic degree
Abstract
[Abstract]: Variable rate is a requirement for flexible and adaptable image and video compression. However, deep image compression methods (DIC) are optimized for a single fixed rate-distortion (R-D) tradeoff. While this can be addressed by training multiple models for different tradeoffs, the memory requirements increase proportionally to the number of models. Scaling the bottleneck representation of a shared autoencoder can provide variable rate compression with a single shared autoencoder. However, the R-D performance using this simple mechanism degrades in low bitrates, and also shrinks the effective range of bitrates. To address these limitations, we formulate the problem of variable R-D optimization for DIC, and propose modulated autoencoders (MAEs), where the representations of a shared autoencoder are adapted to the specific R-D tradeoff via a modulation network. Jointly training this modulated autoencoder and the modulation network provides an effective way to navigate the R-D operational curve. Our experiments show that the proposed method can achieve almost the same R-D performance of independent models with significantly fewer parameters
Description
This version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Editor version
Rights
© 2020, IEEE







