Variable Rate Deep Image Compression With Modulated Autoencoder

Yang, Fei; Herranz, Luis; Van de Weijer, Joost; Iglesias-Guitian, Jose A.; López, Antonio M.; Mozerov, Mikhail G.

Variable Rate Deep Image Compression With Modulated Autoencoder

UDC.coleccion	Investigación	es_ES
UDC.departamento	Ciencias da Computación e Tecnoloxías da Información	es_ES
UDC.endPage	335	es_ES
UDC.grupoInv	Computer Graphics & Visual Computing (XLab)	es_ES
UDC.journalTitle	IEEE Signal Processing Letters	es_ES
UDC.startPage	331	es_ES
UDC.volume	27	es_ES
dc.contributor.advisor	Yang, Fei
dc.contributor.author	Yang, Fei
dc.contributor.author	Herranz, Luis
dc.contributor.author	Van de Weijer, Joost
dc.contributor.author	Iglesias-Guitian, Jose A.
dc.contributor.author	López, Antonio M.
dc.contributor.author	Mozerov, Mikhail G.
dc.date.accessioned	2025-05-08T12:50:16Z
dc.date.available	2025-05-08T12:50:16Z
dc.date.issued	2020-01-31
dc.description	This version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	es_ES
dc.description.abstract	[Abstract]: Variable rate is a requirement for flexible and adaptable image and video compression. However, deep image compression methods (DIC) are optimized for a single fixed rate-distortion (R-D) tradeoff. While this can be addressed by training multiple models for different tradeoffs, the memory requirements increase proportionally to the number of models. Scaling the bottleneck representation of a shared autoencoder can provide variable rate compression with a single shared autoencoder. However, the R-D performance using this simple mechanism degrades in low bitrates, and also shrinks the effective range of bitrates. To address these limitations, we formulate the problem of variable R-D optimization for DIC, and propose modulated autoencoders (MAEs), where the representations of a shared autoencoder are adapted to the specific R-D tradeoff via a modulation network. Jointly training this modulated autoencoder and the modulation network provides an effective way to navigate the R-D operational curve. Our experiments show that the proposed method can achieve almost the same R-D performance of independent models with significantly fewer parameters	es_ES
dc.description.sponsorship	This work was supported in part by Audi Electronics Venture GmbH. The Computer Vision Center (CVC) was supported by the Generalitat de Catalunya CERCA Program and its ACCIO agency. The work of F. Yang was supported by the Chinese Scholarship Council under Grant 201706290127. The work of L. Herranz was supported by the Spanish project RTI2018-102285-A-I00. The work of J. van de Weijer was supported by the Spanish project TIN2016-79717-R. The work of A. M. López and J. A. I. Guitián was supported by the project TIN2017-88709-R (MINECO/AEI/FEDER, UE). The work of A. M. López was also supported by the ICREA Academia programme. The work of J. A. I. Guitián and L. Herranz was supported by the EU’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie Grant 665919	es_ES
dc.identifier.citation	F. Yang, L. Herranz, J. V. D. Weijer, J. A. I. Guitian, A. M. Lopez, y M. G. Mozerov, «Variable Rate Deep Image Compression With Modulated Autoencoder», IEEE Signal Process. Lett., vol. 27, pp. 331-335, 2020, doi: 10.1109/LSP.2020.2970539	es_ES
dc.identifier.issn	1558-2361
dc.identifier.issn	1070-9908
dc.identifier.uri	http://hdl.handle.net/2183/41940
dc.language.iso	eng	es_ES
dc.publisher	IEEE Xplore	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-102285-A-I00/ES/APRENDIZAJE PROFUNDO MULTIMODAL ESTRUCTURADO	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-79717-R/ES/APRENDIZAJE PROFUNDO MULTI-TAREA PARA RECONOCIMIENTO DE OBJETOS	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-88709-R/ES/DESARROLLO DE AGENTES DE NAVEGACION AUTONOMOS	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/H2020/665919	es_ES
dc.relation.uri	https://doi.org/10.1109/LSP.2020.2970539	es_ES
dc.rights	© 2020, IEEE	es_ES
dc.rights.accessRights	open access	es_ES
dc.subject	Deep image compression	es_ES
dc.subject	Variable bitrate	es_ES
dc.subject	Autoencoder	es_ES
dc.subject	Modulated autoencoder	es_ES
dc.title	Variable Rate Deep Image Compression With Modulated Autoencoder	es_ES
dc.type	journal article	es_ES
dc.type.hasVersion	AM	es_ES
dspace.entity.type	Publication
relation.isAuthorOfPublication	2baabfcd-ac55-477b-a5db-4f31be84703f
relation.isAuthorOfPublication.latestForDiscovery	2baabfcd-ac55-477b-a5db-4f31be84703f

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Yang_Fei_2020_Variable_rate_deep_image_autoencoder.pdf
Size:: 3.09 MB
Format:: Adobe Portable Document Format
Description:: Accepted Manuscript

Download

Collections

Investigación (FIC)