Variable Rate Deep Image Compression With Modulated Autoencoder

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.endPage335es_ES
UDC.grupoInvComputer Graphics & Visual Computing (XLab)es_ES
UDC.journalTitleIEEE Signal Processing Letterses_ES
UDC.startPage331es_ES
UDC.volume27es_ES
dc.contributor.advisorYang, Fei
dc.contributor.authorYang, Fei
dc.contributor.authorHerranz, Luis
dc.contributor.authorVan de Weijer, Joost
dc.contributor.authorIglesias-Guitian, Jose A.
dc.contributor.authorLópez, Antonio M.
dc.contributor.authorMozerov, Mikhail G.
dc.date.accessioned2025-05-08T12:50:16Z
dc.date.available2025-05-08T12:50:16Z
dc.date.issued2020-01-31
dc.descriptionThis version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.es_ES
dc.description.abstract[Abstract]: Variable rate is a requirement for flexible and adaptable image and video compression. However, deep image compression methods (DIC) are optimized for a single fixed rate-distortion (R-D) tradeoff. While this can be addressed by training multiple models for different tradeoffs, the memory requirements increase proportionally to the number of models. Scaling the bottleneck representation of a shared autoencoder can provide variable rate compression with a single shared autoencoder. However, the R-D performance using this simple mechanism degrades in low bitrates, and also shrinks the effective range of bitrates. To address these limitations, we formulate the problem of variable R-D optimization for DIC, and propose modulated autoencoders (MAEs), where the representations of a shared autoencoder are adapted to the specific R-D tradeoff via a modulation network. Jointly training this modulated autoencoder and the modulation network provides an effective way to navigate the R-D operational curve. Our experiments show that the proposed method can achieve almost the same R-D performance of independent models with significantly fewer parameterses_ES
dc.description.sponsorshipThis work was supported in part by Audi Electronics Venture GmbH. The Computer Vision Center (CVC) was supported by the Generalitat de Catalunya CERCA Program and its ACCIO agency. The work of F. Yang was supported by the Chinese Scholarship Council under Grant 201706290127. The work of L. Herranz was supported by the Spanish project RTI2018-102285-A-I00. The work of J. van de Weijer was supported by the Spanish project TIN2016-79717-R. The work of A. M. López and J. A. I. Guitián was supported by the project TIN2017-88709-R (MINECO/AEI/FEDER, UE). The work of A. M. López was also supported by the ICREA Academia programme. The work of J. A. I. Guitián and L. Herranz was supported by the EU’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie Grant 665919es_ES
dc.identifier.citationF. Yang, L. Herranz, J. V. D. Weijer, J. A. I. Guitian, A. M. Lopez, y M. G. Mozerov, «Variable Rate Deep Image Compression With Modulated Autoencoder», IEEE Signal Process. Lett., vol. 27, pp. 331-335, 2020, doi: 10.1109/LSP.2020.2970539es_ES
dc.identifier.issn1558-2361
dc.identifier.issn1070-9908
dc.identifier.urihttp://hdl.handle.net/2183/41940
dc.language.isoenges_ES
dc.publisherIEEE Xplorees_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-102285-A-I00/ES/APRENDIZAJE PROFUNDO MULTIMODAL ESTRUCTURADOes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-79717-R/ES/APRENDIZAJE PROFUNDO MULTI-TAREA PARA RECONOCIMIENTO DE OBJETOSes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-88709-R/ES/DESARROLLO DE AGENTES DE NAVEGACION AUTONOMOSes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/EC/H2020/665919es_ES
dc.relation.urihttps://doi.org/10.1109/LSP.2020.2970539es_ES
dc.rights© 2020, IEEEes_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectDeep image compressiones_ES
dc.subjectVariable bitratees_ES
dc.subjectAutoencoderes_ES
dc.subjectModulated autoencoderes_ES
dc.titleVariable Rate Deep Image Compression With Modulated Autoencoderes_ES
dc.typejournal articlees_ES
dc.type.hasVersionAMes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication2baabfcd-ac55-477b-a5db-4f31be84703f
relation.isAuthorOfPublication.latestForDiscovery2baabfcd-ac55-477b-a5db-4f31be84703f

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Yang_Fei_2020_Variable_rate_deep_image_autoencoder.pdf
Size:
3.09 MB
Format:
Adobe Portable Document Format
Description:
Accepted Manuscript