Variable Rate Deep Image Compression With Modulated Autoencoder
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | es_ES |
| UDC.endPage | 335 | es_ES |
| UDC.grupoInv | Computer Graphics & Visual Computing (XLab) | es_ES |
| UDC.journalTitle | IEEE Signal Processing Letters | es_ES |
| UDC.startPage | 331 | es_ES |
| UDC.volume | 27 | es_ES |
| dc.contributor.advisor | Yang, Fei | |
| dc.contributor.author | Yang, Fei | |
| dc.contributor.author | Herranz, Luis | |
| dc.contributor.author | Van de Weijer, Joost | |
| dc.contributor.author | Iglesias-Guitian, Jose A. | |
| dc.contributor.author | López, Antonio M. | |
| dc.contributor.author | Mozerov, Mikhail G. | |
| dc.date.accessioned | 2025-05-08T12:50:16Z | |
| dc.date.available | 2025-05-08T12:50:16Z | |
| dc.date.issued | 2020-01-31 | |
| dc.description | This version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | es_ES |
| dc.description.abstract | [Abstract]: Variable rate is a requirement for flexible and adaptable image and video compression. However, deep image compression methods (DIC) are optimized for a single fixed rate-distortion (R-D) tradeoff. While this can be addressed by training multiple models for different tradeoffs, the memory requirements increase proportionally to the number of models. Scaling the bottleneck representation of a shared autoencoder can provide variable rate compression with a single shared autoencoder. However, the R-D performance using this simple mechanism degrades in low bitrates, and also shrinks the effective range of bitrates. To address these limitations, we formulate the problem of variable R-D optimization for DIC, and propose modulated autoencoders (MAEs), where the representations of a shared autoencoder are adapted to the specific R-D tradeoff via a modulation network. Jointly training this modulated autoencoder and the modulation network provides an effective way to navigate the R-D operational curve. Our experiments show that the proposed method can achieve almost the same R-D performance of independent models with significantly fewer parameters | es_ES |
| dc.description.sponsorship | This work was supported in part by Audi Electronics Venture GmbH. The Computer Vision Center (CVC) was supported by the Generalitat de Catalunya CERCA Program and its ACCIO agency. The work of F. Yang was supported by the Chinese Scholarship Council under Grant 201706290127. The work of L. Herranz was supported by the Spanish project RTI2018-102285-A-I00. The work of J. van de Weijer was supported by the Spanish project TIN2016-79717-R. The work of A. M. López and J. A. I. Guitián was supported by the project TIN2017-88709-R (MINECO/AEI/FEDER, UE). The work of A. M. López was also supported by the ICREA Academia programme. The work of J. A. I. Guitián and L. Herranz was supported by the EU’s Horizon 2020 Research and Innovation Programme under the Marie Skłodowska-Curie Grant 665919 | es_ES |
| dc.identifier.citation | F. Yang, L. Herranz, J. V. D. Weijer, J. A. I. Guitian, A. M. Lopez, y M. G. Mozerov, «Variable Rate Deep Image Compression With Modulated Autoencoder», IEEE Signal Process. Lett., vol. 27, pp. 331-335, 2020, doi: 10.1109/LSP.2020.2970539 | es_ES |
| dc.identifier.issn | 1558-2361 | |
| dc.identifier.issn | 1070-9908 | |
| dc.identifier.uri | http://hdl.handle.net/2183/41940 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | IEEE Xplore | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-102285-A-I00/ES/APRENDIZAJE PROFUNDO MULTIMODAL ESTRUCTURADO | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2016-79717-R/ES/APRENDIZAJE PROFUNDO MULTI-TAREA PARA RECONOCIMIENTO DE OBJETOS | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-88709-R/ES/DESARROLLO DE AGENTES DE NAVEGACION AUTONOMOS | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/665919 | es_ES |
| dc.relation.uri | https://doi.org/10.1109/LSP.2020.2970539 | es_ES |
| dc.rights | © 2020, IEEE | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.subject | Deep image compression | es_ES |
| dc.subject | Variable bitrate | es_ES |
| dc.subject | Autoencoder | es_ES |
| dc.subject | Modulated autoencoder | es_ES |
| dc.title | Variable Rate Deep Image Compression With Modulated Autoencoder | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | AM | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 2baabfcd-ac55-477b-a5db-4f31be84703f | |
| relation.isAuthorOfPublication.latestForDiscovery | 2baabfcd-ac55-477b-a5db-4f31be84703f |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Yang_Fei_2020_Variable_rate_deep_image_autoencoder.pdf
- Size:
- 3.09 MB
- Format:
- Adobe Portable Document Format
- Description:
- Accepted Manuscript

