Should I render or should AI Generate? Crafting Synthetic Semantic Segmentation Datasets with Controlled Generation

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.departamentoEnxeñaría Civiles_ES
UDC.departamentoEnxeñaría de Computadoreses_ES
UDC.grupoInvComputer Graphics & Visual Computing (XLab)es_ES
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicaciónes_ES
UDC.issueEarly Publicationes_ES
UDC.journalTitleIEEE Computer Graphics and Applicationses_ES
dc.contributor.authorMures, Omar A.
dc.contributor.authorSilva, Manuel
dc.contributor.authorLijó-Sanchez, Manuel
dc.contributor.authorPadrón, Emilio J.
dc.contributor.authorIglesias-Guitian, Jose A.
dc.date.accessioned2025-04-30T11:14:13Z
dc.date.available2025-04-30T11:14:13Z
dc.date.issued2025-03-21
dc.descriptionVersión aceptada, publicada en modo "Early Publication" por el editor.es_ES
dc.description.abstract[Abstract]: This work explores the integration of generative AI models for automatically generating synthetic image-labeled data. Our approach leverages controllable Diffusion Models to generate synthetic variations of semantically labeled images. Synthetic datasets for semantic segmentation struggle to represent real-world subtleties, such as different weather conditions or fine details, typically relying on costly simulations and rendering. However, Diffusion Models can generate diverse images using input text prompts and guidance images, like semantic masks. Our work introduces and tests a novel methodology for generating labeled synthetic images, with an initial focus on semantic segmentation, a demanding computer vision task. We showcase our approach in two distinct image segmentation domains, outperforming traditional computer graphics simulations in efficiently creating diverse datasets and training downstream models. We leverage generative models for crafting synthetically labeled images, posing the question: “Should I render or should AI generate?”. Our results endorse a paradigm shift towards controlled generation modelses_ES
dc.description.sponsorshipXunta de Galicia: ED431F 2021/11es_ES
dc.description.sponsorshipXunta de Galicia: ED431G 2019/01es_ES
dc.description.sponsorshipThe authors would like to thank José R. López from Deep Design Systems S.L. for the car parts dataset (PART). This work has been supported by Xunta de Galicia: ED431F 2021/11 and ED431G 2019/01. It was also partially supported through the research projects AEI/PID2020-115734RB-C22 and PID2022-136435NB-I00, funded by MCIN/AEI/10.13039/501100011033 and by ‘ERDF A way of making Europe’, EU. Jose A. Iglesias-Guitian also acknowledges the UDC-Inditex InTalent programme and the Spanish Ministry of Science and Innovation (AEI/RYC2018-025385-I)es_ES
dc.identifier.citationO. A. Mures, M. Silva, M. Lijó-Sanchez, E. J. Padrón and J. A. Iglesias-Guitian, "Should I render or should AI Generate? Crafting Synthetic Semantic Segmentation Datasets with Controlled Generation," in IEEE Computer Graphics and Applications, doi: 10.1109/MCG.2025.3553494es_ES
dc.identifier.issn0272-1716
dc.identifier.issn1558-1756
dc.identifier.urihttp://hdl.handle.net/2183/41893
dc.language.isoenges_ES
dc.publisherIEEEes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-115734RB-C22/ES/GENERACIÓN PROCEDURAL DE ESCENARIOS AUMENTADOS CON ANOTACIÓN DE DATOS AUTOMÁTICAes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-136435NB-I00/ES/ARQUITECTURAS, FRAMEWORKS Y APLICACIONES DE LA COMPUTACION DE ALTAS PRESTACIONESes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RYC2018-025385-I/ES/es_ES
dc.relation.urihttps://doi.org/10.1109/MCG.2025.3553494es_ES
dc.rightsAtribución 3.0 Españaes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectAutomobileses_ES
dc.subjectComputational modelinges_ES
dc.subjectTraininges_ES
dc.subjectRoadses_ES
dc.subjectSemanticses_ES
dc.subjectAnnotationses_ES
dc.subjectSynthetic dataes_ES
dc.subjectUrban areases_ES
dc.subjectSnowes_ES
dc.subjectGenerative AIes_ES
dc.titleShould I render or should AI Generate? Crafting Synthetic Semantic Segmentation Datasets with Controlled Generationes_ES
dc.typejournal articlees_ES
dc.type.hasVersionAMes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication532a32fe-d0a1-4634-84b5-d8f87c2ccae3
relation.isAuthorOfPublicationbdccb1db-e727-4b63-b2ca-1941cc096c00
relation.isAuthorOfPublication2baabfcd-ac55-477b-a5db-4f31be84703f
relation.isAuthorOfPublication.latestForDiscovery532a32fe-d0a1-4634-84b5-d8f87c2ccae3

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Mures_Omar_2025_Should_I_render_or_should_AI_generate_crafting.pdf
Size:
8.2 MB
Format:
Adobe Portable Document Format
Description:
Accepted version