Should I render or should AI Generate? Crafting Synthetic Semantic Segmentation Datasets with Controlled Generation

Bibliographic citation

O. A. Mures, M. Silva, M. Lijó-Sanchez, E. J. Padrón and J. A. Iglesias-Guitian, "Should I render or should AI Generate? Crafting Synthetic Semantic Segmentation Datasets with Controlled Generation," in IEEE Computer Graphics and Applications, doi: 10.1109/MCG.2025.3553494

Abstract

This work explores the integration of generative AI models for automatically generating synthetic labeled image data. Our approach leverages controllable Diffusion Models to generate synthetic variations of semantically labeled images. Synthetic datasets for semantic segmentation struggle to represent real-world subtleties, such as varying weather conditions or fine details, and typically rely on costly simulations and rendering. However, Diffusion Models can generate diverse images from input text prompts and guidance images, such as semantic masks. Our work introduces and tests a novel methodology for generating labeled synthetic images, with an initial focus on semantic segmentation, a demanding computer vision task. We showcase our approach in two distinct image segmentation domains, outperforming traditional computer graphics simulations in efficiently creating diverse datasets and training downstream models. We leverage generative models for crafting synthetically labeled images, posing the question: "Should I render or should AI generate?" Our results endorse a paradigm shift towards controlled generation models.

Description

Accepted version, published in "Early Access" mode by the publisher.

Rights

Attribution 3.0 Spain

Except where otherwise noted, this item's license is described as Attribution 3.0 Spain.