DSLXpert: LLM-driven Generic DSL Code Generation

UDC.coleccionInvestigaciónes_ES
UDC.conferenceTitleMODELS Companion ’24es_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.grupoInvLaboratorio de Bases de Datos (LBD)es_ES
dc.contributor.authorGarcía-González, Daniel
dc.contributor.authorLamas Sardiña, Víctor Juan
dc.contributor.authorRodríguez Luaces, Miguel
dc.date.accessioned2024-11-07T11:26:24Z
dc.date.available2024-11-07T11:26:24Z
dc.date.issued2024
dc.descriptionPresented at: In ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems (MODELS Companion ’24), September 22–27, 2024, Linz, Austriaes_ES
dc.description.abstract[Abstract]: Nowadays, large language models (LLMs) are an extremely useful and fast tool to complement and help in many jobs and current problems. However, there are cases where a pretty specific vocabulary is used in which these models were not previously trained, leading to less satisfactory results. More specifically, these models are less effective when dealing with less-known or unpublished domain-specific languages (DSLs). Within this field, the automatic generation of code based on such languages, starting from natural language, would speed up the development times of any related project, as well as the understanding of such DSLs. Therefore, this paper presents a tool in which developers can perform what is known as semantic parsing. In other words, the developer can ask a pre-trained LLM to translate a natural language instruction into the vocabulary of the established DSL. Thus, by setting the DSL grammar as context (grammar prompting) and providing usage examples (few-shot learning), the LLM can quickly generate reliable domain-specific code, significantly improving the quality of life of the developers. A video demonstration of the tool is shown in the following link: https://zenodo.org/records/12610506.es_ES
dc.description.sponsorshipCITIC is funded by the Xunta de Galicia through the collaboration agreement between the Department of Culture, Education, Vocational Training and Universities and the Galician universities for the reinforcement of the research centers of the Galician University System (CIGUS); partially funded by MCIN/AEI/10.13039/501100011033 and “NextGenerationEU”/PRTR: [PLAGEMIS: TED2021-129245BC21]; partially funded by MCIN/AEI/10.13039/501100011033 and EU/ERDF A way of making Europe: [EarthDL: PID2022-141027NBC21]; partially funded by GAIN/Xunta de Galicia: [GRC: ED431C2021/53 and ED431G 2023/01]es_ES
dc.description.sponsorshipXunta de Galicia; ED431C2021/53es_ES
dc.description.sponsorshipXunta de Galicia; ED431G 2023/01es_ES
dc.identifier.citationDaniel Garcia-Gonzalez, Victor Lamas, and Miguel R. Luaces. 2024. DSLXpert: LLM-driven Generic DSL Code Generation. In ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems (MODELS Companion ’24), September 22–27, 2024, Linz, Austria. ACM, New York, NY, USA, 5 pages. https://doi.org/10.1145/3652620.3687782es_ES
dc.identifier.doi10.1145/3652620.3687782
dc.identifier.urihttp://hdl.handle.net/2183/39984
dc.language.isoenges_ES
dc.publisherAssociation for Computing Machinery, Inces_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLAGEMISes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141027NB-C21/ES/MODELADO, DESCUBRIMIENTO, EXPLORACION Y ANALISIS DE DATA LAKES MEDIOAMBIENTALES [UDC]es_ES
dc.relation.urihttps://doi.org/10.1145/3652620.3687782es_ES
dc.rights© 2024 Owner/Author|ACM. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems, https://doi.org/10.1145/3652620.3687782es_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectDomain-specific languages (DSLs)es_ES
dc.subjectLarge language models (LLMs)es_ES
dc.subjectSemantic parsinges_ES
dc.subjectGrammar promptinges_ES
dc.subjectFew-shot learninges_ES
dc.titleDSLXpert: LLM-driven Generic DSL Code Generationes_ES
dc.typeconference outputes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication21118aa4-994c-45e0-aff0-4750156048b5
relation.isAuthorOfPublicationf5f01d97-f28d-46b7-b99d-3bd795ee2677
relation.isAuthorOfPublicationfbde3bd9-d786-4ef0-89ec-6af2091fa415
relation.isAuthorOfPublication.latestForDiscovery21118aa4-994c-45e0-aff0-4750156048b5

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
RodriguezLuaces_Miguel_2024_DSLXpert_LLM_driven_Generic_DSL_Code_Generation.pdf
Size:
2 MB
Format:
Adobe Portable Document Format
Description:
Versión aceptada