DSLXpert: LLM-driven Generic DSL Code Generation
| UDC.coleccion | Investigación | es_ES |
| UDC.conferenceTitle | MODELS Companion ’24 | es_ES |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | es_ES |
| UDC.grupoInv | Laboratorio de Bases de Datos (LBD) | es_ES |
| dc.contributor.author | García-González, Daniel | |
| dc.contributor.author | Lamas Sardiña, Víctor Juan | |
| dc.contributor.author | Rodríguez Luaces, Miguel | |
| dc.date.accessioned | 2024-11-07T11:26:24Z | |
| dc.date.available | 2024-11-07T11:26:24Z | |
| dc.date.issued | 2024 | |
| dc.description | Presented at: In ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems (MODELS Companion ’24), September 22–27, 2024, Linz, Austria | es_ES |
| dc.description.abstract | [Abstract]: Nowadays, large language models (LLMs) are an extremely useful and fast tool to complement and help in many jobs and current problems. However, there are cases where a pretty specific vocabulary is used in which these models were not previously trained, leading to less satisfactory results. More specifically, these models are less effective when dealing with less-known or unpublished domain-specific languages (DSLs). Within this field, the automatic generation of code based on such languages, starting from natural language, would speed up the development times of any related project, as well as the understanding of such DSLs. Therefore, this paper presents a tool in which developers can perform what is known as semantic parsing. In other words, the developer can ask a pre-trained LLM to translate a natural language instruction into the vocabulary of the established DSL. Thus, by setting the DSL grammar as context (grammar prompting) and providing usage examples (few-shot learning), the LLM can quickly generate reliable domain-specific code, significantly improving the quality of life of the developers. A video demonstration of the tool is shown in the following link: https://zenodo.org/records/12610506. | es_ES |
| dc.description.sponsorship | CITIC is funded by the Xunta de Galicia through the collaboration agreement between the Department of Culture, Education, Vocational Training and Universities and the Galician universities for the reinforcement of the research centers of the Galician University System (CIGUS); partially funded by MCIN/AEI/10.13039/501100011033 and “NextGenerationEU”/PRTR: [PLAGEMIS: TED2021-129245BC21]; partially funded by MCIN/AEI/10.13039/501100011033 and EU/ERDF A way of making Europe: [EarthDL: PID2022-141027NBC21]; partially funded by GAIN/Xunta de Galicia: [GRC: ED431C2021/53 and ED431G 2023/01] | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C2021/53 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2023/01 | es_ES |
| dc.identifier.citation | Daniel Garcia-Gonzalez, Victor Lamas, and Miguel R. Luaces. 2024. DSLXpert: LLM-driven Generic DSL Code Generation. In ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems (MODELS Companion ’24), September 22–27, 2024, Linz, Austria. ACM, New York, NY, USA, 5 pages. https://doi.org/10.1145/3652620.3687782 | es_ES |
| dc.identifier.doi | 10.1145/3652620.3687782 | |
| dc.identifier.uri | http://hdl.handle.net/2183/39984 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Association for Computing Machinery, Inc | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-129245B-C21/ES/PLAGEMIS | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/MINECO/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-141027NB-C21/ES/MODELADO, DESCUBRIMIENTO, EXPLORACION Y ANALISIS DE DATA LAKES MEDIOAMBIENTALES [UDC] | es_ES |
| dc.relation.uri | https://doi.org/10.1145/3652620.3687782 | es_ES |
| dc.rights | © 2024 Owner/Author|ACM. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ACM/IEEE 27th International Conference on Model Driven Engineering Languages and Systems, https://doi.org/10.1145/3652620.3687782 | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.subject | Domain-specific languages (DSLs) | es_ES |
| dc.subject | Large language models (LLMs) | es_ES |
| dc.subject | Semantic parsing | es_ES |
| dc.subject | Grammar prompting | es_ES |
| dc.subject | Few-shot learning | es_ES |
| dc.title | DSLXpert: LLM-driven Generic DSL Code Generation | es_ES |
| dc.type | conference output | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 21118aa4-994c-45e0-aff0-4750156048b5 | |
| relation.isAuthorOfPublication | f5f01d97-f28d-46b7-b99d-3bd795ee2677 | |
| relation.isAuthorOfPublication | fbde3bd9-d786-4ef0-89ec-6af2091fa415 | |
| relation.isAuthorOfPublication.latestForDiscovery | 21118aa4-994c-45e0-aff0-4750156048b5 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- RodriguezLuaces_Miguel_2024_DSLXpert_LLM_driven_Generic_DSL_Code_Generation.pdf
- Size:
- 2 MB
- Format:
- Adobe Portable Document Format
- Description:
- Versión aceptada

