Cost-sensitive reinforcement learning for credit risk
| UDC.coleccion | Investigación | es_ES |
| UDC.departamento | Matemáticas | es_ES |
| UDC.grupoInv | Modelización, Optimización e Inferencia Estatística (MODES) | es_ES |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | es_ES |
| UDC.journalTitle | Expert Systems with Applications | es_ES |
| UDC.startPage | 126708 | es_ES |
| UDC.volume | 272 | es_ES |
| dc.contributor.author | C-Rella, Jorge | |
| dc.contributor.author | Martínez Rego, David | |
| dc.contributor.author | Vilar, Juan M. | |
| dc.date.accessioned | 2025-05-12T11:45:35Z | |
| dc.date.available | 2025-05-12T11:45:35Z | |
| dc.date.issued | 2025-02-04 | |
| dc.description.abstract | [Abstract]: Credit risk problems are dynamic because customer behavior is not stable, and they are cost-sensitive because the impact of a decision depends on the amount of the loan. Online learning algorithms, which evolve as more information becomes available, are an appropriate tool to study these dynamic problems. However, only information on approved transactions is available, which can lead to unfair biases and opportunity costs. Within reinforcement learning, bandit algorithms address this by balancing exploitation (acting according to the current model) and exploration (considering an action with limited information to improve predictions). The only remaining gap is to address the problem taking into account the different classification costs. This paper introduces cost-sensitive reinforcement learning algorithms to solve the credit risk problem from a dynamic perspective maximizing long-term benefits, proposing a cost-sensitive passive-aggressive algorithm and a cost-sensitive logistic bandit. Experiments on benchmark datasets and extensive simulation studies demonstrate the effectiveness and efficiency of the proposed algorithms | es_ES |
| dc.description.sponsorship | This research is part of the grants PID2020-113578RB-I00 and PID2023-147127OB-I00 "ERDF/EU", funded by MCIN/AEI/10.13039/ 501100011033/. It has also been supported by the Xunta de Galicia, Spain (Grupos de Referencia Competitiva ED431C-2024/14) and by CITIC as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021–27 operational program (Ref. ED431G 2023/01). The first author was financed by the Axencia Galega de Innovación Industrial PhD Grant 14-IN606D-2021-2607768 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431C-2024/14 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; ED431G 2023/01 | es_ES |
| dc.description.sponsorship | Xunta de Galicia; 14-IN606D-2021-2607768 | es_ES |
| dc.identifier.citation | J. C-Rella, D. Martinez Rego, y J. M. Vilar, «Cost-sensitive reinforcement learning for credit risk», Expert Systems with Applications, vol. 272, p. 126708, may 2025, doi: 10.1016/j.eswa.2025.126708 | es_ES |
| dc.identifier.issn | 1873-6793 | |
| dc.identifier.issn | 0957-4174 | |
| dc.identifier.uri | http://hdl.handle.net/2183/41967 | |
| dc.language.iso | eng | es_ES |
| dc.publisher | Elsevier | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113578RB-I00/ES/MÉTODOS ESTADÍSTICOS FLEXIBLES EN CIENCIA DE DATOS PARA DATOS COMPLEJOS Y DE GRAN VOLUMEN: TEORÍA Y APLICACIONES | es_ES |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2023-147127OB-I00 | es_ES |
| dc.relation.uri | https://doi.org/10.1016/j.eswa.2025.126708 | es_ES |
| dc.rights | Atribución 3.0 España | es_ES |
| dc.rights | © 2025 The Authors | es_ES |
| dc.rights.accessRights | open access | es_ES |
| dc.rights.uri | http://creativecommons.org/licenses/by/3.0/es/ | * |
| dc.subject | Cost-sensitive classification | es_ES |
| dc.subject | Reinforcement learning | es_ES |
| dc.subject | Bandit algorithms | es_ES |
| dc.subject | Online learning | es_ES |
| dc.subject | Credit risk | es_ES |
| dc.subject | Decision making | es_ES |
| dc.title | Cost-sensitive reinforcement learning for credit risk | es_ES |
| dc.type | journal article | es_ES |
| dc.type.hasVersion | VoR | es_ES |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 8266f7ba-97e2-451f-9c0a-5501266378e0 | |
| relation.isAuthorOfPublication.latestForDiscovery | 8266f7ba-97e2-451f-9c0a-5501266378e0 |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- C-Rella_Jorge_2025_Cost_sensitive_reinforcement.pdf
- Size:
- 4.23 MB
- Format:
- Adobe Portable Document Format
- Description:

