Cost-sensitive reinforcement learning for credit risk

C-Rella, Jorge; Martínez Rego, David; Vilar, Juan M.

Use this link to cite:

http://hdl.handle.net/2183/41967

Cost-sensitive reinforcement learning for credit risk

Files

C-Rella_Jorge_2025_Cost_sensitive_reinforcement.pdf (4.23 MB)

Identifiers

URI: http://hdl.handle.net/2183/41967

Publication date

2025-02-04

Authors

C-Rella, Jorge

Martínez Rego, David

Vilar, Juan M.

Bibliographic citation

J. C-Rella, D. Martinez Rego, y J. M. Vilar, «Cost-sensitive reinforcement learning for credit risk», Expert Systems with Applications, vol. 272, p. 126708, may 2025, doi: 10.1016/j.eswa.2025.126708

Abstract

[Abstract]: Credit risk problems are dynamic because customer behavior is not stable, and they are cost-sensitive because the impact of a decision depends on the amount of the loan. Online learning algorithms, which evolve as more information becomes available, are an appropriate tool to study these dynamic problems. However, only information on approved transactions is available, which can lead to unfair biases and opportunity costs. Within reinforcement learning, bandit algorithms address this by balancing exploitation (acting according to the current model) and exploration (considering an action with limited information to improve predictions). The only remaining gap is to address the problem taking into account the different classification costs. This paper introduces cost-sensitive reinforcement learning algorithms to solve the credit risk problem from a dynamic perspective maximizing long-term benefits, proposing a cost-sensitive passive-aggressive algorithm and a cost-sensitive logistic bandit. Experiments on benchmark datasets and extensive simulation studies demonstrate the effectiveness and efficiency of the proposed algorithms