Document-level event extraction from Italian crime news using minimal data

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Bonisoli, Giovanni
Rollo, Federica
Po, Laura

Advisors

Other responsabilities

Journal Title

Bibliographic citation

G. Bonisoli, D. Vilares, F. Rollo, y L. Po, «Document-level event extraction from Italian crime news using minimal data», Knowledge-Based Systems, vol. 317, p. 113386, may 2025, doi: 10.1016/j.knosys.2025.113386

Type of academic work

Academic degree

Abstract

[Abstract]: Event extraction from unstructured text is a critical task in natural language processing, often requiring substantial annotated data. This study presents an approach to document-level event extraction applied to Italian crime news, utilizing large language models (LLMs) with minimal labeled data. Our method leverages zero-shot prompting and in-context learning to effectively extract relevant event information. We address three key challenges: (1) identifying text spans corresponding to event entities, (2) associating related spans dispersed throughout the text with the same entity, and (3) formatting the extracted data into a structured JSON. The findings are promising: LLMs achieve an F1-score of approximately 60% for detecting event-related text spans, demonstrating their potential even in resource-constrained settings. This work represents a significant advancement in utilizing LLMs for tasks traditionally dependent on extensive data, showing that meaningful results are achievable with minimal data annotation. Additionally, the proposed approach outperforms several baselines, confirming its robustness and adaptability to various event extraction scenarios

Description

Financiado para publicación en acceso aberto: Universidade da Coruña/CISUG

Rights

Atribución 3.0 España
© 2025 The Authors. Published by Elsevier B.V.
Atribución 3.0 España

Except where otherwise noted, this item's license is described as Atribución 3.0 España