“Play by play”: A dataset of handball and basketball game situations in a standardized space

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.grupoInvLaboratorio de Investigación e Desenvolvemento en Intelixencia Artificial (LIDIA)es_ES
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicaciónes_ES
UDC.issue111265es_ES
UDC.journalTitleData in Briefes_ES
UDC.volume58es_ES
dc.contributor.authorCabado, Bruno
dc.contributor.authorGuijarro-Berdiñas, Bertha
dc.contributor.authorPadrón, Emilio J.
dc.date.accessioned2025-04-23T08:23:36Z
dc.date.available2025-04-23T08:23:36Z
dc.date.issued2025-02
dc.descriptionData Availability: "Play by Play": a dataset of handball and basketball game situations in a standardized space (Original data) (Zenodo): https://doi.org/10.5281/zenodo.12698090es_ES
dc.description.abstract[Abstract]: This paper presents a synthetic dataset of labeled game situations in recordings of federated handball and basketball matches played in Galicia, Spain. The dataset consists of synthetic data generated from real video frames, including 308,805 labeled handball frames and 56,578 labeled basketball frames extracted from 2105 handball and 383 basketball 5-s video clips. Experts manually labeled the video clips based on the respective sports, while the individual frames were automatically labeled using computer vision and machine learning techniques. The dataset encompasses seven classes of game situations: left attack, left counterattack, left penalty, right attack, right counterattack, right penalty, and timeout. In basketball, the penalty class refers to the free throws attempted by players after they have been fouled by an opposing player. Each frame in the dataset is assigned to one of these classes, considering the game situation and specific context. Importantly, the dataset does not contain actual video frames; instead, it provides a synthetic, normalized representation of each frame in JSON format. This tabular data includes player, referee, and ball positions on a normalized field, player and referee velocities, and key regions on the court. Positions of players, referees, and the ball were automatically inferred in each frame by an object detector, followed by a tracking step to detect object positions across frames and compute the velocity vectors. Finally, the obtained coordinates underwent normalization through a perspective transformation, ensuring that the data remained unaffected by variations in camera configurations across different arenas and camera setups. We refer to this standardized coordinate space as the 'unified space'. The dataset holds significant potential for reuse in various domains related to sports analytics and machine learning research. It can serve as a valuable resource for researchers, coaches, and sports enthusiasts, contributing to improvements in player performance, game strategies, match retransmissions, and sports-related technologies.es_ES
dc.description.sponsorshipThis work was supported by Grants PID2019-109238GB-C22 and PID2022-136435NB-I00, funded by MICINN/AEI/10.13039/501100011033, by Xunta de Galicia (Grants ED431C 2022/44, ED431C 2021/30, ED431F 2021/11) and by `ERDF A way of making Europe', EU. CITIC, as a center accredited for excellence within the Galician University System and a member of the CIGUS Network, receives subsidies from the Department of Education, Science, Universities, and Vocational Training of the Xunta de Galicia. Additionally, it is co-financed by the EU through the FEDER Galicia 2021-27 operational program (Ref. ED431G 2023/01). Cabado wish to thanks the Axencia Galega de Innovación the grant received through its Industrial Doctorate program (23/IN606D/2021/2612054).es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2022/44es_ES
dc.description.sponsorshipXunta de Galicia; ED431C 2021/30es_ES
dc.description.sponsorshipXunta de Galicia; ED431F 2021/11es_ES
dc.description.sponsorshipXunta de Galicia; ED431G 2023/01es_ES
dc.description.sponsorshipXunta de Galicia; IN606D/2021/2612054es_ES
dc.identifier.citationB. Cabado, B. Guijarro-Berdiñas, and E. J. Padrón, "Play by play: A dataset of handball and basketball game situations in a standardized space", Data in Brief, Vol. 58, Feb. 2025, 111265, doi: 10.1016/j.dib.2024.111265es_ES
dc.identifier.doi10.1016/j.dib.2024.111265
dc.identifier.issn2352-3409
dc.identifier.urihttp://hdl.handle.net/2183/41853
dc.language.isoenges_ES
dc.publisherElsevieres_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2019-109238GB-C22/ES/APRENDIZAJE AUTOMATICO ESCALABLE Y EXPLICABLEes_ES
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-136435NB-I00/ES/ARQUITECTURAS, FRAMEWORKS Y APLICACIONES DE LA COMPUTACION DE ALTAS PRESTACIONESes_ES
dc.relation.referencestohttps://doi.org/10.5281/zenodo.12698090
dc.relation.urihttps://doi.org/10.1016/j.dib.2024.111265es_ES
dc.rightsAtribución 3.0 Españaes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectSportses_ES
dc.subjectPlayerses_ES
dc.subjectBalles_ES
dc.subjectPositiones_ES
dc.subjectVelocityes_ES
dc.subjectNormalizedes_ES
dc.subjectGame situationes_ES
dc.title“Play by play”: A dataset of handball and basketball game situations in a standardized spacees_ES
dc.typejournal articlees_ES
dc.type.hasVersionVoRes_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationd839396d-454e-4ccd-9322-d3e89a876865
relation.isAuthorOfPublicationbdccb1db-e727-4b63-b2ca-1941cc096c00
relation.isAuthorOfPublication.latestForDiscoveryd839396d-454e-4ccd-9322-d3e89a876865

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
GuijarroBerdinas_Bertha_2025_Play_by_play.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format
Description: