Advanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvRNASA - IMEDIR (INIBIC)
UDC.institutoCentroCITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación
UDC.journalTitleEcological Informatics
UDC.startPage103438
UDC.volume92
dc.contributor.authorRocruz, Elisabet
dc.contributor.authorMolares-Ulloa, Andrés
dc.contributor.authorPadin, Xosé A.
dc.contributor.authorNolasco, Rita
dc.contributor.authorRivero, Daniel
dc.contributor.authorFernández-Blanco, Enrique
dc.contributor.authorPazos, Yolanda
dc.contributor.authorDubert, Jesús
dc.date.accessioned2025-10-21T07:53:38Z
dc.date.available2025-10-21T07:53:38Z
dc.date.issued2025-12
dc.descriptionData related to cell counts of D. acuminata is available upon request to the Instituto Tecnolóxico para o Control do Medio Mariño de Galicia (INTECMAR). The remaining dataset used in this study is available on Zenodo ( https://doi.org/10.5281/zenodo.17143117) and the code used to reproduce our machine learning models results is accessible on GitHub ( https://github.com/AndresMolares/hab_featureSelection_study).
dc.description.abstract[Abstract]: The presence of toxin-producers phytoplankton is a natural phenomenon that threatens marine ecosystems, endangers human health, and causes significant economic losses in shellfish harvesting. The Galician Rías Baixas (NW Spain) are one of the main mussels producing regions worldwide and the leading producer in Europe. Annual occurrence of Dinophysis acuminata, responsible for diarrhetic shellfish poisoning toxins, lead to a ban on the mussel harvesting for several months, each year. To help mitigate these impacts, this study explores the prediction of D. acuminata cells densities 3-days ahead in the outer and inner parts of three of the Rías Baixas (Arousa, Pontevedra and Vigo), testing three local machine learning (ML) models: Artificial Neural Network (ANN), Random Forest (RF) and Support Vector Machine (SVM). Local ML models were selected to account for the differences in occurrence and variability of D. acuminata densities across the different parts of each Ría. These ML models were assessed by (1) reducing the number of features through a feature selection approach to identify the most relevant ones, (2) exploring different sets of features and (3) comparing models trained with 7 and 30 days of past information. The dataset combined daily hydrodynamic and biological features, from 2013 to 2019, obtained from a high-resolution 3D hydrodynamic model (CROCO), and in-situ observations. Our results show that RF provided the best predictive performance. Increasing the number of days of past information did not significantly improve results, as similar averaged R2 values were obtained for 7 and 30 days: 0.75 for Ría de Arousa, 0.72 for Ría de Pontevedra, and 0.67 for Ría de Vigo. Feature selection process showed that, as expected, previous cells densities of D. acuminata were essential for capturing bloom timing and amplitude. Also, the temperature, salinity, and the vertical and meridional components of current velocity were key predictors at outer stations of the Ría de Pontevedra and Vigo, where more features were required. In contrast, for the other stations, good predictions were achieved using only five features. This study represents one of the first attempts to predict D. acuminata in the Rías Baixas using local ML models. Our findings highlight the need for local approaches, as bloom dynamics vary between Rías and within different parts of each Ría. We also demonstrate the value of hydrodynamic model outputs to train ML models and compensate for the lack of long-term, spatially extensive in-situ data.
dc.description.sponsorshipThe authors would like to thank INTECMAR for creating the dataset related to the cell count of Dinophysis acuminata and CESGA, who allowed the run of the simulations in their installations. Thanks are also due for the financial support to CESAM by FCT/MCTES (UIDP/50017/2020+UIDB/50017/2020+LA/P/0094/2020), through national funds. ER was supported by the Portuguese Science and Technology Foundation (FCT) through PhD fellowship PD/BD/143085/2018, within the scope of the National Strategic Reference Framework (NSRF) and the Human Potential Operational Programme (POPH), co-financed by the European Fund and national funds from the Ministry of Science, Technology and Higher Education (MC-TES). ER has also received funding through DATAMARE, from Galicia Marine Science programme, which forms part of the Complementary Science Plans for Marine Science of Ministerio de Ciencia, Innovación and Universidades included in the Recovery, Transformation and Resilience Plan (PRTR-C17.I1), funded through Xunta de Galicia with NextGenerationEU and the European Maritime Fisheries and Aquaculture Funds. ER and RN were supported by MITECO programme for the Spanish Recovery, Transformation and Resilience Plan (European Union Recovery and Resilience Mechanism established by Regulation (EU) 2020/2094), funded by the European Union -NexGenerationEU-. CITIC is funded by the Xunta de Galicia through the collaboration agreement between the Regional Ministry of Culture, Education, Vocational Training and Universities and the Galician universities to strengthen the research centres of the Galician University System (CIGUS). Grant PID2021-126289OA-I00 funded by MCIN/AEI/10.13039/501100011033 and by ERDF A way of making Europe. This work was also partially supported by the Xunta de Galicia and the ERDF Funds A way of making Europe with grant (Ref. ED431C 2022/46). This work was supported by MAGIC project (PID2024-156623OB-C22) funded by MICIU/AEI/10.13039/501100011033 and by FEDER, UE; and by REDEIRA project (TED2021-132188B-I00) funded by MICIU/AEI/10.13039/501100011033 and by Unión Europea NextGenerationEU/PRTR . Part of the funding for this work also comes from the consolidation and structuring funds for Competitive Research Units of the GAIN-Xunta de Galicia, modality A: Competitive Reference Groups (IN607A2025-05). We thank to the editor and reviewers for their constructive comments and rapid response, which greatly helped us improve the manuscript.
dc.description.sponsorshipPortugal. Fundação para a Ciência e a Tecnologia; UIDP/50017/2020
dc.description.sponsorshipPortugal. Fundação para a Ciência e a Tecnologia; UIDB/50017/2020
dc.description.sponsorshipPortugal. Fundação para a Ciência e a Tecnologia; LA/P/0094/2020
dc.description.sponsorshipPortugal. Fundação para a Ciência e a Tecnologia; PD/BD/143085/2018
dc.description.sponsorshipXunta de Galicia; ED431C 2022/46
dc.description.sponsorshipXunta de Galicia; IN607A2025-05
dc.description.urihttps://zenodo.org/records/17143117
dc.description.urihttps://github.com/AndresMolares/hab_featureSelection_study
dc.identifier.citationE. Rocruz, A. Molares-Ulloa, X. A. Padin, R. Nolasco, D. Rivero, E. Fernandez-Blanco, Y. Pazos and J. Dubert, "Advanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management", Ecological Informatics, Vol. 92, Dec. 2025, 103438, https://doi.org/10.1016/j.ecoinf.2025.103438
dc.identifier.doi10.1016/j.ecoinf.2025.103438
dc.identifier.issn1878-0512
dc.identifier.issn1574-9541
dc.identifier.urihttps://hdl.handle.net/2183/46029
dc.language.isoeng
dc.publisherElsevier
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-126289OA-I00/ES/TRACKING Y ANÁLISIS DEL COMPORTAMIENTO ANIMAL CON TÉCNICAS DE VISIÓN ARTIFICIAL Y DEEP LEARNING
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2024-156623OB-C22/ES/MODELADO E ANÁLISE DO CRECEMENTO DO FITOPLANCTON NAS RÍAS
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-132188B-I00/ES/INVESTIGACION, DESARROLLO E INNOVACION DE UNA RED DE OBSERVACION COSTERA: RIA DE AROUSA
dc.relation.urihttps://doi.org/10.1016/j.ecoinf.2025.103438
dc.rights.accessRightsopen access
dc.subjectMachine learning
dc.subjectHarmful algal blooms
dc.subjectDinophysis acuminata
dc.subject3D hydrodynamic model
dc.subjectFeature selection
dc.titleAdvanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management
dc.typejournal article
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublicationd8e10433-ea19-4a35-8cc6-0c7b9f143a6d
relation.isAuthorOfPublication244a6828-de1c-45f3-86b6-69bb81250814
relation.isAuthorOfPublication.latestForDiscoveryd8e10433-ea19-4a35-8cc6-0c7b9f143a6d

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Rivero_Daniel_2025_Advanced_ocean_modelling_and_machine_learning_to_forecast.pdf
Size:
3.97 MB
Format:
Adobe Portable Document Format