Advanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management
| UDC.coleccion | Investigación | |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | |
| UDC.grupoInv | RNASA - IMEDIR (INIBIC) | |
| UDC.institutoCentro | CITIC - Centro de Investigación de Tecnoloxías da Información e da Comunicación | |
| UDC.journalTitle | Ecological Informatics | |
| UDC.startPage | 103438 | |
| UDC.volume | 92 | |
| dc.contributor.author | Rocruz, Elisabet | |
| dc.contributor.author | Molares-Ulloa, Andrés | |
| dc.contributor.author | Padin, Xosé A. | |
| dc.contributor.author | Nolasco, Rita | |
| dc.contributor.author | Rivero, Daniel | |
| dc.contributor.author | Fernández-Blanco, Enrique | |
| dc.contributor.author | Pazos, Yolanda | |
| dc.contributor.author | Dubert, Jesús | |
| dc.date.accessioned | 2025-10-21T07:53:38Z | |
| dc.date.available | 2025-10-21T07:53:38Z | |
| dc.date.issued | 2025-12 | |
| dc.description | Data related to cell counts of D. acuminata is available upon request to the Instituto Tecnolóxico para o Control do Medio Mariño de Galicia (INTECMAR). The remaining dataset used in this study is available on Zenodo ( https://doi.org/10.5281/zenodo.17143117) and the code used to reproduce our machine learning models results is accessible on GitHub ( https://github.com/AndresMolares/hab_featureSelection_study). | |
| dc.description.abstract | [Abstract]: The presence of toxin-producers phytoplankton is a natural phenomenon that threatens marine ecosystems, endangers human health, and causes significant economic losses in shellfish harvesting. The Galician Rías Baixas (NW Spain) are one of the main mussels producing regions worldwide and the leading producer in Europe. Annual occurrence of Dinophysis acuminata, responsible for diarrhetic shellfish poisoning toxins, lead to a ban on the mussel harvesting for several months, each year. To help mitigate these impacts, this study explores the prediction of D. acuminata cells densities 3-days ahead in the outer and inner parts of three of the Rías Baixas (Arousa, Pontevedra and Vigo), testing three local machine learning (ML) models: Artificial Neural Network (ANN), Random Forest (RF) and Support Vector Machine (SVM). Local ML models were selected to account for the differences in occurrence and variability of D. acuminata densities across the different parts of each Ría. These ML models were assessed by (1) reducing the number of features through a feature selection approach to identify the most relevant ones, (2) exploring different sets of features and (3) comparing models trained with 7 and 30 days of past information. The dataset combined daily hydrodynamic and biological features, from 2013 to 2019, obtained from a high-resolution 3D hydrodynamic model (CROCO), and in-situ observations. Our results show that RF provided the best predictive performance. Increasing the number of days of past information did not significantly improve results, as similar averaged R2 values were obtained for 7 and 30 days: 0.75 for Ría de Arousa, 0.72 for Ría de Pontevedra, and 0.67 for Ría de Vigo. Feature selection process showed that, as expected, previous cells densities of D. acuminata were essential for capturing bloom timing and amplitude. Also, the temperature, salinity, and the vertical and meridional components of current velocity were key predictors at outer stations of the Ría de Pontevedra and Vigo, where more features were required. In contrast, for the other stations, good predictions were achieved using only five features. This study represents one of the first attempts to predict D. acuminata in the Rías Baixas using local ML models. Our findings highlight the need for local approaches, as bloom dynamics vary between Rías and within different parts of each Ría. We also demonstrate the value of hydrodynamic model outputs to train ML models and compensate for the lack of long-term, spatially extensive in-situ data. | |
| dc.description.sponsorship | The authors would like to thank INTECMAR for creating the dataset related to the cell count of Dinophysis acuminata and CESGA, who allowed the run of the simulations in their installations. Thanks are also due for the financial support to CESAM by FCT/MCTES (UIDP/50017/2020+UIDB/50017/2020+LA/P/0094/2020), through national funds. ER was supported by the Portuguese Science and Technology Foundation (FCT) through PhD fellowship PD/BD/143085/2018, within the scope of the National Strategic Reference Framework (NSRF) and the Human Potential Operational Programme (POPH), co-financed by the European Fund and national funds from the Ministry of Science, Technology and Higher Education (MC-TES). ER has also received funding through DATAMARE, from Galicia Marine Science programme, which forms part of the Complementary Science Plans for Marine Science of Ministerio de Ciencia, Innovación and Universidades included in the Recovery, Transformation and Resilience Plan (PRTR-C17.I1), funded through Xunta de Galicia with NextGenerationEU and the European Maritime Fisheries and Aquaculture Funds. ER and RN were supported by MITECO programme for the Spanish Recovery, Transformation and Resilience Plan (European Union Recovery and Resilience Mechanism established by Regulation (EU) 2020/2094), funded by the European Union -NexGenerationEU-. CITIC is funded by the Xunta de Galicia through the collaboration agreement between the Regional Ministry of Culture, Education, Vocational Training and Universities and the Galician universities to strengthen the research centres of the Galician University System (CIGUS). Grant PID2021-126289OA-I00 funded by MCIN/AEI/10.13039/501100011033 and by ERDF A way of making Europe. This work was also partially supported by the Xunta de Galicia and the ERDF Funds A way of making Europe with grant (Ref. ED431C 2022/46). This work was supported by MAGIC project (PID2024-156623OB-C22) funded by MICIU/AEI/10.13039/501100011033 and by FEDER, UE; and by REDEIRA project (TED2021-132188B-I00) funded by MICIU/AEI/10.13039/501100011033 and by Unión Europea NextGenerationEU/PRTR . Part of the funding for this work also comes from the consolidation and structuring funds for Competitive Research Units of the GAIN-Xunta de Galicia, modality A: Competitive Reference Groups (IN607A2025-05). We thank to the editor and reviewers for their constructive comments and rapid response, which greatly helped us improve the manuscript. | |
| dc.description.sponsorship | Portugal. Fundação para a Ciência e a Tecnologia; UIDP/50017/2020 | |
| dc.description.sponsorship | Portugal. Fundação para a Ciência e a Tecnologia; UIDB/50017/2020 | |
| dc.description.sponsorship | Portugal. Fundação para a Ciência e a Tecnologia; LA/P/0094/2020 | |
| dc.description.sponsorship | Portugal. Fundação para a Ciência e a Tecnologia; PD/BD/143085/2018 | |
| dc.description.sponsorship | Xunta de Galicia; ED431C 2022/46 | |
| dc.description.sponsorship | Xunta de Galicia; IN607A2025-05 | |
| dc.description.uri | https://zenodo.org/records/17143117 | |
| dc.description.uri | https://github.com/AndresMolares/hab_featureSelection_study | |
| dc.identifier.citation | E. Rocruz, A. Molares-Ulloa, X. A. Padin, R. Nolasco, D. Rivero, E. Fernandez-Blanco, Y. Pazos and J. Dubert, "Advanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management", Ecological Informatics, Vol. 92, Dec. 2025, 103438, https://doi.org/10.1016/j.ecoinf.2025.103438 | |
| dc.identifier.doi | 10.1016/j.ecoinf.2025.103438 | |
| dc.identifier.issn | 1878-0512 | |
| dc.identifier.issn | 1574-9541 | |
| dc.identifier.uri | https://hdl.handle.net/2183/46029 | |
| dc.language.iso | eng | |
| dc.publisher | Elsevier | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-126289OA-I00/ES/TRACKING Y ANÁLISIS DEL COMPORTAMIENTO ANIMAL CON TÉCNICAS DE VISIÓN ARTIFICIAL Y DEEP LEARNING | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2024-156623OB-C22/ES/MODELADO E ANÁLISE DO CRECEMENTO DO FITOPLANCTON NAS RÍAS | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/TED2021-132188B-I00/ES/INVESTIGACION, DESARROLLO E INNOVACION DE UNA RED DE OBSERVACION COSTERA: RIA DE AROUSA | |
| dc.relation.uri | https://doi.org/10.1016/j.ecoinf.2025.103438 | |
| dc.rights.accessRights | open access | |
| dc.subject | Machine learning | |
| dc.subject | Harmful algal blooms | |
| dc.subject | Dinophysis acuminata | |
| dc.subject | 3D hydrodynamic model | |
| dc.subject | Feature selection | |
| dc.title | Advanced ocean modelling and machine learning to forecast Dinophysis acuminata blooms: A tool to support shellfish farming management | |
| dc.type | journal article | |
| dc.type.hasVersion | VoR | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | d8e10433-ea19-4a35-8cc6-0c7b9f143a6d | |
| relation.isAuthorOfPublication | 244a6828-de1c-45f3-86b6-69bb81250814 | |
| relation.isAuthorOfPublication.latestForDiscovery | d8e10433-ea19-4a35-8cc6-0c7b9f143a6d |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Rivero_Daniel_2025_Advanced_ocean_modelling_and_machine_learning_to_forecast.pdf
- Size:
- 3.97 MB
- Format:
- Adobe Portable Document Format

