Mostrar o rexistro simple do ítem

dc.contributor.authorRosado, Eduardo
dc.contributor.authorGarcía Remesal, Miguel
dc.contributor.authorParaiso-Medina, Sergio
dc.contributor.authorPazos, A.
dc.contributor.authorMaojo García, Victor Manuel
dc.date.accessioned2021-04-15T13:50:51Z
dc.date.available2021-04-15T13:50:51Z
dc.date.issued2021-02-25
dc.identifier.citationRosado, E., Garcia-Remesal, M., Paraiso-Medina, S., Pazos, A., & Maojo, V. (2021). Using Machine Learning to Collect and Facilitate Remote Access to Biomedical Databases: Development of the Biomedical Database Inventory. JMIR Medical Informatics, 9(2), e22976.es_ES
dc.identifier.issn2291-9694
dc.identifier.urihttp://hdl.handle.net/2183/27757
dc.description.abstract[Abstract] Background: Currently, existing biomedical literature repositories do not commonly provide users with specific means to locate and remotely access biomedical databases. Objective: To address this issue, we developed the Biomedical Database Inventory (BiDI), a repository linking to biomedical databases automatically extracted from the scientific literature. BiDI provides an index of data resources and a path to access them seamlessly. Methods: We designed an ensemble of deep learning methods to extract database mentions. To train the system, we annotated a set of 1242 articles that included mentions of database publications. Such a data set was used along with transfer learning techniques to train an ensemble of deep learning natural language processing models targeted at database publication detection. Results: The system obtained an F1 score of 0.929 on database detection, showing high precision and recall values. When applying this model to the PubMed and PubMed Central databases, we identified over 10,000 unique databases. The ensemble model also extracted the weblinks to the reported databases and discarded irrelevant links. For the extraction of weblinks, the model achieved a cross-validated F1 score of 0.908. We show two use cases: one related to “omics” and the other related to the COVID-19 pandemic. Conclusions: BiDI enables access to biomedical resources over the internet and facilitates data-driven research and other scientific initiatives. The repository is openly available online and will be regularly updated with an automatic text processing pipeline. The approach can be reused to create repositories of different types (ie, biomedical and others).es_ES
dc.description.sponsorshipProyecto colaborativo de integración de datos genómicos; PI17/01561es_ES
dc.language.isoenges_ES
dc.publisherJ M I R Publications, Inc.es_ES
dc.relation.urihttps://doi.org/10.2196/22976es_ES
dc.rightsAtribución 3.0 Españaes_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectBiomedical databaseses_ES
dc.subjectNatural language processinges_ES
dc.subjectDeep learninges_ES
dc.subjectBiomedical knowledgees_ES
dc.subjectInternetes_ES
dc.titleUsing Machine Learning to Collect and Facilitate Remote Access to Biomedical Databases: Development of the Biomedical Database Inventoryes_ES
dc.typeinfo:eu-repo/semantics/articlees_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleJMIR Medical Informaticses_ES
UDC.volume9es_ES
UDC.issue2es_ES
dc.identifier.doi10.2196/22976


Ficheiros no ítem

Thumbnail
Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem