GALIASdoc: Automatic Intermediate Language Generator for fast Syntactic Analysis over massive document sets

UDC.coleccionInvestigación
UDC.departamentoCiencias da Computación e Tecnoloxías da Información
UDC.grupoInvTelemática
dc.contributor.authorDafonte, Carlos
dc.contributor.authorCarneiro, Víctor
dc.contributor.authorGómez García, Ángel
dc.contributor.authorTrabazo-Sardón, Diego
dc.contributor.authorSantoveña, Raúl
dc.contributor.authorSilvelo, Arturo
dc.contributor.authorNóvoa, Francisco
dc.contributor.authorFernández, Diego
dc.contributor.authorManteiga, Minia
dc.date.accessioned2026-01-23T10:45:52Z
dc.date.available2026-01-23T10:45:52Z
dc.date.issued2020
dc.descriptionRegistration of the intellectual property (of a software)
dc.description.abstract[Abstract]: The GALIASdoc software is a system for extracting relevant information from large volumes of documents with common formats and heterogeneous origins. The data obtained are ready to be exploited by other applications such as content management systems (CMS), enterprise resource planning (ERP) systems, databases, and similar platforms. The system is responsible for identifying the document model in order to locate the semantic information it contains. During the ingestion process, an initial version in text format is obtained, applying optical character recognition (OCR) techniques when necessary. The model includes geometric data defining the areas of interest presented in the document. This record has been in operational use since 2020 through the signing of two exploitation contracts with companies in the ICT sector.
dc.identifier.urihttps://hdl.handle.net/2183/47072
dc.language.isoeng
dc.rightsRight holders: Universidade da Coruña (100%)
dc.rights.accessRightsopen access
dc.subjectInformation extraction
dc.subjectDocument processing
dc.subjectSoftware
dc.titleGALIASdoc: Automatic Intermediate Language Generator for fast Syntactic Analysis over massive document sets
dc.typeother
dspace.entity.typePublication
relation.isAuthorOfPublicationc3c2021f-0b5d-408f-afff-ec09ab5eaeee
relation.isAuthorOfPublication652c136c-eea5-4a78-947c-538b1c99f81b
relation.isAuthorOfPublication29e6d257-7aab-4d8c-bf2d-007f2edffb9d
relation.isAuthorOfPublicationabfb4c11-222e-48e0-9374-2fd0261c519f
relation.isAuthorOfPublication6f38fb90-68db-4d7c-89e0-8cff7f9d673c
relation.isAuthorOfPublication9b9fbda3-512a-4143-986b-c7b60305e041
relation.isAuthorOfPublicationac152b53-40d7-47ed-a5d2-036b0374adb7
relation.isAuthorOfPublication.latestForDiscoveryc3c2021f-0b5d-408f-afff-ec09ab5eaeee

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CarneiroDiaz_Victor_2020_GaliasDOC.pdf
Size:
98.58 KB
Format:
Adobe Portable Document Format