GALIASdoc: Automatic Intermediate Language Generator for fast Syntactic Analysis over massive document sets
| UDC.coleccion | Investigación | |
| UDC.departamento | Ciencias da Computación e Tecnoloxías da Información | |
| UDC.grupoInv | Telemática | |
| dc.contributor.author | Dafonte, Carlos | |
| dc.contributor.author | Carneiro, Víctor | |
| dc.contributor.author | Gómez García, Ángel | |
| dc.contributor.author | Trabazo-Sardón, Diego | |
| dc.contributor.author | Santoveña, Raúl | |
| dc.contributor.author | Silvelo, Arturo | |
| dc.contributor.author | Nóvoa, Francisco | |
| dc.contributor.author | Fernández, Diego | |
| dc.contributor.author | Manteiga, Minia | |
| dc.date.accessioned | 2026-01-23T10:45:52Z | |
| dc.date.available | 2026-01-23T10:45:52Z | |
| dc.date.issued | 2020 | |
| dc.description | Registration of the intellectual property (of a software) | |
| dc.description.abstract | [Abstract]: The GALIASdoc software is a system for extracting relevant information from large volumes of documents with common formats and heterogeneous origins. The data obtained are ready to be exploited by other applications such as content management systems (CMS), enterprise resource planning (ERP) systems, databases, and similar platforms. The system is responsible for identifying the document model in order to locate the semantic information it contains. During the ingestion process, an initial version in text format is obtained, applying optical character recognition (OCR) techniques when necessary. The model includes geometric data defining the areas of interest presented in the document. This record has been in operational use since 2020 through the signing of two exploitation contracts with companies in the ICT sector. | |
| dc.identifier.uri | https://hdl.handle.net/2183/47072 | |
| dc.language.iso | eng | |
| dc.rights | Right holders: Universidade da Coruña (100%) | |
| dc.rights.accessRights | open access | |
| dc.subject | Information extraction | |
| dc.subject | Document processing | |
| dc.subject | Software | |
| dc.title | GALIASdoc: Automatic Intermediate Language Generator for fast Syntactic Analysis over massive document sets | |
| dc.type | other | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | c3c2021f-0b5d-408f-afff-ec09ab5eaeee | |
| relation.isAuthorOfPublication | 652c136c-eea5-4a78-947c-538b1c99f81b | |
| relation.isAuthorOfPublication | 29e6d257-7aab-4d8c-bf2d-007f2edffb9d | |
| relation.isAuthorOfPublication | abfb4c11-222e-48e0-9374-2fd0261c519f | |
| relation.isAuthorOfPublication | 6f38fb90-68db-4d7c-89e0-8cff7f9d673c | |
| relation.isAuthorOfPublication | 9b9fbda3-512a-4143-986b-c7b60305e041 | |
| relation.isAuthorOfPublication | ac152b53-40d7-47ed-a5d2-036b0374adb7 | |
| relation.isAuthorOfPublication.latestForDiscovery | c3c2021f-0b5d-408f-afff-ec09ab5eaeee |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- CarneiroDiaz_Victor_2020_GaliasDOC.pdf
- Size:
- 98.58 KB
- Format:
- Adobe Portable Document Format

