Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation

UDC.coleccionInvestigaciónes_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.grupoInvInformation Retrieval Lab (IRlab)es_ES
UDC.issue1es_ES
UDC.journalTitleEurasip Journal on Audio, Speech, and Music Processinges_ES
UDC.startPage13es_ES
UDC.volume2019es_ES
dc.contributor.authorTejedor, Javier
dc.contributor.authorToledano, Doroteo T.
dc.contributor.authorLópez-Otero, Paula
dc.contributor.authorDocío-Fernández, Laura
dc.contributor.authorPeñagarikano, Mikel
dc.contributor.authorRodríguez-Fuentes, Luis Javier
dc.contributor.authorMoreno-Sandoval, Antonio
dc.date.accessioned2019-09-25T14:29:29Z
dc.date.available2019-09-25T14:29:29Z
dc.date.issued2019-07-19
dc.description.abstract[Abstract] The huge amount of information stored in audio and video repositories makes search on speech (SoS) a priority area nowadays. Within SoS, Query-by-Example Spoken Term Detection (QbE STD) aims to retrieve data from a speech repository given a spoken query. Research on this area is continuously fostered with the organization of QbE STD evaluations. This paper presents a multi-domain internationally open evaluation for QbE STD in Spanish. The evaluation aims at retrieving the speech files that contain the queries, providing their start and end times, and a score that reflects the confidence given to the detection. Three different Spanish speech databases that encompass different domains have been employed in the evaluation: MAVIR database, which comprises a set of talks from workshops; RTVE database, which includes broadcast television (TV) shows; and COREMAH database, which contains 2-people spontaneous speech conversations about different topics. The evaluation has been designed carefully so that several analyses of the main results can be carried out. We present the evaluation itself, the three databases, the evaluation metrics, the systems submitted to the evaluation, the results, and the detailed post-evaluation analyses based on some query properties (within-vocabulary/out-of-vocabulary queries, single-word/multi-word queries, and native/foreign queries). Fusion results of the primary systems submitted to the evaluation are also presented. Three different teams took part in the evaluation, and ten different systems were submitted. The results suggest that the QbE STD task is still in progress, and the performance of these systems is highly sensitive to changes in the data domain. Nevertheless, QbE STD strategies are able to outperform text-based STD in unseen data domains.es_ES
dc.description.sponsorshipCentro singular de investigación de Galicia; ED431G/04es_ES
dc.description.sponsorshipUniversidad del País Vasco; GIU16/68es_ES
dc.description.sponsorshipMinisterio de Economía y Competitividad; TEC2015-68172-C2-1-Pes_ES
dc.description.sponsorshipMinisterio de Ciencia, Innovación y Competitividad; RTI2018-098091-B-I00es_ES
dc.description.sponsorshipXunta de Galicia; ED431G/01es_ES
dc.identifier.citationTejedor, Javier, et al. Search on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluation. EURASIP Journal on Audio, Speech, and Music Processing, 2019, vol. 2019, no 1, p. 13.es_ES
dc.identifier.doi10.1186/s13636-019-0156-x
dc.identifier.issn1687-4714
dc.identifier.issn1687-4722
dc.identifier.urihttp://hdl.handle.net/2183/23979
dc.language.isoenges_ES
dc.publisherSpringerOpenes_ES
dc.relation.urihttps://doi.org/10.1186/s13636-019-0156-xes_ES
dc.rightsAtribución 3.0 Españaes_ES
dc.rights.accessRightsopen accesses_ES
dc.rights.urihttp://creativecommons.org/licenses/by/3.0/es/*
dc.subjectQuery-by-Example Spoken Term Detectiones_ES
dc.subjectInternational evaluationes_ES
dc.subjectSpanish languagees_ES
dc.subjectSearch on speeches_ES
dc.titleSearch on speech from spoken queries: the Multi-domain International ALBAYZIN 2018 Query-by-Example Spoken Term Detection Evaluationes_ES
dc.typejournal articlees_ES
dspace.entity.typePublication
relation.isAuthorOfPublicationeec0c53b-d226-4e1f-b2f8-1d9719fa3b0a
relation.isAuthorOfPublication.latestForDiscoveryeec0c53b-d226-4e1f-b2f8-1d9719fa3b0a

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
J.Tejedor_2019_Search_on_speech_from_spoken_queries.pdf
Size:
3.46 MB
Format:
Adobe Portable Document Format
Description: