Mostrar o rexistro simple do ítem

dc.contributor.authorParapar, Javier
dc.contributor.authorLosada, David E.
dc.contributor.authorPresedo-Quindimil, Manuel-Antonio
dc.contributor.authorBarreiro, Álvaro
dc.date.accessioned2019-02-13T15:03:42Z
dc.date.available2019-02-13T15:03:42Z
dc.date.issued2019-01-11
dc.identifier.issn2330-1643
dc.identifier.urihttp://hdl.handle.net/2183/21729
dc.descriptionPreprint of our Journal of the Association for Information Science and Technology (JASIST) paperes_ES
dc.description.abstract[Abstract] Statistical significance tests can provide evidence that the observed difference in performance between two methods is not due to chance. In Information Retrieval, some studies have examined the validity and suitability of such tests for comparing search systems.We argue here that current methods for assessing the reliability of statistical tests suffer from some methodological weaknesses, and we propose a novel way to study significance tests for retrieval evaluation. Using Score Distributions, we model the output of multiple search systems, produce simulated search results from such models, and compare them using various significance tests. A key strength of this approach is that we assess statistical tests under perfect knowledge about the truth or falseness of the null hypothesis. This new method for studying the power of significance tests in Information Retrieval evaluation is formal and innovative. Following this type of analysis, we found that both the sign test and Wilcoxon signed test have more power than the permutation test and the t-test. The sign test and Wilcoxon signed test also have a good behavior in terms of type I errors. The bootstrap test shows few type I errors, but it has less power than the other methods tested.es_ES
dc.description.sponsorshipMinisterio de Econom´ıa y Competitividad; TIN2015-64282-R
dc.description.sponsorshipXunta de Galicia; GPC 2016/035
dc.description.sponsorshipXunta de Galicia; ED431G/01
dc.description.sponsorshipXunta de Galicia; ED431G/08
dc.language.isoenges_ES
dc.publisherWilleyes_ES
dc.relation.urihttps://onlinelibrary.wiley.com/journal/23301643es_ES
dc.subjectInformation retrievales_ES
dc.subjectStatistical testes_ES
dc.subjectSignificance testinges_ES
dc.subjectWilcoxones_ES
dc.subjectPermutationes_ES
dc.subjectSignes_ES
dc.subjectBootstrapes_ES
dc.subjectT-ttestes_ES
dc.titleUsing score distributions to compare statistical significance tests for information retrieval evaluationes_ES
dc.title.alternativeCompare statistical significance tests for information retrieval evaluationes_ES
dc.typeinfo:eu-repo/semantics/preprintes_ES
dc.rights.accessinfo:eu-repo/semantics/openAccesses_ES
UDC.journalTitleJournal of the Association for Information Science and Technologyes_ES
UDC.volumetbpes_ES
UDC.issuetbpes_ES


Ficheiros no ítem

Thumbnail

Este ítem aparece na(s) seguinte(s) colección(s)

Mostrar o rexistro simple do ítem