Skip navigation
  •  Home
  • UDC 
    • Getting started
    • RUC Policies
    • FAQ
    • FAQ on Copyright
    • More information at INFOguias UDC
  • Browse 
    • Communities
    • Browse by:
    • Issue Date
    • Author
    • Title
    • Subject
  • Help
    • español
    • Gallegan
    • English
  • Login
  •  English 
    • Español
    • Galego
    • English
  
View Item 
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
  •   DSpace Home
  • Facultade de Informática
  • Investigación (FIC)
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Bagging cross-validated bandwidths with application to big data

Thumbnail
View/Open
Barreiro_Ures_Daniel_2021_Bagging_cross_validated_bandwidths_with_application_to_big_data.pdf (447.8Kb)
Use this link to cite
http://hdl.handle.net/2183/34333
Collections
  • Investigación (FIC) [1701]
Metadata
Show full item record
Title
Bagging cross-validated bandwidths with application to big data
Author(s)
Barreiro-Ures, Daniel
Cao, Ricardo
Francisco-Fernández, Mario
Hart, Jeffrey D.
Date
2021
Citation
D Barreiro-Ures, R Cao, M Francisco-Fernández, J D Hart, Bagging cross-validated bandwidths with application to big data, Biometrika, Volume 108, Issue 4, December 2021, Pages 981–988, https://doi.org/10.1093/biomet/asaa092
Is version of
https://doi.org/10.1093/biomet/asaa092
Abstract
Hall & Robinson (2009) proposed and analysed the use of bagged cross-validation to choose the band-width of a kernel density estimator. They established that bagging greatly reduces the noise inherent in ordinary cross-validation, and hence leads to a more efficient bandwidth selector. The asymptotic theory of Hall & Robinson (2009) assumes that N , the number of bagged subsamples, is ∞. We expand upon their theoretical results by allowing N to be finite, as it is in practice. Our results indicate an important difference in the rate of convergence of the bagged cross-validation bandwidth for the cases N = ∞ and N < ∞. Simulations quantify the improvement in statistical efficiency and computational speed that can result from using bagged cross-validation as opposed to a binned implementation of ordinary cross-validation. The performance of the bagged bandwidth is also illustrated on a real, very large, dataset. Finally, a byproduct of our study is the correction of errors appearing in the Hall & Robinson (2009) expression for the asymptotic mean squared error of the bagging selector
Keywords
Bagging
Bandwidth
Big data
Cross-validation
Kernel density
 
Description
Versión final aceptada de: https://doi.org/10.1093/biomet/asaa092
 
This is a pre-copyedited, author-produced version of an article accepted for publication in [insert journal title] following peer review. The version of record of: D Barreiro-Ures, R Cao, M Francisco-Fernández, J D Hart, Bagging cross-validated bandwidths with application to big data, Biometrika, Volume 108, Issue 4, December 2021, Pages 981– 988, https://doi.org/10.1093/biomet/asaa092, published by Oxford University Press, is available online at: https:// doi.org/10.1093/biomet/asaa092.
 
Editor version
https://doi.org/10.1093/biomet/asaa092
Rights
Todos os dereitos reservados. All rights reserved.

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic DegreeThis CollectionBy Issue DateAuthorsTitlesSubjectsResearch GroupAcademic Degree

My Account

LoginRegister

Statistics

View Usage Statistics
Sherpa
OpenArchives
OAIster
Scholar Google
UNIVERSIDADE DA CORUÑA. Servizo de Biblioteca.    DSpace Software Copyright © 2002-2013 Duraspace - Send Feedback