A general bootstrap performance diagnostic

As datasets become larger, more complex, and more available to diverse groups of analysts, it would be quite useful to be able to automatically and generically assess the quality of estimates, much as we are able to automatically train and evaluate predictive models such as classifiers. However, despite the fundamental importance of estimator quality assessment in data analysis, this task has eluded highly automatic solutions. While the bootstrap provides perhaps the most promising step in this direction, its level of automation is limited by the difficulty of evaluating its finite sample performance and even its asymptotic consistency. Thus, we present here a general diagnostic procedure which directly and automatically evaluates the accuracy of the bootstrap's outputs, determining whether or not the bootstrap is performing satisfactorily when applied to a given dataset and estimator. We show that our proposed diagnostic is effective via an extensive empirical evaluation on a variety of estimators and simulated and real datasets, including a real-world query workload from Conviva, Inc. involving 1.7TB of data (i.e., approximately 0.5 billion data points).

[1]  D. Freedman,et al.  Some Asymptotic Theory for the Bootstrap , 1981 .

[2]  H. Künsch The Jackknife and the Bootstrap for General Stationary Observations , 1989 .

[3]  E. Giné,et al.  Bootstrapping General Empirical Measures , 1990 .

[4]  Regina Y. Liu Moving blocks jackknife and bootstrap capture weak dependence , 1992 .

[5]  B. Efron Jackknife‐After‐Bootstrap Standard Errors and Influence Functions , 1992 .

[6]  Joseph P. Romano,et al.  The stationary bootstrap , 1994 .

[7]  E. Mammen,et al.  On General Resampling Algorithms and their Performance in Distribution Estimation , 1994 .

[8]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[9]  H. Putter,et al.  Resampling: Consistency of Substitution Estimators , 1996 .

[10]  R. Beran Diagnosing Bootstrap Success , 1997 .

[11]  E. Mammen The Bootstrap and Edgeworth Expansion , 1997 .

[12]  Arnold J Stromberg,et al.  Subsampling , 2001, Technometrics.

[13]  Angelo J. Canty,et al.  Bootstrap diagnostics and remedies , 2006 .

[14]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[15]  David Hinkley,et al.  Bootstrap Methods: Another Look at the Jackknife , 2008 .

[16]  F. Götze,et al.  RESAMPLING FEWER THAN n OBSERVATIONS: GAINS, LOSSES, AND REMEDIES FOR LOSSES , 2012 .

[17]  Purnamrita Sarkar,et al.  The Big Data Bootstrap , 2012, ICML.

[18]  Carlo Zaniolo,et al.  Early Accurate Results for Advanced Analytics on MapReduce , 2012, Proc. VLDB Endow..

[19]  Ion Stoica,et al.  BlinkDB: queries with bounded errors and bounded response times on very large data , 2012, EuroSys '13.