Evaluating Model-based Trees in Practice

A recently suggested algorithm for recursive partitioning of statistical models (Zeileis, Hothorn and Hornik, 2005), such as models estimated by maximum likelihood or least squares, is evaluated in practice. The general algorithm is applied to linear regression, logistic regression and survival regression, and illustrated on economic and medical regression problems. Furthermore, its prediction quality and model complexity are compared in a benchmark study with a large collection of other tree-based algorithms, showing that the algorithm yields interpretable trees that are competitive with previously suggested approaches.
