Discovering the hidden structure of house prices with a non-parametric latent manifold model

In many regression problems, the variable to be predicted depends not only on a sample-specific feature vector, but also on an unknown (latent) manifold that must satisfy known constraints. An example is house prices, which depend on the characteristics of the house, and on the desirability of the neighborhood, which is not directly measurable. The proposed method comprises two trainable components. The first one is a parametric model that predicts the "intrinsic" price of the house from its description. The second one is a smooth, non-parametric model of the latent "desirability" manifold. The predicted price of a house is the product of its intrinsic price and desirability. The two components are trained simultaneously using a deterministic form of the EM algorithm. The model was trained on a large dataset of houses from Los Angeles county. It produces better predictions than pure parametric and non-parametric models. It also produces useful estimates of the desirability surface at each location.

[1]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[2]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[3]  A. Goodman Hedonic prices, price indices and housing markets , 1978 .

[4]  R. Halvorsen,et al.  Choice of functional form for hedonic price equations , 1981 .

[5]  Aysegul Can The Measurement of Neighborhood Dynamics in Urban House Prices , 1990 .

[6]  R. Meese,et al.  Nonparametric Estimation of Dynamic Hedonic Price Models and the Construction of Residential Housing Price Indices , 1991 .

[7]  R. Dubin Spatial autocorrelation and neighborhood quality , 1992 .

[8]  Aysegul Can Specification and estimation of hedonic housing price models , 1992 .

[9]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[10]  P. Anglin,et al.  SEMIPARAMETRIC ESTIMATION OF A HEDONIC PRICE FUNCTION , 1996 .

[11]  Otis W. Gilley,et al.  Using the Spatial Configuration of the Data to Improve Estimation , 1997 .

[12]  Ronald P. Barry,et al.  Spatiotemporal Autoregressive Models of Neighborhood Effects , 1998 .

[13]  S. Basu,et al.  Analysis of Spatial Autocorrelation in House Prices , 1998 .

[14]  Nghiep Nguyen,et al.  Predicting Housing Value: A Comparison of Multiple Regression Analysis and Artificial Neural Networks , 2001 .

[15]  T. Kauko Modelling the locational determinants of house prices: neural network and value tree approaches , 2002 .

[16]  J. Clapp A Semi Parametric Method for Estimating Local House Price Indices , 2003 .

[17]  A. Gelfand,et al.  The Dynamics of Location in Home Price , 2004 .

[18]  Fu Jie Huang,et al.  A Tutorial on Energy-Based Learning , 2006 .