Data‐sparse price indexes by spatio‐temporal regularization and PCA: An application to the Australian housing market
提出两种新方法克服数据稀疏的本地房价指数局限,结合成简单高效可解释的模型,发现澳大利亚市场可由少数全国性因素描述。
Abstract We present two novel approaches to overcome the limitations of data‐sparse local house price indexes and combine them into a single model pipeline that is simple, computationally efficient, and interpretable. The first contribution is a new spatio‐temporal regularization of least squares dummy variable models, such as the repeat sales regression used here. This regularization encodes prior knowledge of the proximity of houses in space and their sales in time. It handles missing values in a natural way. The second is nonlocal regularization using truncated principal component analysis (PCA) applied to the resulting national collection of local price indexes. The PCA loadings show that there are important underlying socioeconomic factors that can be leveraged in the construction of Australian market indexes. This PCA reveals important socioeconomic factors, showing that many local markets can be described by a few broad aspects of the national market, consisting of a general trend that contrasts regions influenced by the mining industry with Sydney and Melbourne, and another trend that highlights lifestyle.