经验相似性

Empirical Similarity

Review of Economics and Statistics · 2006
被引 82
人大 AFT50ABS 4

中文导读

公理化了一种基于相似性加权平均的预测规则,并开发了从历史数据中估计相似性函数的统计推断工具,适用于连续和离散变量。

Abstract

An agent is asked to assess a real-valued variable Y p based on certain characteristics X p = (X p -super-1, ..., X p -super-m), and on a database consisting of X i -super-1, ... X i -super-m, Y i ) for i = 1, ..., n. A possible approach to combine past observations of X and Y with the current values of X to generate an assessment of Y is similarity-weighted averaging. It suggests that the predicted value of Y, Ȳ p -super-s, be the weighted average of all previously observed values Y i , where the weight of Y i for every i = 1, ..., n, is the similarity between the vector X p -super-1, ..., X p -super-m, associated with Y p , and the previously observed vector, X i -super-1, ..., X i -super-m. We axiomatize this rule. We assume that, given every database, a predictor has a ranking over possible values, and we show that certain reasonable conditions on these rankings imply that they are determined by the proximity to a similarity-weighted average for a certain similarity function. The axiomatization does not suggest a particular similarity function, or even a particular form of this function. We therefore proceed to suggest that the similarity function be estimated from past observations.We develop tools of statistical inference for parametric estimation of the similarity function, for the case of a continuous as well as a discrete variable. Finally, we discuss the relationship of the proposed method to other methods of estimation and prediction. Copyright by the President and Fellows of Harvard College and the Massachusetts Institute of Technology.

相似性加权平均公理化相似性函数参数估计