社交网络标签的统计模型

A Statistical Model for Social Network Labeling

Journal of Business & Economic Statistics · 2015

被引 14

人大 AABS 4

Danyang Huang · 中国人民大学
Jun Yin · 北京大学
Hansheng Wang · 北京大学
Tao Shi · 俄亥俄州立大学

中文导读

针对社交网络中节点自创标签与网络结构的联合行为，提出一个基于p1模型的完整数据模型及其条件版本，后者计算成本更低且对模型误设更稳健，适用于在线营销等场景。

Abstract

We consider a social network from which one observes not only network structure (i.e., nodes and edges) but also a set of labels (or tags, keywords) for each node (or user). These labels are self-created and closely related to the user’s career status, life style, personal interests, and many others. Thus, they are of great interest for online marketing. To model their joint behavior with network structure, a complete data model is developed. The model is based on the classical p1 model but allows the reciprocation parameter to be label-dependent. By focusing on connected pairs only, the complete data model can be generalized into a conditional model. Compared with the complete data model, the conditional model specifies only the conditional likelihood for the connected pairs. As a result, it suffers less risk from model misspecification. Furthermore, because the conditional model involves connected pairs only, the computational cost is much lower. The resulting estimator is consistent and asymptotically normal. Depending on the network sparsity level, the convergence rate could be different. To demonstrate its finite sample performance, numerical studies (based on both simulated and real datasets) are presented.

社交网络标签p1模型条件似然网络稀疏性

阅读原文 ↗