Efficient Learning of Quadratic Variance Function Directed Acyclic Graphs via Topological Layers
研究了一类非高斯有向无环图模型,其中每个节点的条件方差是条件均值的二次函数,涵盖泊松、二项、几何等分布。提出拓扑层概念,开发高效学习算法,先分层重建拓扑结构再恢复边,计算成本低于现有方法,并在NBA球员数据和阿里化妆品销售数据上验证了优势。
Directed acyclic graph (DAG) models are widely used to represent casual relationships among random variables in many application domains. This article studies a special class of non-Gaussian DAG models, where the conditional variance of each node given its parents is a quadratic function of its conditional mean. Such a class of non-Gaussian DAG models are fairly flexible and admit many popular distributions as special cases, including Poisson, Binomial, Geometric, Exponential, and Gamma. To facilitate learning, we introduce a novel concept of topological layers, and develop an efficient DAG learning algorithm. It first reconstructs the topological layers in a hierarchical fashion and then recovers the directed edges between nodes in different layers, which requires much less computational cost than most existing algorithms in literature. Its advantage is also demonstrated in a number of simulated examples, as well as its applications to two real-life datasets, including an NBA player statistics data and a cosmetic sales data collected by Alibaba. Supplementary materials for this article are available online.