CA-GNN: A Competence-Aware Graph Neural Network for Semi-Supervised Learning on Streaming Data
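We propose CA-GNN, a competence-aware graph neural network that uses a competence model to assess data reliability and dynamically adjusts its parameters, addressing the problems of unreliable labels and static graph structure in streaming data; it outperforms existing methods on multiple datasets.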
One challenge of learning from streaming data is that only a limited number of labeled examples are available, which makes semi-supervised learning (SSL) algorithms an efficient tool for streaming data mining. Recently, graph-based SSL algorithms have been proposed to improve SSL performance, because a graph structure can exploit the interactions between neighboring nodes. However, graph-based SSL algorithms have two main limitations when applied to streaming data. First, not all labels in streaming data are reliable, so classifying directly on the graph can lead to suboptimal performance. Second, graph-based SSL algorithms assume that the graph structure is static, whereas the learning environment of streaming data is dynamic. Hence, we propose a competence-aware graph neural network (CA-GNN) to address these two limitations. Unlike other models, CA-GNN does not rely directly on graph information that could include mislabeled nodes. Instead, a competence model explores latent semantic correlations in the streaming data and captures the reliability of each data point. A streaming learning strategy then evolves CA-GNN's parameters to capture the dynamics of the graph sequence. We conducted experiments on seven real datasets and four synthetic datasets and compared the results with those of various methods. The results demonstrate that CA-GNN classifies streaming data more effectively than state-of-the-art (SOTA) methods.
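To make the first limitation concrete, the following is a minimal illustrative sketch and not the CA-GNN architecture. It assumes plain label propagation in place of a graph neural network and hand-assigned reliability scores in place of a learned competence model. The function `propagate_labels`, the toy graph, and all variable names are hypothetical. It shows how a mislabeled node affects graph-based classification depending on how much its label is trusted.

```python
from typing import Dict, List


def propagate_labels(
    neighbors: Dict[int, List[int]],
    labels: Dict[int, int],
    reliability: Dict[int, float],
    num_classes: int,
    num_iters: int = 100,
) -> Dict[int, int]:
    """Reliability-weighted label propagation (illustrative assumption only).

    Each labeled node mixes its own one-hot label (weight r) with the
    average class scores of its neighbours (weight 1 - r). Unlabeled
    nodes simply take the neighbour average.
    """
    # Start every node from a uniform class distribution.
    scores = {n: [1.0 / num_classes] * num_classes for n in neighbors}
    for _ in range(num_iters):
        updated = {}
        for node, nbrs in neighbors.items():
            if nbrs:
                avg = [
                    sum(scores[nb][c] for nb in nbrs) / len(nbrs)
                    for c in range(num_classes)
                ]
            else:
                avg = scores[node]  # isolated node keeps its own scores
            if node in labels:
                # Labeled nodes without a score are treated as fully trusted.
                r = reliability.get(node, 1.0)
                one_hot = [0.0] * num_classes
                one_hot[labels[node]] = 1.0
                updated[node] = [
                    r * one_hot[c] + (1.0 - r) * avg[c]
                    for c in range(num_classes)
                ]
            else:
                updated[node] = avg
        scores = updated
    # Predict the class with the highest score for every node.
    return {
        n: max(range(num_classes), key=lambda c: scores[n][c])
        for n in neighbors
    }


# Toy graph: two triangles {0, 1, 2} and {3, 4, 5} joined by the edge 2-3.
neighbors = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
             3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
# Node 1 sits in the class-0 triangle but carries a (wrong) class-1 label.
labels = {0: 0, 1: 1, 5: 1}

# Case A: the noisy label on node 1 is given low reliability.
pred_low = propagate_labels(neighbors, labels, {0: 0.9, 1: 0.1, 5: 0.9}, 2)
# Case B: the noisy label on node 1 is trusted as much as the others.
pred_high = propagate_labels(neighbors, labels, {0: 0.9, 1: 0.9, 5: 0.9}, 2)

print("low trust in node 1: ", pred_low)
print("high trust in node 1:", pred_high)
```

In this toy setting, a low reliability score lets the neighborhood override the wrong label on node 1, whereas a high score preserves the error. A per-sample reliability estimate is what the abstract attributes to the competence model; how CA-GNN actually computes and uses it is not specified here.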