Collaborative Ontology Matching With Dual Population Genetic Programming and Active Meta-Learning
提出一种双种群遗传编程与主动元学习相结合的方法,构建高质量相似性特征,提升本体匹配的准确性和效率,适用于知识集成与信息交换场景。
Ontology provides a structured language to encapsulate domain-specific knowledge and harmonize diverse data. Ontology matching identifies similar entities in distinct ontologies, facilitating knowledge integration and information exchange. Similarity features are crucial for ontology matching by measuring entity resemblance, but noisy and redundant features can obscure relevant ones, reducing matching quality. To improve the accuracy of matching results, we propose a dual population genetic programming with an active meta-learning to build a high-quality similarity feature, which owns three novel components. First, a dual population genetic programming is developed to construct high-level similarity feature with a two-layer individual representation, a dual population based co-evolutionary mechanism, and a novel fitness function based on partial standard alignment. Second, a new active learning model is presented to update the partial standard alignment through an efficient interactive procedure, guiding the algorithm towards building more reliable similarity features. Finally, a weighted random forest meta-learning model is designed to train the expert vote aggregation model with their historical behaviors, and fine-tunes the model’s performance with a compact genetic algorithm. Experimental results on the Ontology Alignment Evaluation Initiative’s interactive matching tasks demonstrate that our method consistently achieves higher accuracy and better efficiency compared to advanced matching techniques across various expert error rates.