🌙

基于多模态大语言模型的图结构组合问题视觉进化优化:以影响力最大化为例

Visual Evolutionary Optimization on Graph-Structured Combinatorial Problems With MLLMs: A Case Study of Influence Maximization

IEEE Transactions on Evolutionary Computation · 2025
被引 4
ABS 4

中文导读

提出视觉进化优化框架,利用多模态大语言模型将网络解表示为图像,通过图像处理能力实现类人求解,在影响力最大化任务上验证了优于传统进化算法的效果。

Abstract

Graph-structured combinatorial problems in complex networks are prevalent in many domains, and are computationally demanding due to their complexity and non-linear nature. Traditional evolutionary algorithms (EAs), while robust, often face obstacles due to content-shallow encoding limitations and lack of structural awareness, necessitating hand-crafted modifications for effective application. In this work, we introduce an original framework, visual evolutionary optimization (VEO), leveraging multimodal large language models (MLLMs) as the backbone evolutionary optimizer in this context. Specifically, we propose a context-aware encoding scheme, representing the solution of the network as an image. In this manner, we can utilize MLLMs’ image processing capabilities to intuitively comprehend network configurations, thus enabling machines to solve these problems in a human-like way. We develop MLLM-based operators tailored for various evolutionary optimization stages, including initialization, crossover, and mutation. Furthermore, we propose that graph sparsification can effectively enhance the applicability and scalability of VEO on large-scale networks, owing to the scale-free nature of real-world networks. We demonstrate the effectiveness of our method using a well-known task in complex networks, influence maximization, and validate it on eight different realworld networks of various structures. The results confirm VEO’s reliability and enhanced effectiveness compared to traditional evolutionary optimization.

组合优化进化算法多模态大语言模型复杂网络影响力最大化