一种用于神经自适应大邻域搜索的图强化学习框架

A Graph Reinforcement Learning Framework for Neural Adaptive Large Neighbourhood Search

Computers and Operations Research · 2024

被引 21 · 同刊同年前 6%

ABS 3

Syu-Ning Johnn 通讯
Victor-Alexandru Darvariu
Julia Handl
Jörg Kalcsics

中文导读

将自适应大邻域搜索中的算子选择问题建模为马尔可夫决策过程，提出基于图强化学习的GRLOS方法和轻量级LRW方法，在5个路径规划问题上用28个破坏算子和7个修复算子验证，均优于经典选择机制。

Abstract

Adaptive Large Neighbourhood Search (ALNS) is a popular metaheuristic with renowned efficiency in solving combinatorial optimisation problems. However, despite 18 years of intensive research into ALNS, the design of an effective adaptive layer for selecting operators to improve the solution remains an open question. In this work, we isolate this problem by formulating it as a Markov Decision Process, in which an agent is rewarded proportionally to the improvement of the incumbent. We propose Graph Reinforcement Learning for Operator Selection (GRLOS), a method based on Deep Reinforcement Learning and Graph Neural Networks, as well as Learned Roulette Wheel (LRW), a lightweight approach inspired by the classic Roulette Wheel adaptive layer. The methods, which are broadly applicable to optimisation problems that can be represented as graphs, are comprehensively evaluated on 5 routing problems using a large portfolio of 28 destroy and 7 repair operators. Results show that both GRLOS and LRW outperform the classic selection mechanism in ALNS, owing to the operator choices being learned in a prior training phase. GRLOS is also shown to consistently achieve better performance than a recent Deep Reinforcement Learning method due to its substantially more flexible state representation. The evaluation further examines the impact of the operator budget and type of initial solution, and is applied to problem instances with up to 1000 customers. The findings arising from our extensive benchmarking bear relevance to the wider literature of hybrid methods combining metaheuristics and machine learning.

组合优化强化学习图神经网络元启发式算法

阅读原文 ↗